Machine Learning Investigation of Retail Demand Shocks, ETF Investing, and Limits to Arbitrage
Keywords:
Retail demand shocks; Exchange-traded funds; Limits to arbitrage; Machine learning; ETF mispricing; Investor sentiment; Order flow; Asset pricingAbstract
: The paper explores the connection between the demand shocks within the retail sector, exchange traded fund (ETF) mispricing and the constraints that hindered the ability of the arbitrageurs to correct the said deviations based on a set of machine learning (ML) models estimated on a large sample of equity ETFs listed in the United States between the years 2015 (January) and 2023 (December). Using granular retail order flow data broken down through the odd-lot imbalance methodology of Boehmer et al. (2021), social media sentiment indices based on Reddit and Google Trends, we create time-varying demand shock proxies and incorporate them into gradient-boosted tree models (XGBoost) and long short-term memory (LSTM) neural networks and random forests compared to penalised linear regressions. Evaluations based on expanding-window out-ofsample scheme that maintains temporal sequence and removes look-ahead contamination are applied to models. We find that the most informative predictors of short-horizon ETFs premium and discount dynamics are retail demand shocks, which yield out-of-sample R2 values exceeding linear benchmarks (8 to 14 percentage) and long-short arbitrage strategy (annualised Sharpe ratio of 1.47). Significantly, the predictive advantage is concentrated: it is concentrated during periods of large market volatility, constrained by authorised participants balance sheets, and large short interest; exactly the circumstances when classical limitsto-arbitrage theory hypothesises that professional capital will be gradual to rectify mispricings. These findings form part of an increasing literature relating retail investor heterogeneity to the presence of institutional arbitrage capacity and they offer practitioner-valued instruments to identify when ETF mispricing is probable not to end but to continue
Most read articles by the same author(s)
- Adebayo Adegbenro, Arinze Madueke, Aniedi Ojo, Cynthia Alabi, AI-Driven Wealth Advisory: Machine Learning Models for Personalized Investment Portfolios and Risk Optimization , Communication In Physical Sciences: Vol. 8 No. 4 (2022): VOLUME 8 ISSUE 4
- Abdulateef Oluwakayode Disu, Henry Makinde, Olajide Alex Ajide, Aniedi Ojo, Martin Mbonu, Artificial Intelligence in Investment Banking: Automating Deal Structuring, Market Intelligence, and Client’s Insights Through Machine Learning , Communication In Physical Sciences: Vol. 8 No. 4 (2022): VOLUME 8 ISSUE 4
- Emurode Williams, Lawrence Abakah, Aniedi Ojo, Chidinma Jonah, AI-Driven Analysis of Information Processing Capacity and Financial Stability in Delegated Asset , Communication In Physical Sciences: Vol. 9 No. 4 (2023): VOLUME 9 ISSUE 4
- Emurode Williams, Victoria Enoc-Ahiamadu, Lawrence Abakah, Aniedi Ojo, Decentralized Finance (DeFi) as a Catalyst for SME Resilience , Communication In Physical Sciences: Vol. 10 No. 3: VOLUME 10 ISSUE 3 (2023-2024)
Similar Articles
- Joy Nnenna Okolo, A Review of Machine and Deep Learning Approaches for Enhancing Cybersecurity and Privacy in the Internet of Devices , Communication In Physical Sciences: Vol. 9 No. 4 (2023): VOLUME 9 ISSUE 4
- Confidence Ifeoma Odoh, Nweze Rosemary Chika Nweze, Ukamaka Victoria Maduahonwu, Development of an Enhanced Predictive Maintenance Models for Industrial Systems using Deep Learning Techniques , Communication In Physical Sciences: Vol. 13 No. 1 (2026): VOLUME 13 ISSUE 1
- Ademilola Olowofela Adeleye, Oluwafemi Clement Adeusi, Aminath Bolaji Bello, Israel Ayooluwa Agbo-Adediran, Intelligent Machine Learning Approaches for Data-Driven Cybersecurity and Advanced Protection , Communication In Physical Sciences: Vol. 7 No. 4 (2021): VOLUME 7 ISSUE 4
- Olaleye Ibiyeye, Joy Nnenna Okolo, Samuel Adetayo Adeniji, A Comprehensive Evaluation of AI-Driven Data Science Models in Cybersecurity: Covering Intrusion Detection, Threat Analysis, Intelligent Automation, and Adaptive Decision-Making Systems , Communication In Physical Sciences: Vol. 8 No. 4 (2022): VOLUME 8 ISSUE 4
- Aramide Ajayi, Anuoluwapo Rogers, Emmanuel Egyam, Justin Nnam, Chidinma Jonah, Leveraging Machine Learning for Predictive Analytics in Mergers and Acquisitions: Valuation, Risk Assessment, and Post-Merger Performance , Communication In Physical Sciences: Vol. 8 No. 4 (2022): VOLUME 8 ISSUE 4
- Bright Okore Osu, Prisca Udodiri Duruojinkeya, The Modeling of the Worth of an Asset Using a Skew Random Pricing Tree , Communication In Physical Sciences: Vol. 10 No. 1 (2023): VOLUME 10 ISSUE 1
- Edith Agberxonu, Abdulateef Disu, Chidin Dike, Toyosi Mustapha, Lawrence Abakah, Machine Learning and Artificial Intelligence in FinTech: Driving Innovation in Digital Payments, Fraud Detection, and Financial Inclusion , Communication In Physical Sciences: Vol. 9 No. 4 (2023): VOLUME 9 ISSUE 4
- Dahunsi Samuel Adeyemi , Autonomous Response Systems in Cybersecurity: A Systematic Review of AI-Driven Automation Tools , Communication In Physical Sciences: Vol. 9 No. 4 (2023): VOLUME 9 ISSUE 4
- Abubakar Tahiru, Oluwasanmi M. Odeniran, Shardrack Amoako, Developing Artificial Intelligence-Powered Circular Bioeconomy Models That Transform Forestry Residues into High-Value Materials and Renewable Energy Solutions , Communication In Physical Sciences: Vol. 8 No. 4 (2022): VOLUME 8 ISSUE 4
- Gideon Wyasu, Determination of Bacteriological and some physicochemical properties of Hospital wastewater , Communication In Physical Sciences: Vol. 4 No. 2 (2019): VOLUME 4 ISSUE 2
You may also start an advanced similarity search for this article.



