Machine Learning Investigation of Retail Demand Shocks, ETF Investing, and Limits to Arbitrage
Keywords:
Retail demand shocks; Exchange-traded funds; Limits to arbitrage; Machine learning; ETF mispricing; Investor sentiment; Order flow; Asset pricing
Abstract
This paper examines the relationship between retail demand shocks, exchange-traded fund (ETF) mispricing, and the constraints that hinder arbitrageurs from correcting the resulting price deviations. We estimate a set of machine learning (ML) models on a large sample of US-listed equity ETFs from January 2015 to December 2023. Using granular retail order flow data classified with the odd-lot imbalance methodology of Boehmer et al. (2021), together with sentiment indices based on Reddit and Google Trends, we construct time-varying demand shock proxies and feed them into gradient-boosted tree models (XGBoost), long short-term memory (LSTM) neural networks, and random forests, which we compare against penalised linear regressions. All models are evaluated with an expanding-window out-of-sample scheme that preserves temporal ordering and avoids look-ahead contamination. We find that retail demand shocks are the most informative predictors of short-horizon ETF premium and discount dynamics: the ML models achieve out-of-sample R² values that exceed the linear benchmarks by 8 to 14 percentage points and support a long-short arbitrage strategy with an annualised Sharpe ratio of 1.47. Importantly, the predictive advantage is concentrated in periods of high market volatility, constrained authorised participant balance sheets, and high short interest, exactly the circumstances in which classical limits-to-arbitrage theory predicts that professional capital will be slow to correct mispricing. These findings contribute to a growing literature linking retail investor heterogeneity to institutional arbitrage capacity, and they offer practitioners tools for identifying when ETF mispricing is likely to persist rather than correct.
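To make the evaluation design concrete, the following is a minimal Python sketch of an expanding-window out-of-sample loop, an out-of-sample R² measure, and an annualised Sharpe ratio. It is an illustration under stated assumptions, not the paper's implementation: the function names, the synthetic data, the one-variable least-squares model standing in for the ML models, the historical-mean benchmark used in the R² definition, and the 252-period annualisation are all hypothetical choices that the abstract does not specify.

```python
import random
from math import sqrt
from statistics import mean, stdev


def etf_premium(price, nav):
    """Premium (positive) or discount (negative) of the ETF price relative to NAV."""
    return (price - nav) / nav


def expanding_window_splits(n_obs, min_train, step=1):
    """Yield (train, test) index ranges; the training window only ever grows,
    and every test index lies strictly after every training index."""
    start = min_train
    while start < n_obs:
        end = min(start + step, n_obs)
        yield range(0, start), range(start, end)
        start = end


def oos_r2(actual, predicted, benchmark):
    """Out-of-sample R^2 relative to a benchmark forecast (assumed definition)."""
    sse_model = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    sse_bench = sum((a - b) ** 2 for a, b in zip(actual, benchmark))
    return 1.0 - sse_model / sse_bench


def annualised_sharpe(returns, periods_per_year=252):
    """Mean over sample standard deviation, scaled by sqrt(periods per year)."""
    return mean(returns) / stdev(returns) * sqrt(periods_per_year)


def fit_ols(x, y):
    """One-variable least squares; a stand-in for the paper's ML models."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    return my - slope * mx, slope


def run_demo(n_obs=200, min_train=50, seed=0):
    """Synthetic example: the next-period premium depends on today's demand shock."""
    rng = random.Random(seed)
    shock = [rng.gauss(0.0, 1.0) for _ in range(n_obs)]
    premium = [0.5 * s + 0.1 * rng.gauss(0.0, 1.0) for s in shock]

    actual, predicted, benchmark = [], [], []
    for train, test in expanding_window_splits(n_obs, min_train, step=10):
        x_train = [shock[i] for i in train]
        y_train = [premium[i] for i in train]
        intercept, slope = fit_ols(x_train, y_train)
        hist_mean = mean(y_train)  # benchmark uses training data only
        for i in test:
            actual.append(premium[i])
            predicted.append(intercept + slope * shock[i])
            benchmark.append(hist_mean)
    return oos_r2(actual, predicted, benchmark)


if __name__ == "__main__":
    print("Out-of-sample R^2 on synthetic data:", run_demo())
```

Because each model is refitted using only observations that precede the test window, no future information enters the forecast, which is the property the expanding-window scheme is meant to guarantee.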
- Aniedi Ojo, Victoria Enoc-Ahiamadu, Lawrence Abakah, Emurode Williams, Deborah Warmate, Machine Learning Investigation of Retail Demand Shocks, ETF Investing, and Limits to Arbitrage, Communication In Physical Sciences: Vol. 10 No. 3: VOLUME 10 ISSUE 3 (2023-2024)



