Abstract
Predicting IPO first-day returns is inherently challenging due to the wide range of contributing factors, each with distinct statistical properties. We assess the performance of several machine learning (ML) techniques and identify XGBoost as the most statistically effective model for forecasting first-day returns. Using a comprehensive set of 863 pre-IPO variables, our high-performing predictive model accurately estimates both the direction and magnitude of IPO first-day returns. The most influential predictors include underwriter agency measures, price revision, and the free-float fraction. Using a rolling-window predictive approach, the model demonstrates substantial practical value, generating approximately $300 billion in gains from IPOs with positive first-day returns and avoiding more than $22 billion in losses from those with negative returns over the 2000–2016 period.
| Original language | English |
|---|---|
| Article number | 107500 |
| Peer-reviewed scientific journal | Journal of Banking and Finance |
| Volume | 178 |
| ISSN | 0378-4266 |
| DOIs | |
| Publication status | Published - 10.06.2025 |
| MoE publication type | A1 Journal article - refereed |
Keywords
- 512 Business and Management
- eXtreme gradient boosting
- Initial public offering
- IPO first-day returns
- IPO underpricing
- Machine learning
- Return prediction