Predicting IPO first-day returns: Evidence from machine learning analyses*

Gonul Colak, Mengchuan Fu, Iftekhar Hasan*

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

Abstract

Predicting IPO first-day returns is inherently challenging due to the wide range of contributing factors, each with distinct statistical properties. We assess the performance of several machine learning (ML) techniques and identify XGBoost as the most statistically effective model for forecasting first-day returns. Using a comprehensive set of 863 pre-IPO variables, our high-performing predictive model accurately estimates both the direction and magnitude of IPO first-day returns. The most influential predictors include underwriter agency measures, price revision, and the free-float fraction. Using a rolling-window predictive approach, the model demonstrates substantial practical value, generating approximately $300 billion in gains from IPOs with positive first-day returns and avoiding more than $22 billion in losses from those with negative returns over the 2000–2016 period.

Original languageEnglish
Article number107500
Peer-reviewed scientific journalJournal of Banking and Finance
Volume178
ISSN0378-4266
DOIs
Publication statusPublished - 10.06.2025
MoE publication typeA1 Journal article - refereed

Keywords

  • 512 Business and Management
  • eXtreme gradient boosting
  • Initial public offering
  • IPO first-day returns
  • IPO underpricing
  • Machine learning
  • Return prediction

Fingerprint

Dive into the research topics of 'Predicting IPO first-day returns: Evidence from machine learning analyses*'. Together they form a unique fingerprint.

Cite this