TY - JOUR
T1 - Using machine learning to develop customer insights from user-generated content
AU - Mustak, Mekhail
AU - Hallikainen, Heli
AU - Laukkanen, Tommi
AU - Plé, Loïc
AU - Hollebeek, Linda D.
AU - Aleem, Majid
N1 - Publisher Copyright:
© 2024 The Authors
PY - 2024/8/15
Y1 - 2024/8/15
N2 - Uncovering customer insights (CI) is indispensable for contemporary marketing strategies. The widespread availability of user-generated content (UGC) presents a unique opportunity for firms to gain a nuanced understanding of their customers. However, the size and complexity of UGC datasets pose significant challenges for traditional market research methods, limiting their effectiveness in this context. To address this challenge, this study leverages natural language processing (NLP) and machine learning (ML) techniques to extract nuanced insights from UGC. By integrating sentiment analysis and topic modeling algorithms, we analyzed a dataset of approximately four million X posts (formerly tweets) encompassing 20 global brands across industries. The findings reveal primary brand-related emotions and identify the top 10 keywords indicative of brand-related sentiment. Using FedEx as a case study, we identify five prominent areas of customer concern: parcel tracking, small business services, the firm's comparative performance, package delivery dynamics, and customer service. Overall, this study offers a roadmap for academics to navigate the complex landscape of generating CI from UGC datasets. It thus raises pertinent practical implications, including boosting customer service, refining marketing strategies, and better understanding customer needs and preferences, thereby contributing to more effective, more responsive business strategies.
AB - Uncovering customer insights (CI) is indispensable for contemporary marketing strategies. The widespread availability of user-generated content (UGC) presents a unique opportunity for firms to gain a nuanced understanding of their customers. However, the size and complexity of UGC datasets pose significant challenges for traditional market research methods, limiting their effectiveness in this context. To address this challenge, this study leverages natural language processing (NLP) and machine learning (ML) techniques to extract nuanced insights from UGC. By integrating sentiment analysis and topic modeling algorithms, we analyzed a dataset of approximately four million X posts (formerly tweets) encompassing 20 global brands across industries. The findings reveal primary brand-related emotions and identify the top 10 keywords indicative of brand-related sentiment. Using FedEx as a case study, we identify five prominent areas of customer concern: parcel tracking, small business services, the firm's comparative performance, package delivery dynamics, and customer service. Overall, this study offers a roadmap for academics to navigate the complex landscape of generating CI from UGC datasets. It thus raises pertinent practical implications, including boosting customer service, refining marketing strategies, and better understanding customer needs and preferences, thereby contributing to more effective, more responsive business strategies.
KW - 512 Business and Management
KW - Artificial intelligence
KW - Big data
KW - Customer insights
KW - Machine learning
KW - Marketing
KW - Natural language processing
KW - NLP
KW - Sentiment analysis
KW - Topic modeling
KW - UGC
KW - User-generated content
UR - http://www.scopus.com/inward/record.url?scp=85201377662&partnerID=8YFLogxK
U2 - 10.1016/j.jretconser.2024.104034
DO - 10.1016/j.jretconser.2024.104034
M3 - Article
AN - SCOPUS:85201377662
SN - 0969-6989
VL - 81
JO - Journal of Retailing and Consumer Services
JF - Journal of Retailing and Consumer Services
M1 - 104034
ER -