S&P Global Offerings
Featured Topics
Featured Products
Events
S&P Global Offerings
Featured Topics
Featured Products
Events
S&P Global Offerings
Featured Topics
Featured Products
Events
S&P Global Offerings
Featured Topics
Featured Products
Events
Financial and Market intelligence
Fundamental & Alternative Datasets
Banking & Capital Markets
Economy & Finance
Energy Transition & Sustainability
Technology & Innovation
Podcasts & Newsletters
Financial and Market intelligence
Fundamental & Alternative Datasets
Banking & Capital Markets
Economy & Finance
Energy Transition & Sustainability
Technology & Innovation
Podcasts & Newsletters
Research — Sept 2025
By Mengmeng Ao, Frank Zhao, Ronen Feldman, Ilan Attar, Leonid Hatskin, and Benjamin Rozenfeld
Large language models (LLMs) have garnered widespread attention for their ability to understand natural language. Their application in equity investing, however, remains in its infancy due to novelty and cost. This study leverages LLM-extracted features from earnings call transcripts and transforms them into actionable stock selection signals. These features are significantly correlated with their traditional (rules-based) NLP counterparts — the lexicon-driven approach that flags and scores individual phrases as positive or negative — confirming that both approaches measure the same ‘ground truth’. The added computational cost and opaqueness of the LLMs appear justified: a fine-tuned LLM-based sentiment strategy would have delivered twice the long-short return performance (LLM 8.4% versus lexicon 4.2%), with a notable advantage in recent years as mispricing opportunities have narrowed.
Key findings in the US market since 2010 are:
Explore the data used to conduct this research:
ProntoNLP Transcript Analytics
The ProntoNLP Transcript Analytics dataset is a powerful new tool for analyzing Machine Readable Transcripts data. ProntoNLP efficiently processes and extracts insights around performance metrics, generating key indicators for future corporate performance across essential KPIs. By using Natural Language Processing and an optimized Large Language Model, ProntoNLP accurately recognizes and scores important phrases, helping you easily identify valuable information and separate it from the noise.
Transcripts is a global data set that was added to the S&P Global Market Intelligence's Xpressfeed product in September 2017. Among its key features, the data set captures the different segmentations of earnings calls in the follow ways:
TDA was launched in October 2019 and is productized from Quantitative Research & Solution's previous publications with an advanced suite of analytics and metrics added in May 2022. It is an off-the-shelf NLP solution that tailors to our Machine-Readable Transcripts and outputs over 800 predictive and descriptive analytics for equity investing and various data science workflows. The analytics could be accessed via SQL, Snowflake or (Databricks) Workbench.
Content Type
Segment