The financial_phrasebank dataset is a polar sentiment dataset of 4,840 sentences drawn from English-language financial news. Each sentence is labeled by sentiment (positive, negative, or neutral), and the dataset is split into subsets according to the rate of agreement among the 5-8 annotators who labeled each sentence. It is designed for sentiment classification in the financial domain.
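The agreement-based split can be illustrated with a minimal sketch. It assumes each sentence comes with a list of per-annotator labels and that a sentence belongs to every subset whose threshold its agreement rate meets. The subset names, the threshold values, and the helper functions below are assumptions made for illustration, not the curators' actual code.

```python
from collections import Counter

# Assumed thresholds: the fraction of annotators who must share the
# majority label for a sentence to be included in that subset.
THRESHOLDS = {"50agree": 0.50, "66agree": 0.66, "75agree": 0.75, "allagree": 1.00}


def majority_and_agreement(labels):
    """Return (majority_label, agreement_rate) for one sentence's annotations."""
    counts = Counter(labels)
    label, votes = counts.most_common(1)[0]
    return label, votes / len(labels)


def subsets_for(labels):
    """Return the names of all subsets whose threshold the sentence meets."""
    _, rate = majority_and_agreement(labels)
    return [name for name, threshold in THRESHOLDS.items() if rate >= threshold]


# Hypothetical annotations from 5 annotators for a single sentence.
example = ["positive", "positive", "positive", "positive", "neutral"]
print(majority_and_agreement(example))  # ('positive', 0.8)
print(subsets_for(example))             # ['50agree', '66agree', '75agree']
```

Under these assumptions, a sentence with 80% agreement lands in the three looser subsets but not in the unanimous one, which is why the stricter subsets are smaller.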
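The downloaded dataset files total 2.73 MB, and the auto-generated Parquet files total 1.17 MB. The dataset contains 14,780 rows in total. The row count exceeds the 4,840-sentence count because rows are counted across all agreement-level subsets, and a sentence can appear in more than one of them.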
The financial_phrasebank dataset is primarily intended for training and benchmarking machine learning models that classify the sentiment of sentences from financial news, including comparisons of alternative modeling techniques. Potential use cases include stock market prediction, sentiment analysis of financial reports, and sentiment-based investment decision-making.
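As a rough illustration of training a sentiment classifier on sentence and label pairs, the sketch below fits a small multinomial Naive Bayes model with add-one smoothing using only the Python standard library. The toy sentences are hypothetical and are not taken from the dataset, and the function names are illustrative. A real benchmark would load the actual data and would likely use a stronger model.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(examples):
    """Fit a multinomial Naive Bayes model on (sentence, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for sentence, label in examples:
        tokens = sentence.lower().split()
        label_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return label_counts, word_counts, vocab


def predict(model, sentence):
    """Return the label with the highest smoothed log-probability."""
    label_counts, word_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in label_counts.items():
        score = math.log(n / total)  # log prior
        denom = sum(word_counts[label].values()) + len(vocab)
        for token in sentence.lower().split():
            if token in vocab:  # ignore words never seen in training
                score += math.log((word_counts[label][token] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy examples written in the style of financial news.
toy = [
    ("operating profit rose compared with last year", "positive"),
    ("net sales increased and profit rose", "positive"),
    ("operating loss widened compared with last year", "negative"),
    ("net sales decreased and the loss widened", "negative"),
    ("the company is based in helsinki", "neutral"),
    ("the shares are listed in helsinki", "neutral"),
]
model = train_naive_bayes(toy)
print(predict(model, "profit rose"))  # positive
```

A bag-of-words baseline like this is mainly useful as a reference point: more elaborate models can be judged by how much they improve on it when evaluated on the same subset.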
The financial_phrasebank dataset is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License (CC BY-NC-SA 3.0), which permits non-commercial use and sharing of the dataset. Commercial use requires contacting the dataset curators for an appropriate license.