Please use this identifier to cite or link to this item:

TitleAutomatic creation of stock market lexicons for sentiment analysis using StockTwits data
Author(s)Oliveira, Nuno
Cortez, Paulo
Areal, Nelson
KeywordsSentiment analysis
Opinion mining
Stock market
Microblogging data
Information retrieval
Issue date2014
Abstract(s)Sentiment analysis has been increasingly applied to the stock market domain. In particular, investor sentiment indicators can be used to model and predict stock market variables. In this context, the quality of the sentiment analysis is highly dependent of the opinion lexicon adopted. However, there is a lack of lexicons adjusted to microblogging stock market data. In this work, we propose an automatic procedure for the creation of such lexicon by exploring a large set of labeled messages from StockTwits, a popular financial microblogging service, and using four statistical measures: adaptations of the known TF-IDF, Information Gain, Class Percentage, and a newly proposed Weighted Class Probability. The obtained lexicons are competitive when compared with a set of six reference lexicons. Moreover, we verified that it is beneficial to use continuous sentiment scores instead of sentiment labels.
TypeConference paper
Publisher versionThe original publication is available at
AccessOpen access
Appears in Collections:CAlg - Artigos em livros de atas/Papers in proceedings
EEG - Comunicações e Conferências

Files in This Item:
File Description SizeFormat 
2014-ideas.pdf347,51 kBAdobe PDFView/Open

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID