EMP-SA: Ensemble Model based Market Prediction using Sentiment Analysis
Anuradha Yenkikar1, Manish Bali2, Narendra Babu3 

1Anuradha Yenkikar, Research Scholar, Department of Computer Science, Ramaiah University of Applied Sciences, Bangalore, India.
2Manish Bali, Adjunct Professor, Department of Computer Science, Presidency University, Bangalore, India.
3Narendra Babu, Associate Professor, Department of Computer Science Engineering, Ramaiah University of Applied Sciences, Bangalore, India.

Manuscript received on 03 March 2019 | Revised Manuscript received on 08 March 2019 | Manuscript published on 30 July 2019 | PP: 6445-6452 | Volume-8 Issue-2, July 2019 | Retrieval Number: B2197078219/2019©BEIESP | DOI: 10.35940/ijrte.B2197.078219
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Predicting stock market trends is an extremely complicated task that calls for extensive study and insight into the market context. An investor's primary requirement is to assess the trend so as to invest in a way that maximizes returns. Advances in machine learning and data analytics in particular have changed the way investors can approach this problem. Sentiment analysis, or opinion mining, takes public sentiment about stock market conditions into consideration to understand the ups and downs of this highly volatile sector. In this paper, public sentiment from Twitter, along with news feeds related to stock market conditions, is used to analyse the stock market trend and predict the nature of the market. Data is collected from Twitter and various news sites to generate a gross sentiment score for the market. The gross sentiment score is used to find a correlation between market price and sentiment and to train the proposed prediction models, which use linear and robust regression techniques: Ordinary Least Squares (OLS), RANSAC, the Theil-Sen estimator, Huber regression and Ridge regression. An ensemble method is used instead of a single method to achieve more reliable and more accurate predictions. The ensemble method combines the models and carries out majority voting among them to produce one final model with higher prediction accuracy. The results reveal that public opinion has a significant impact on market behaviour, with prediction accuracy between 65% and 91% depending on the dataset.
Index Terms: Ensemble Method, Machine Learning, Opinion Mining, Sentiment Analysis.

Scope of the Article: Machine Learning
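
Illustrative sketch: the pipeline outlined in the abstract (a sentiment score as input, several regression models, a majority vote) might look roughly like the Python below. This is not the authors' implementation. The data are invented toy values, only three of the five named estimators (OLS, Ridge, Theil-Sen) are shown in their single-feature closed forms, and RANSAC and Huber regression are omitted for brevity. The abstract does not say how majority voting is applied to regression outputs, so voting on the predicted direction of the price change (up or down) is an assumption made here. All function and variable names are hypothetical.

```python
from itertools import combinations
from statistics import mean, median


def _moments(x, y):
    """Return means and centred sums shared by the OLS and Ridge fits."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    return mx, my, sxx, sxy


def fit_ols(x, y):
    """Ordinary least squares for one feature: (slope, intercept)."""
    mx, my, sxx, sxy = _moments(x, y)
    slope = sxy / sxx
    return slope, my - slope * mx


def fit_ridge(x, y, alpha=1.0):
    """Ridge regression for one feature; the intercept is not penalised."""
    mx, my, sxx, sxy = _moments(x, y)
    slope = sxy / (sxx + alpha)
    return slope, my - slope * mx


def fit_theil_sen(x, y):
    """Theil-Sen estimator: median of all pairwise slopes."""
    slopes = [
        (y[j] - y[i]) / (x[j] - x[i])
        for i, j in combinations(range(len(x)), 2)
        if x[j] != x[i]
    ]
    slope = median(slopes)
    intercept = median([yi - slope * xi for xi, yi in zip(x, y)])
    return slope, intercept


def predict_direction(models, score):
    """Majority vote over each model's predicted sign (+1 up, -1 down)."""
    votes = [1 if slope * score + intercept > 0 else -1
             for slope, intercept in models.values()]
    return 1 if sum(votes) > 0 else -1


# Toy data (assumed, not from the paper): gross sentiment score per day
# and the following price change; the last point is a mild outlier.
sentiment = [-0.8, -0.5, -0.2, 0.1, 0.4, 0.7, 0.9]
price_change = [-1.6, -1.1, -0.3, 0.2, 0.9, 1.3, 0.5]

models = {
    "ols": fit_ols(sentiment, price_change),
    "ridge": fit_ridge(sentiment, price_change),
    "theil_sen": fit_theil_sen(sentiment, price_change),
}

for name, (slope, intercept) in models.items():
    print(f"{name}: slope={slope:.3f}, intercept={intercept:.3f}")
print("vote for score 0.8:", predict_direction(models, 0.8))
print("vote for score -0.8:", predict_direction(models, -0.8))
```

With an odd number of models the vote cannot tie. On this toy data the Theil-Sen slope is pulled less by the outlier than the OLS slope, which is the usual motivation for including robust estimators in such an ensemble.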