Does High Frequency Social Media Data Improve Forecasts of Low Frequency Consumer Confidence Measures?
Social media data presents challenges for forecasters since one must convert text into data and deal with issues related to these measures being collected at different frequencies and volumes than traditional financial data. In this paper, we use a deep learning algorithm to measure sentiment within Twitter messages on an hourly basis and introduce a new method to undertake MIDAS that allows for a weaker discounting of historical data that is well-suited for this new data source. To evaluate the performance of approach relative to alternative MIDAS strategies, we conduct an out of sample forecasting exercise for the consumer confidence index with both traditional econometric strategies and machine learning algorithms. Irrespective of the estimator used to conduct forecasts, our results show that (i) including consumer sentiment measures from Twitter greatly improves forecast accuracy, and (ii) there are substantial gains from our proposed MIDAS procedure relative to common alternatives.
Published Versions
Steven Lehrer & Tian Xie & Tao Zeng, 2021. "Does High-Frequency Social Media Data Improve Forecasts of Low-Frequency Consumer Confidence Measures?," Journal of Financial Econometrics, vol 19(5), pages 910-933. citation courtesy of