AM Vol.9 No.1, January 2018
Investigating Relationship between Google Index and Corporate Profit Using Random Forest
Automatic analysis of financial figures is a common way for investors to analyze financial reports. However, financial statements alone do not tell the full financial story of a company. Many people now express their opinions and search for information on the Internet, and this activity has generated another type of data for analysis: the Google Index. The purpose of this research is to show that the Google Index is a good indicator for investors analyzing a company's status. In this study, random forest (RF) is used to investigate the relationship between a company's financial performance and two groups of predictors: financial ratios and the Google Index. The results of the RF model show that, besides the stock index and operating margin, Google Trends also plays a major role in determining a company's profit.
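One common way to read such a relationship from a random forest is permutation importance: shuffle one predictor at a time and measure how much the prediction error grows. The abstract does not say which importance measure the authors used, so the following is only a minimal pure-Python sketch under stated assumptions. The data are synthetic, the feature names (`stock_index`, `operating_margin`, `google_index`, `noise`) and the coefficients are hypothetical, and importance is measured on the training sample for brevity, whereas Breiman's formulation uses out-of-bag rows.

```python
import random

def mean(ys):
    return sum(ys) / len(ys)

def sse(ys):
    """Sum of squared deviations from the mean."""
    m = mean(ys)
    return sum((y - m) ** 2 for y in ys)

def best_split(rows, ys, features, min_leaf):
    """Return the (feature, threshold) with the lowest squared error, or None."""
    best, best_err = None, sse(ys)
    for f in features:
        values = sorted({r[f] for r in rows})
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [y for r, y in zip(rows, ys) if r[f] <= t]
            right = [y for r, y in zip(rows, ys) if r[f] > t]
            if len(left) < min_leaf or len(right) < min_leaf:
                continue
            err = sse(left) + sse(right)
            if err < best_err:
                best, best_err = (f, t), err
    return best

def build_tree(rows, ys, rng, n_try, depth, min_leaf):
    """Grow a regression tree: a leaf is a float, a node is (feature, threshold, left, right)."""
    if depth == 0:
        return mean(ys)
    # Random forests consider only a random subset of features at each split.
    features = rng.sample(range(len(rows[0])), n_try)
    split = best_split(rows, ys, features, min_leaf)
    if split is None:
        return mean(ys)
    f, t = split
    left = [(r, y) for r, y in zip(rows, ys) if r[f] <= t]
    right = [(r, y) for r, y in zip(rows, ys) if r[f] > t]
    return (f, t,
            build_tree([r for r, _ in left], [y for _, y in left], rng, n_try, depth - 1, min_leaf),
            build_tree([r for r, _ in right], [y for _, y in right], rng, n_try, depth - 1, min_leaf))

def fit_forest(rows, ys, n_trees=20, n_try=2, depth=4, min_leaf=5, seed=0):
    """Fit each tree on a bootstrap sample (rows drawn with replacement)."""
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]
        forest.append(build_tree([rows[i] for i in idx], [ys[i] for i in idx],
                                 rng, n_try, depth, min_leaf))
    return forest

def predict(forest, row):
    """Average the predictions of all trees."""
    total = 0.0
    for tree in forest:
        while isinstance(tree, tuple):
            f, t, left, right = tree
            tree = left if row[f] <= t else right
        total += tree
    return total / len(forest)

def mse(forest, rows, ys):
    return mean([(predict(forest, r) - y) ** 2 for r, y in zip(rows, ys)])

def permutation_importance(forest, rows, ys, seed=1):
    """Increase in MSE after shuffling each feature column in turn."""
    rng = random.Random(seed)
    base = mse(forest, rows, ys)
    scores = []
    for f in range(len(rows[0])):
        column = [r[f] for r in rows]
        rng.shuffle(column)
        shuffled = [r[:f] + [v] + r[f + 1:] for r, v in zip(rows, column)]
        scores.append(mse(forest, shuffled, ys) - base)
    return scores

# Synthetic illustration only: names and coefficients are hypothetical, not from the paper.
NAMES = ["stock_index", "operating_margin", "google_index", "noise"]
data_rng = random.Random(42)
X = [[data_rng.random() for _ in NAMES] for _ in range(120)]
profit = [3.0 * r[0] + 2.0 * r[1] + 1.5 * r[2] + data_rng.gauss(0, 0.1) for r in X]

forest = fit_forest(X, profit)
importance = dict(zip(NAMES, permutation_importance(forest, X, profit)))
for name, score in sorted(importance.items(), key=lambda kv: -kv[1]):
    print(f"{name:18s} {score:.3f}")
```

In this toy setup the predictor that does not enter the profit formula (`noise`) should receive the smallest score, which is the kind of ranking the abstract appeals to when it says that Google Trends plays a major role alongside the stock index and operating margin.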
Cite this paper: Yuan, F. and Lee, C. (2018) Investigating Relationship between Google Index and Corporate Profit Using Random Forest. Applied Mathematics, 9, 35-43. doi: 10.4236/am.2018.91004.
