A capital market is a part of the financial system in which debt or equity backed securities are bought and sold. The buying and selling of the equities are generally carried out by the individuals or the institutions. As a trend, these trades are generally long term in nature, which have locks in a period of more than a year. The capital market is a much broader landscape which also includes stock markets which are used to trade the securities. In one of the papers (Hilbert  on Big Data for development), he stresses that Big Data delivers the very cost-effective prospect to improve decision-making in critical economic development areas such as economic development, health care, security and resource management through effective decision making process. The use of Big Data to improve the effectiveness of decision-making process also contributes to the theory of economic development. At the verge of big data, crossing Moore’s chasm as the capital markets has started focusing on the unstructured data for new avenues for innovation and has offered immense opportunities for growth and sustainability. The concept of “Big” in a retail or scientific environment is different from what is considered to be big in a capital market context. The capital markets tend to deal largely with structured data sets from a more limited set of defined sources―market data vendors, market infrastructures, and counterparties. Some unstructured datasets have, however, become important to capital markets institutions in areas such as sentiment analysis, market surveillance for profiling certain trends and fraud prevention, improving the predictability, maturity and activities in the market and within firms. But these were not traditionally being the data sets of primary importance to the business. There has always been a gap or chasm among innovators, early adopters and laggards for the adoption of discontinuous or disruptive technologies, and that’s why Big Data is today in capital markets.
One of the milestone finance theoretical premises published by Fama and Malkiel in Journal of Finance in 1970  , with the title “Efficient Capital Markets: A Review of Theory and Empirical Work”, changed the full perspective of all the scholars and practitioners looking for the predictability and forecasting patterns for the stock market. Fama  suggested three levels of the market efficiency “Weak-form” efficiency were security prices, reflecting all information found in the record of past prices and volumes. Prices in “Semi-strong form” of efficiency reflected not only all information found in the record of past prices and volumes but also all other publicly available information whereas prices in “Strong-form” efficiency reflected all the available public information as well as private information. In another milestone paper “Market Efficiency, Long-Term Returns, and Behavioral Finance” in 1998, Fama  observed that in line with the market efficiency hypothesis that the capital market anomalies were chance results, the apparent overreaction to information was as common as under reaction, and post-event continuation of pre-event abnormal returns was about as frequent as post-event reversal. He also mentioned that in line with the market efficiency prediction that apparent anomalies could be due to methodology, most long-term return anomalies tended to disappear with reasonable changes in technique.
In the recent days, massive eruptions of the information flow, technological changes and availability of structured and unstructured data for consumption have generated a fresh interest in looking into the efficiency and maturity of the capital market from the fresh perspective. This brings a great intersection of the theoretical premise of the efficient market hypothesis (EMH), Random walk theory  , circulation of the massive data flow and its consumption through the newer technologically landscape of Big Data. To support this finding, previous scholars such as Singh  concluded that adoption of new technologies such as Big Data could drive efficiency and promote productivity. Singh  and Singh  explained that R&D helped to create a vehicle to drive these synergies as technology spillovers. To explain their findings, Singh  argued that large organizations needed to adopt Big Data aligned information technology strategies in order to reduce costs and their reliance on large manual manpower. One of the strategies for the merger is driven by the technical advancements (Singh  and Singh  ), which can be further refined and matured through Big Data strategies.
The two basic principles of the financial world are foundation and basic premise to carry out the study further Attigeri  : 1) Profit cannot be generated out of nothing and 2) No opportunities for arbitrage i.e. there is no potential to generate profit without risk which is also called “No Arbitrage principle”. The concept of the efficient capital market stemmed from an incidental discovery by a British statistician Kendall  his paper “The Analysis of Economic Time-Series―Part 1: Prices” in 1953  on the behavior of stock and commodity prices. Though the Kendall idea was not completely new, similar idea was proposed in almost forgotten thesis by written 53 years earlier by French doctoral student Bachelier  , accompanying development of the mathematical theory of random processes in reference to Einstein famous work on the Random Brownian Motion of colliding gas molecules1. Kendal had expected to find the regular price cycles but he was surprised to observe that they did not seem to exist and he conceived the thought of “Random Walk”. The Random walk theory  and Attigeri  stressed that changes in the stock prices are independent of each other, so the past movement or trend of a market cannot be used to predict the future movement. It is expressed as
where v(n): price of a stock at a time n and : price of the stock at a time .
: change in the price of the stock at a time n, d(n): dividend at time n, and c(n): adjustment term at time n.
Since c(n) encompasses the impact of all publicly and privately available information on the stock, which makes the prediction of the pattern of very complicated.
While Kendall suggested that stock prices follow a random walk, he implied that the price changes are independent of one another just as the gains and losses in coin tossing game are independent.
In general, investment decisions are aimed to maximize the NPV (Net Present Value), however it is more difficult to add value through the financing decisions because the capital markets are generally very competitive and efficient. In one of literature of the financial economics, Fama introduced the concept of “efficient” capital market which is understood in the normal way that capital markets are generally very competitive and efficient. This was indeed a sweeping hypothesis which is discussed and challenged even today. A efficient market is understood as one in which the market price of the security is an unbiased estimate of its intrinsic value though market price does not equals to intrinsic value at every point in time. This means that price can deviate from intrinsic value but the deviations are random and uncorrelated with any observable variable. If the deviations of market price from intrinsic value are random, it is not possible to consistently identify over or under valued securities. While working on the theme of “Efficiency” Fama  mentioned in his paper “Agency Problems and the theory of the firm” that how the separation of security/equity ownership and control, typically of large corporations, can be an efficient form of economic organization.
There are various misconceptions about the efficient market hypothesis one of the very prominent one is that efficient market theory implies that market has perfect forecasting abilities but in reality this hypothesis only implies that security prices impound all available information which does not mean that the market possesses the forecasting abilities.
There are expert discussions and views on the ongoing efficient market hypothesis which fundamentally implies that market establishes the right price of the equity however the alternative paradigm suggests that there is often a divergence between the fundamental value and market price. Eminent experts on this subjects like Graham & Dodd, Irwin, Manmohan Singh, William Baumol, LC Gupta etc have observed that “Knowledge” and “Information” is only one ingredient in arriving at the stock’s price, it is also dependent on bullish enthusiasm, influence or bullish disillusionment etc.2
RVL Cooper  mentioned that there is contradiction between the quantity theory of money and capital market efficiency. The relationship between the money supply and equity prices refutes the other findings of the capital market efficiency. Using the quantity theory of money, Sprinkel  finds that money supply changes could be used to predict the stock prices. However if the capital market is efficient, past information like money supply cannot be used for predicting the stock prices.
Timmermann and Granger  mentioned that the efficient market hypothesis gives rise to forecasting tests in the context of a given past information set. However, there are also important differences arising from the fact that market efficiency tests rely on establishing profitable trading opportunities in “real time”. As the Forecasters continuously try for the predictable patterns and affect prices when they attempt to exploit trading opportunities. Stable forecasting patterns are therefore unlikely to persist for long periods of time and will self-destruct when discovered by a large number of investors. This gives rise to non-stationarities in the time series of financial returns and complicates the formal tests of market efficiency and the search for successful forecasting pattern.
Based upon all the arguments and discussion, it seems that Efficient Market Hypothesis like all theories is an imperfect and provides limited description of the stock market. However at least for the present, it seems to be better alternative. This hypothesis is too broad and too flexible to be rejected.
2.1. Finding of Key Papers: Efficient Market Hypothesis
2.2. Capital Market and Economy
In the current landscape of the economies, the world’s economy could be broadly divided in three categories3 based upon the income in three categories Low income, Middle income and High income. In the similar way, the world’s capital markets could be divided in three broad categories based upon the efficiency of the capital markets like Weak form of efficiency, Semi strong form of efficiency and Strong form of efficiency. In general, the developed economies like USA shows the characteristics of the strong form of the efficiency however the developing economies like India shows the characteristics of semi strong form of the efficiency.
3. Social Media and Emerging Technologies
The big data strategy and algorithms does use the power of high capacity computing to affect the high frequency trading which improves the efficiency in the market however high frequency trading also poses many regulatory challenges for the security and exchange commission. Social media and microblogs affect the risk appetite of the investors. The sentiment and decision-making pattern of the investors are influenced by the continuous flows of the information through the social media which affects the capital markets.
Within the financial services sector, Big Data has gained far more traction within retail banking and insurance sector due to the increasing desire of these financial institutions to profile their customers in a similar manner to early adopters of Big Data strategy such as Amazon, Baidu or Google. Big data is defined as, “data sets with sizes beyond the ability of commonly used software tools to capture, curate, manage, and process data” (Snijders et al.)  . However, Big Data strategies have begun to make significant impact in a selected few areas of the capital markets including the sentiment analysis for trading and growth in volume, risk analytics, fraud prevention and market surveillance. The global financial markets were fragmented due to rapid globalization and technological changes (Funk et al.)  . In one of study on liquidity, Blocher et al.  identified three key components of the financial market like 1) fund management for long term investors, 2) low-frequency trading (LFT) by the traditional brokers and 3) and high-frequency trading (HFT) by proprietary financial firms which used the Big Data and aggregated information to set the trading strategy. The capital market industry has varied data source, which included structured data like traditional banking transaction data and market data which is also called the “system of records”. At the same time, it also generates mammoth volume of unstructured data through corporate news, feeds, blogs, micro blogs, macro and micro economic indicators, social media updates and contents. In recent years, capital markets have gone through an unprecedented change, resulting in the generation of massive amounts of high-velocity and heterogeneous data. Similar trends could be observed in the financial services sector as well, where Big Data has been increasingly becoming the most significant, promising, and differentiating asset for the financial services enterprises (Seth et al.)  . These massive data troves could be processed through Big Data strategy, tools and techniques which could be game changer. Traditional tools cannot process such large datasets, however the Big Data based approach can analyze structured and un-structured data and create logical pattern to help business to take decisions. The speed and agility of this processing is exponentially faster than that could be done by the traditional mechanism. This provides near real time information with the actionable intelligence which could be used in the decision-making process. The analytical and predictive power of information generated from online big data for the capital market activity is supported by numerous studies ranging from stock markets to housing markets.
Big Data strategy could help the capital market to address key use cases in the area of 1) Trading Strategy, 2) Reporting, 3) Compliance, 4) productivity improvement and maturity and 5) Operational simplifications. Investment banks have an untapped opportunity to use Big Data to solve many of their business problems. The age of Big Data offers new creative opportunities across the board. For finance researchers, Big Data allows the field to settle old debates and discover new puzzles. For investment practitioners, the Big Data approach offers limitless possibilities to gain informational advantage over the competition by cleverly analyzing public data sources. Big Data strategies in the capital markets tend to be synonymous with analytical tasks or those related to reporting or governance functions but in the in the recent years the consumption of text-based, audio & video unstructured data had also been a significant driver for some projects. Regulatory, client, and internal drivers have forced most firms to re-evaluate the core reference data sets on which they are basing their trading, risk management, prediction, forecasting and operational decisions.
3.1. Capital Markets and Emerging Technologies
Big Data had been a much misused and misunderstood term within the financial services industry and capital markets for some time. A data driven culture is driving the research field in Finance and Management since the advent of structured, semi-structured and unstructured socio-economic and demographic information from social media, mobile devices, blogs and microblogs. Big Data which encompasses this revolution, involves the usage of new tools for financial professionals and academic researchers due to the size of data involved. In the capital market, the technology focus is often towards improving real-time systems, transaction processing, operational efficiency and establishing data warehouses of structured data. The adoption of discontinuous or disruptive technologies, there is a gap or chasm between innovators and early adopters and that is where big data is today in capital markets. Some unstructured data sets have however, become important to capital markets institutions in areas such as sentiment analysis and market surveillance for profiling certain trends and activities in the market and within firms, but there have not traditionally been the data sets of primary importance to the business. The Big Data refers to data that could be categorized in the four categories: 1) Volume & Quantity of the data generated, 2) Velocity & Speed at which data is generated, 3) Variety & Types of data like test, audio and video etc. and 4) Veracity & Integrity and quality of the data being processed. In the context of Big Data, it was represented by 4 Vs (Variety, Velocity, Veracity and Volume) but while understanding its context in terms of capital market, it is important to consider the other attributes of the data (Variability, Value and Visualization). These 7 Vs made the complete view of the data in the context of capital markets. The large-scale enterprise level NLPs affect the capital market strategy for the HFTs and LFTs. These new emerging technologies and cutting edge technologies are bound to have significant impact on the capital markets in the developed and emerging economies. Big data platform has been in the forefront to bring innovation in the areas of Data Science, Artificial Intelligence, Machine Learning, Deep Learning and Natural language processing (NLP). At the verge of big data crossing Moore’s chasm as the capital markets has started focusing on the unstructured data for new avenues for innovation and has offered immense opportunities for growth and sustainability.
3.2. Big Data and Capital Market Efficiency
In the world of finance, the Efficient Market Hypothesis (EMH) asserts that financial markets were “informationally efficient” which means current stock prices were already reflecting all known information (structured or unstructured) and all occurred events and facts. Therefore, investors cannot make excess profits from the market if their trading strategies are based on known information, because market prices are efficiently collecting and aggregating various information and keep changing without delay (Zhang and Skiena)  . Enterprise headquarters faces a tradeoff between the cost of attaining an accurate private information and the value of the information, this could lead to the inefficacy in the capital market. Foster  studied the efficiency and inefficiency of the capital market and explained the influence of information on the capital market. Fama  made the simple hypothesis that security prices fully reflect all available information considering the precondition of that information and trading costs, the costs of getting prices to reflect information, are always zero. It is clear that those who arbitrage make no return from their costly activity in case market is ideally information efficient. Hence, the assumptions that all markets, including that for information, are always in equilibrium and always perfectly arbitraged are inconsistent. In the advent of structured and un-structured information flow in the capital market through the social media, the role of information becomes even more relevant. In case the access of information with all the actors in the capital market not in equilibrium, then it leads the market toward arbitrage and makes less efficient. However effective usage of the Big Data information could reduce the opportunity of arbitrage and make it more efficient. Fama  was also of the opinion that predictability of stock returns based upon the dividend yields is not in itself evidence for or against market efficiency. While understanding the behavior of the capital market, Sharpe  mentioned that to predict the behavior of capital markets in the state of equilibrium, capital asset prices have adjusted so that the investor, if he follows rational procedures through diversification is able to attain desired point along a capital market line. Investor should be able to absorb all the available information and adjust and diversify its portfolio so that he could attain the desired goal along the Capital Market Line (CML).
It was difficult to observe the investors views on the capital markets and understand the decision-making process before the advent of the Big data. With the ubiquity of information technology and the Internet, an increasing number of investors are gathering information from the Internet for analysis. Ye and Li  observed that Google processed 2 million searches per minute in 2012, 204,166,667 emails were sent per minute, Twitter users posted approximately 100,000 tweets per minute pertaining to the capital markets. In one study, Bartov et al.  aggregated the opinion from individual tweets to successfully predict a firm’s forthcoming quarterly earnings and announcement returns. In order to study the impact of the microblogging, Jin et al.  observed a significant impact on the increase in the relative trading volume as well as the decreases in the daily expected stock return and firm-level volatility. In similar lines, while studying the impact of the financial news in the media Nagata and Inui  observed that there is intrinsic impact on the movements of the financial markets. While Big Data is spreading, a very convincing evidence is observed by Moat et al.  , which infers that online data and information can give new insights into real-world which could affect the collective decision making and can even anticipate future actions in the capital markets. In the field of Behavioral economics Bollen et al.  observed that emotions can profoundly affect the individual behavior and decision-making which could even be extended to even predictive of economic indicators. Zhang et al.  extended the research even further and observed based upon the analysis of the Twitter feed on daily basis and its correlation with the indices (NASDAQ, S&P 500, and Dow Jones), that emotional tweets percentage is significantly negatively correlated with key indices in the USA market. This is significant revelation which indicates the impact of the blogs and microblogs on the capital market of the developed countries like USA. When people are pessimistic or uncertain about the future, they will be more cautious to invest and trade. So, capturing the collective mind―especially people’s mood becomes one possible way to predict the stock market movement. While studying the human emotional and its influences on the capital market, Sharpe  observed that human state of emotion could influence decisions even in the capital market and the investment decision.
3.3. Finding of Key Papers: Big Data and Capital Markets
4. Concluding Remarks
The theoretical premises starting from the Random walk theory and Efficient Market Hypothesis indicate that predictability of the stock prices, creating a pattern in the movement of capital markets based upon the past movement is a very complex task. All the studies in the past related to impacts of the Big Data and its influences on the capital markets indicate that information asymmetry can be reduced to a significant level. However, many other macro variables like availability of the internet to all the investors, speed, access to the relevant information, costs of information and biasness in the content on the social media can provide opportunity for inefficiency in the market. Most of the scholars acknowledge during their researches and analyses that Big Data has started playing significant influences on the capital markets and decision-making process. It will further evolve and use the continuous flow of information through defined and non-defined sources such as social media, microblogs as part of their Big Data strategy to do the sentiment analysis and improve the decision moments to improve the maturity and minimize the inefficiency in the capital markets. In this context, the fundamental analysis based upon the Big Data flow from the industry draws a strong attention to carrying out further researches in the perspective of Random Walk theory, theory of economic development and Efficient Market Hypothesis.
1Principles of Corporate Finance by Brealey Myers Allen, Chapter 14, Efficient Market and Behavioral Finance.
2Financial Management by Prasanna Chandra, Market Efficiency and Financing Decisions.
3https://siteresources.worldbank.org/INTRGEP2004/Resources/classification.pdf Accessed on April 28th 2018.
 Singh, A.P. (2017) Keeping It Simple: Comparative Analysis of TFP across Manufacturing. Industries and Major States of India. Theoretical Economics Letters, 7, 1821.
 Singh, A.P. (2018) Does Size Matter?—The Effect of Size of Production Workers, Management Staff and Proprietors on Productivity. Theoretical Economics Letters, 8, 2290.
 Funk, R.J. and Hirschman, D. (2014) Derivatives and Deregulation: Financial Innovation and the Demise of Glass-Steagall. Administrative Science Quarterly, 59, 669-704.
 Zhang, W. and Skiena, S. (2010) Trading Strategies to Exploit Blog and News Sentiment. Proceedings of the 4th International AAAI Conference on Weblogs and Social Media, Washington DC, 23-26 May 2010, 375-378.
 Jin, X., Shen, D. and Zhang, W. (2016) Has Microblogging Changed Stock Market Behavior? Evidence from China. Physica A: Statistical Mechanics and Its Applications, 452, 151-156.
 Nagata, S. and Inui, K. (2014) Does High-Speed Trading Enhance Market Efficiency? Empirical Analysis on “Arrowhead” of the Tokyo Stock Exchange. The Journal of Trading, 9, 37-47.
 Moat, H.S., Preis, T., Olivola, C.Y., Liu, C. and Chater, N. (2014) Using Big Data to Predict Collective Behavior in the Real World. Behavioral and Brain Sciences, 37, 92-93.
 Zhang, X., Fuehres, H. and Gloor, P.A. (2011) Predicting Stock Market Indicators through Twitter “I Hope It Is Not as Bad as I Fear”. Procedia-Social and Behavioral Sciences, 26, 55-62.
 Seddon, J.J.J.M. and Currie, W.L. (2017) A Model for Unpacking Big Data Analytics in High-Frequency Trading. Journal of Business Research, 70, 300-307.
 Tian, X., Han, R., Wang, L., Lu, G. and Zhan, J. (2015) Latency Critical Big Data Computing in Finance. The Journal of Finance and Data Science, 1, 33-41.
 Easley, D., López de Prado, M.M. and O’Hara, M. (2012) The Volume Clock: Insights into the High-Frequency Paradigm. The Journal of Portfolio Management, 39, 19-29.
 Gilbert, E. and Karahalios, K. (2010) Widespread Worry and the Stock Market. Proceedings of the 4th International AAAI Conference on Weblogs and Social Media, Washington DC, 23-26 May 2010, 59-65.