Chapter 7 The Market, The Players and The Rules
7.1 The Market
In an industry where almost every player has access to the same (or similar) core financial and fundamental data, investment firms look for Alternative Data seeking differentiated insights into companies not found in filings, earnings calls or fundamental datasets.
Until recently, the usage of alternative data has been confined mainly to the realm of quantitative investment managers, as these firms were best suited to obtain, clean and process this data. Now, however, alternative data is beginning to go mainstream, with increasing interest from fundamental and hybrid asset managers here (Johnson 2019).
Alternative Data is Untapped Alpha. The biggest opportunity for investors in this decade comes from the signals buried in the data generated by the digital economy. Alternative data is the deepest, least utilized alpha source in the world today - Quandl.
On June/2019, alternative data went mainstream when Verizon made a bold move by launching a subscription service for Yahoo Finance that offers alternative data and insights to retail investors at a price point of $34.99/month.
The amount spent in Alternative Data have been steadily increasing. More than half of all quantitative and fundamental investors surveyed recently are either considering or using alternative data as part of their workflow - Greenwich Associates.
A recent Greenwich Associates report found that a wide majority of investment managers, 72%, stated that alternative data was improving enhanced their signal quality in an arena where filtering out signal noise. Of those who are implementing an alternative data strategy, more than one-fifth claim to have received 20% or more of their alpha from the practice.
7.2 The Data
But where’s Alternative Data coming from and who are the key players providing access to it?
The growth of Alternative Data has been enabled by the digitalization of the world around us. Data have been produced at an unprecedented rate by heterogeneous sources including:
- Individuals who today mirror their lives in the form digital behavior into the Web, Social Media and Apps;
- Sensors, particularly with the emergence of IoT, that generate satellite images, weather forecasts and geolocation data;
- Businesses from which data is generated in the form of traditional official filings and earning calls as well as banking records or supply chain data.
The number of Alternative Data Providers is exploding.The number of alternative data vendors is exploding particularly in the past decade. A few companies are worth mentioning:
- Quandl is likely the largest financial and alternative data aggregator/provider today. They leverage relationships with third-party providers to be a one-stop-shop for alternative data from consumers, IoT and sentiment to traditional fundamental, pricing and estimates. Quandl has been recently acquired by Nasdaq.
- Dataminr leverages advanced anomaly detection technology in a real-time AI platform to uncover critical events from social media before the information goes mainstream. The company raised a staggering $500MM+ amount in funding so far and it gives no sign of that is slowing down.
- Thinknum provides a SaaS-based web platform for a variety of alternative data sets, in particular, web-scrapped data such as product comparisons and company ratings on jobs sites. Thinkum claims to have 8 of the 10 largest investment banks as their clients. They recently closed a $11.6 million Series A funding to expand their financial modeling tools.
- Yewno is helping the world to uncover the undiscovered through its advanced dynamic Knowledge Graph and AI based inference engine, which introduces an entirely new approach to knowledge extraction to enhance human understanding by correlating concepts across a vast volume of sources. Yewno ingests data from sources such as clinical trials, patents, company transcripts and court opinions and more. Yewno raised about $30 million dollars in funding and it has launched AI-based equity indexes with major partners such as STOXX and Nasdaq and licensed alternative data feeds to buy-side firms.
Other rising vendors include YipitData, 1010data and Enigma as well as incumbent players such as Refinitiv, S&P Global Market Intelligence, Bloomberg and Factset.
While the alternative data offering is exploding there is a gap between what data vendors offer and what data buyers want.
A research report conducted by BattleFin and AlternativeData.org has found that there is a gap between what data vendors offer and what data buyers actually want: - Data buyers view the three most-valuable data categories as: credit/debit cards (14.83%) web data (12.29%), and social/sentiment (11.02%). - By contrast, the percentage of data providers covering these data categories are: credit card (0.69%), web-crawled data (3.94%) and sentiment (4.40%). - Data providers report that the top-three categories their data covers are: business insights (6.48%), web traffic (4.63%) and sentiment (4.40%).
7.3 The Buyers
The data buyers include your usual suspects such as Two Sigma, Milennium, Third Point, WorldQuant, ExodusPoint and emerging Quant Funds like Credit Suisse’s QT Fund.
80% of buyers expect to purchase a minimum of one to five datasets in 2019 while nearly 60 percent expects to test and evaluate a minimum of one to five datasets this year - according to a research report conducted by BattleFin and AlternativeData.org.
Pension Plans and Index Providers are also entering the space seeking exposure to alternative factors, particularly with the emergence of AI-based strategies that leverage machine learning and computational linguistics techniques to extract signals from unstructured data (see STOXX’s or DWS’s AI-based indexes) as well as the increased interest in themes such as ESG (Environmental, Social and Corporate Governance) - a multi-dimensional multi-sourced theme with increasing demands that are no longer met by traditional data sources alone.
I have been to a good number of meetings with data buyers (BattleFin is maybe the best event out there for 1-on-1 meetings with high-profile buy-side firms sometimes allowing for 30+ meetings in two days of events in addition to dozens of unofficial ones). Some questions are commonplace:
- Can you describe your data sources and your data ingestion process?
- What’s the frequency/history/coverage?
- What’s the delivery method?
- What are the key use cases for your data?
- Can I run a trial with live data?
- Who are your clients?
- What’s pricing like?
- Do you offer exclusivity?
- What is the size/funding/team profile of your firm?
There is one tricky question though: Whether the data was already licensed to other clients. Particularly if the buyer is looking for alpha, they actually would prefer to be one of the first to test the data and avoid using a dataset that is already “crowded”. By the same token, they would be happy to hear that the dataset was already licensed to reputable clients as a sign of credibility. It’s up to you to navigate through this trade-off.
This is just the very first step in the sales and marketing life-cycle. From lead generation to customer sign-up, data buyers can take several months to make a final decision. Here are some findings:
- Providers and buyers agree that alt-data sales cycle can take up to seven months, with more than 75% of buyers requiring up to six months to test, evaluate and purchase an alt dataset.
- More than a third of data buyers use backtesting to inform their purchasing decision. Nearly 60% of providers are not interested in outsourcing the data testing process even if it can be handled efficiently.
- More than 20% of providers sell and distribute using a customized alt-data software platform. Nearly 18% use no software platform at all.
7.4 Conclusion
Alternative Data, once a treat only for the most advanced Hedge Funds, is becoming a must-have in the alpha-generation war on Wall Street.
Total spending in the area has been steadily growing as well as the number of data vendors and variety of data products. Relevant acquisitions have been made in the space and the first unicorn startups have been formed.
However, there are critical challenges in the industry. There is a gap between what buyers find most relevant and what the data vendors are producing. Further, the sales life-cycle is long, as players are still figuring out how to test the value of the data and considerable time is spent on on-boarding new datasets.
Successful players will be those who bridge the gap between data and insights first. The race has just begun, and it looks like no one wants to be just a spectator.
References
Johnson, Richard. 2019. “Demystifying Alternative Data.” https://www.greenwich.com/asset-management/demystifying-alternative-data; Greenwich Associates.