The data infrastructure market has grown massively as data and data systems contribute directly to business results, not only in Silicon Valley but also in traditional industries. But we are still in the early innings of data systems, whose rise we have covered extensively, and we believe this market has the opportunity to be as large and critical as the overall cloud and SaaS markets in the decades to come.
As the market matures, we see a fundamental shift in how organizations approach data access and analytics. Cloud data storage has made it possible for companies to marry data across transactional systems, applications, logs, and object storage, giving data analysts the ability to analyze previously disparate information for new insights on historical data and more refined predictions.
While companies want the ability to access, analyze, and run queries against all these data sources, many still have data all over the place. Most large, established companies maintain multiple data warehouses and data lakes: some on-prem, some in the cloud; some modern, some legacy. For these companies, centralizing their data in one place would be an expensive 5-to-10-year endeavor, and that assumes they don’t start new projects with different data stores or acquire other companies along the way. Unsurprisingly, we have yet to see a large company with a single data warehouse or lake. Instead, companies are looking to query data wherever it is, without an architectural lift and shift to a single point of storage.
Trino (formerly known as PrestoSQL) is an open source, distributed SQL query engine that removes the need to move data in order to analyze it. Data analysts run Trino because, in our view, it’s the best tool on the market in terms of efficiency and latency. It queries data no matter where it lives, most commonly for interactive analytics, through a consistent SQL interface, so analysts can access large-scale distributed data stores in the same way they would interact with a centralized data warehouse. It can also join data from multiple sources in ad hoc queries. Its wide range of connectors covers data lakes (S3, HDFS, Azure Blob Storage), relational databases (MySQL, SQL Server), and NoSQL databases (MongoDB, Cassandra).
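To make the "query data where it lives" idea concrete, here is a minimal sketch of what a cross-source query can look like. Trino addresses tables as `catalog.schema.table`, where each catalog is backed by a connector, so a single SQL statement can join a relational table with a data lake table. The catalog, schema, table, and column names below (`mysql.shop.orders`, `hive.weblogs.events`, `customer_id`, `event_id`) are hypothetical, chosen only for illustration, and the Python helper simply assembles the SQL text rather than talking to a cluster.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TableRef:
    """A table addressed the way Trino names it: catalog.schema.table."""
    catalog: str  # name of a configured catalog (one connector instance)
    schema: str
    table: str

    def fqn(self) -> str:
        # Fully qualified name used inside the SQL statement.
        return f"{self.catalog}.{self.schema}.{self.table}"


def federated_join_sql(orders: TableRef, events: TableRef) -> str:
    """Build one SQL statement that joins tables from two different catalogs."""
    return (
        "SELECT o.customer_id, count(e.event_id) AS event_count\n"
        f"FROM {orders.fqn()} AS o\n"
        f"JOIN {events.fqn()} AS e ON e.customer_id = o.customer_id\n"
        "GROUP BY o.customer_id"
    )


# Hypothetical catalogs: 'mysql' backed by an RDBMS, 'hive' backed by a data lake.
ORDERS = TableRef("mysql", "shop", "orders")
EVENTS = TableRef("hive", "weblogs", "events")

if __name__ == "__main__":
    print(federated_join_sql(ORDERS, EVENTS))
```

The resulting statement would be submitted to a Trino cluster through whatever client an analyst already uses. The point of the sketch is only that the join across two storage systems is expressed in ordinary SQL, with no data movement step written by the analyst.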
Simply put, we believe Trino will be one of the most widely used data infrastructure technologies in the market in the coming years.
Starburst, which provides an enterprise version of Trino, was started by CEO Justin Borgman, who has unique experience shepherding Trino (previously Presto). He first got involved in the community in 2014 while he was the VP and GM of Hadoop products at Teradata, following the acquisition of his first company, Hadapt. Joining Justin are two co-founders, Matt Fuller and Kamil Bajda-Pawlikowski, as well as Martin Traverso, Dain Sundstrom, and David Phillips, the team that originally created Presto at Facebook in 2012 and has led the community ever since. With its disciplined approach, long-term vision, and workhorse mentality, we believe the team at Starburst is building the technology required to unlock the value of data wherever it lives.
Starburst has built staggering momentum in just 26 months of selling in the market. It offers a compelling managed solution for performance, connectivity, security, management, and support. It improves query performance by 10x compared with the open source version and adds enterprise features such as single sign-on (SSO), role-based access control (RBAC), high availability, and 24/7 support.
And from a business standpoint, it’s working! Comcast, Nike, Bank of America, Slack, and VMware are customers, and Starburst has best-in-class retention, as demonstrated by 99% gross revenue retention. It also has exceptional net dollar retention of roughly 150% or more, which indicates that customers buy more the longer they use Starburst.
We believe the future is bright for Starburst, and that it will become one of the key next-generation data architecture companies in the market. We are thrilled to announce our investment in Starburst from the Growth fund and welcome Justin and Starburst to the a16z family. We are excited to work with you in the years to come!