The properties of a software system – performance, accuracy, security, compliance, and so much more – have traditionally been dictated by the code used to build it. As a result, an entire tool chain has been developed to aid in writing, debugging, securing, and analyzing code.
However, with more software systems built around AI and ML models, system performance, accuracy, and security are as much a function of the data the system operates on as the code they’re built on. Building ML systems has become a competitive edge, especially in product differentiation, with valuable use cases from fraud detection to recommendation engines. Increasingly that competitive edge depends on ML that is used for real-time decisions, requires fresh data, and is low latency and high scale.
Unfortunately, the tooling to aid the data scientists and data engineers who build AI/ML systems is far less mature than the products built to aid in software development. Data scientists often work locally, training models and building the pipelines of data that feed them. But taking that local model into at-scale production is an arduous, time-consuming process, subject to constraints that just aren’t present in the training environment. Furthermore, models trained offline have to be pushed online, and operate on the same type of data (called features) in order to give sensible results. But the tooling to standardize, govern, and collaborate around ML data is still incredibly immature.
When I first met the Tecton team, they were pioneering a project from within Uber called Michelangelo. Their goal was to democratize machine learning and AI by providing the tooling for data management in ML pipelines, so it was as easy to build an ML system as to code a simple app.
The goal is simple to state, but figuring out how to do it took years of experience building large ML and AI systems within companies such as Uber, Google, Airbnb, and Facebook.
The insight the Michelangelo team had was to build a platform, which they called a feature store to manage the particular data signals (i.e. “features”) important to the ML systems much like you’d manage code. The feature store made the handoff easier between data scientists who identify the features, and the data engineers who manage the systems that use them in production. With Michelangelo, data scientists and engineers could extract features offline to train models, and then move those features in a consistent manner to production.
At Uber, the feature store greatly improved the time it took to get ML models into production, and provided a standardized and unified repository of the most important signals to the business. It also provided an interface between data scientists and data engineers so they could collaborate to achieve goals with fewer errors. Today, Michelangelo and its feature store power thousands of models in production.
The feature store garnered immediate attention throughout the industry. However, Mike, Jeremy, and Kevin, who worked on the project, knew that there was a lot more that could be built to further their goals of democratizing ML, so they created Tecton.
We initially did the seed investment in Tecton at the end of 2018 and it was soon obvious how much the industry wanted better tooling around ML data. After tracking a number of deep engagements with top ML teams and their interest in what Tecton was building, we invested in Tecton’s A alongside Sequoia. We strongly believe that these systems will continue to increasingly rely on data and ML models, and an entirely new tool chain is needed to aid in developing them. Therefore, we at a16z are incredibly thrilled to be working with Tecton to aid in building the most sophisticated AI and ML data pipelines.
***
The views expressed here are those of the individual AH Capital Management, L.L.C. (“a16z”) personnel quoted and are not the views of a16z or its affiliates. Certain information contained in here has been obtained from third-party sources, including from portfolio companies of funds managed by a16z. While taken from sources believed to be reliable, a16z has not independently verified such information and makes no representations about the enduring accuracy of the information or its appropriateness for a given situation. In addition, this content may include third-party advertisements. A16z has not reviewed such advertisements and does not endorse any advertising content contained therein.
This content is provided for informational purposes only, and should not be relied upon as legal, business, investment, or tax advice. You should consult your own advisers as to those matters. References to any securities or digital assets are for illustrative purposes only, and do not constitute an investment recommendation or offer to provide investment advisory services. Furthermore, this content is not directed at nor intended for use by any investors or prospective investors, and may not under any circumstances be relied upon when making a decision to invest in any fund managed by a16z. (An offering to invest in an a16z fund will be made only by the private placement memorandum, subscription agreement, and other relevant documentation of any such fund and should be read in their entirety.) Any investments or portfolio companies mentioned, referred to, or described are not representative of all investments in vehicles managed by a16z, and there can be no assurance that the investments will be profitable or that other investments made in the future will have similar characteristics or results. A list of investments made by funds managed by Andreessen Horowitz (excluding investments for which the issuer has not provided permission for a16z to disclose publicly as well as unannounced investments in publicly traded digital assets) is available at https://a16z.com/investments/.
Charts and graphs provided within are for informational purposes solely and should not be relied upon when making any investment decision. Past performance is not indicative of future results. The content speaks only as of the date indicated. Any projections, estimates, forecasts, targets, prospects, and/or opinions expressed in these materials are subject to change without notice and may differ or be contrary to opinions expressed by others. Please see https://a16z.com/disclosures for additional important information.