Splice Machine Feature Store Debuts

January 20, 2021

Splice Machine has launched the Splice Machine Feature Store. The solution will help more companies operationalize machine learning by reducing the complexity of feature engineering and allow data scientists to make the right decisions based on real-time data.

Despite the hype around AI and machine learning, seven out of 10 executives whose companies had made investments in artificial intelligence reported minimal or no impact from them, according to a 2019 research report from MIT. Those surveyed stated that creating a machine learning model and putting it into operation in an enterprise environment are two very different things.

"The capacity to create, share, explain and reliably reproduce features for a given model is paramount to the success of a data science team," said Monte Zweben, CEO, Splice Machine. "The old way of doing things meant data science operations were simply not scalable. The Splice Machine Feature Store enables you to harness complex analytics in real time and transform real-time data into features, so your models are never uninformed. It also stores feature history making training set creation a single click."

Feature engineering is the most time-consuming and expensive task of the data science life cycle. As companies work to operationalize machine learning, current approaches are not scalable because data science productivity is too low to enable widespread adoption. Simplifying the data science workflow by providing necessary architecture and automating feature serving with feature stores are two of the most important ways to make machine learning easy, accurate and fast at scale.

The Splice Machine Feature Store solves some of the biggest pain points of operationalizing machine learning, including:

Reducing the effort of feature engineering

Helping to solve for governance issues, such as bias, drift, or regulatory oversight

Scaling data science operations

Reducing monetary loss from the creation of inaccurate models

This will help data scientists realize numerous benefits, including:

Achieving faster deployments of AI/ML into production by reusing features and avoiding duplicative feature engineering

Spending 80% less time on feature engineering

Developing more informative models via automatic aggregation of raw data

Gaining predictive accuracy of models with near real-time feature updates and consistent training sets

"As our Clinical Advisor application continues to be adopted by more and more clinicians, there will be increasingly more data at our disposal to help hone and create new features that can further improve our application's precision in identifying the course of disease and improving patient outcomes," said IPH (Innovative Precision Health) Chief Scientific Officer, Mark Gudesblatt, MD. "Splice Machine's feature store makes it possible to leverage features developed by data scientists for a particular disease like Multiple Sclerosis to be leveraged by other data scientists for other diseases like Parkinson’s or Alzheimer's."

"We had this sort of a feature store at Airbnb, but it was limited by the fact that we were largely on HDFS," said Robert Yi, CDO at Dataframe and former Airbnb data scientist. "It enabled users to share features, but it didn't solve the online/offline problem. But the solution can obviously be much more elegant if you start with a more amenable database that can function in realtime. Splice Machine seems to be doing exactly that – ML flow integration, database re-injection, Spark lazy loading, easy deployment, and API-less access."

Zweben added, "The Splice Machine Feature Store is the only one in the market that does not have to synchronize two underlying data engines -- an online store and an offline store, simplifying the architecture with one store, lowering costs, and eliminating latency."

Terms of Use | Copyright © 2002 - 2021 CONSTITUENTWORKS SM  CORPORATION. All rights reserved. | Privacy Statement