Machine Learning Logistics: Model Management in the Real World

Model Management in the Real World

How do you get a machine learning system to deliver value from big data?

Turns out that 90% of the effort required for success in machine learning is not the algorithm or the model or the learning – it’s the logistics. Ted Dunning and Ellen Friedman identify what matters in machine learning logistics, what challenges arise, especially in a production setting, and they introduce an innovative solution: the rendezvous architecture.

This new design for model management is based on a streaming approach in a microservices style. Rendezvous addresses the need to preserve and share raw data, to do effective model-to-model comparisons and to have new models on standby, ready for a hot hand-off when a production model needs to be replaced.

In this book you’ll learn:

  • Why successful machine learning projects involve many models
  • How to use a decoy model to capture exact data inputs
  • The capabilities needed in stream transport technology to support a stream-first microservice approach
  • How to do accurate model evaluation, including use of the t-Digest
  • Why a canary model is useful in a production setting
  • How to achieve rapid and seamless deployment of new models
  • The role of DataOps style of work and a global data fabric in making logistics for machine learning much easier
  • Subscribe to the Data Insider Newsletter

    Learn the latest news and best practices about data science, big data analytics, artificial intelligence, data security, and more. Delivered Mondays and Thursdays

    Subscribe to the Data Insider Newsletter

    Learn the latest news and best practices about data science, big data analytics, artificial intelligence, data security, and more. Delivered Mondays and Thursdays

Resource Details

MapR Technologies logo
Provided by:
MapR Technologies
Topic:
Big Data
Format:
PDF