
Machine Learning at Scale with H2O: A practical guide to building and deploying machine learning models on enterprise systems
Category
Author
Publication
Packt Publishing
H2O is an open source, fast, and scalable machine learning framework that allows you to build models using big data and then easily productionalize them in diverse enterprise environments.
Machine Learning at Scale with H2O begins with an overview of the challenges faced in building machine learning models on large enterprise systems, and then addresses how H2O helps you to overcome them. You'll start by exploring H2O's in-memory distributed architecture and find out how it enables you to build highly accurate and explainable models on massive datasets using your favorite ML algorithms, language, and IDE. You'll also get to grips with the seamless integration of H2O model building and deployment with Spark using H2O Sparkling Water. You'll then learn how to easily deploy models with H2O MOJO. Next, the book shows you how H2O Enterprise Steam handles admin configurations and user management, and then helps you to identify different stakeholder perspectives that a data scientist must understand in order to succeed in an enterprise setting. Finally, you'll be introduced to the H2O AI Cloud platform and explore the entire machine learning life cycle using multiple advanced AI capabilities.
By the end of this book, you'll be able to build and deploy advanced, state-of-the-art machine learning models for your business needs.
About the Author
Gregory Keys is a Senior Solution Architect at H2O and has over 20 years of experience designing and implementing software and data systems. He innovated a model deployment and governance framework that was incorporated into Cloudera machine learning product line.
David Whiting is a Data Science Director and Head of Training at H2O.ai. He has over 18 years of experience in business, consulting, and academia. He is adept at developing and maintaining long-term collaborations with experts in multiple fields. He has both led and participated in multi-disciplinary teams and he enjoys mentoring developing analysts and has a substantial experience in doing so.
--This text refers to the paperback edition.- Build and deploy machine learning models using H2O
- Explore advanced model-building techniques
- Integrate Spark and H2O code using H2O Sparkling Water
- Launch self-service model building environments
- Deploy H2O models in a variety of target systems and scoring contexts
- Expand your machine learning capabilities on the H2O AI Cloud
This book is for data scientists and machine learning engineers who want to gain hands-on machine learning experience by building and deploying state-of-the-art models with advanced techniques using H2O technology. An understanding of the data science process and experience in Python programming is recommended. This book will also benefit students by helping them understand how machine learning works in real-world enterprise scenarios.
- Opportunities and Challenges
- Platform Components and Key Concepts
- Fundamental Workflow - Data to Deployable Model
- H2O Model Building at Scale Capability Articulation
- Advanced Model Building Part I
- Advanced Model Building Part II
- Understanding ML Models
- Putting It All Together
- Production Scoring and the H2O MOJO
- H2O Model Deployment Patterns
- The Administrator and Operations Views
- The Enterprise Architect and Security Views
- Introducing the H2O AI Cloud
- H2O at Scale in a Larger Platform Context
- Appendix Alternative Methods to Launch H2O Clusters