MLOps: CI/CD for Machine Learning Training Course

Duration

35 hours (usually 5 days including breaks)

Requirements

  • An understanding of the software development cycle
  • Experience building or working with Machine Learning models
  • Familiarity with Python programming

Audience

  • ML engineers
  • DevOps engineers
  • Data engineers
  • Infrastructure engineers
  • Software developers

Overview

MLOps is a set of tools and methodologies for combining Machine Learning and DevOps practices. The goal of MLOps is to automate and optimize the deployment and maintenance of ML systems in production.

This instructor-led, live training (online or onsite) is aimed at engineers who wish to evaluate the approaches and tools available today to make an intelligent decision on the path forward in adopting MLOps within their organization.

By the end of this training, participants will be able to:

  • Install and configure various MLOps frameworks and tools.
  • Assemble the right kind of team with the right skills for constructing and supporting an MLOps system.
  • Prepare, validate and version data for use by ML models.
  • Understand the components of an ML Pipeline and the tools needed to build one.
  • Experiment with different machine learning frameworks and servers for deploying to production.
  • Operationalize the entire Machine Learning process so that it’s reproduceable and maintainable.

Format of the Course

  • Interactive lecture and discussion.
  • Lots of exercises and practice.
  • Hands-on implementation in a live-lab environment.

Course Customization Options

  • To request a customized training for this course, please contact us to arrange.

Course Outline

Introduction

  • Machine Learning models vs traditional software

Overview of the DevOps Workflow

Overview of the Machine Learning Workflow

ML as Code Plus Data

Components of an ML System

Case Study: A Sales Forecasting Application

Accessing Data

Validating Data

Data Transformation

From Data Pipeline to ML Pipeline

Building the Data Model

Training the Model

Validating the Model

Reproducing Model Training

Deploying a Model

Serving a Trained Model to Production

Testing an ML System

Continuous Delivery Orchestration

Monitoring the Model

Data Versioning

Adapting, Scaling and Maintaining an MLOps Platform

Troubleshooting

Summary and Conclusion

MLOps for Beginners

Current State of AI

How MLOps alleviates challenges faced in AI implementation

AI Model Lifecycle

Introduction to ML Platforms

Requirements

  • No programming experience needed. You will learn everything you need to know.

Description

AI is no longer exclusively for digitally native companies like Amazon, Netflix, or Uber. Unsurprisingly, Gartner predicts that more than 75% of organizations will shift from piloting AI technologies to operationalizing them by the end of 2024 — which is where the real challenges begin. Unfortunately, scaling AI in this sense isn’t easy. There is a chasm between ML and MLOps that can be tricky to scale. Getting one or two AI models into production is different from running an entire enterprise or product on AI. And as AI is scaled, problems can (and often do) scale, too.

Organizations that are serious about AI have to adopt a new discipline, “MLOps” or Machine Learning Operations. MLOps is the bridge. It is an engineering culture and practice that aims to unify ML system development and operations to facilitate data processing, machine learning pipeline, model training, experimentation, evaluation, registry, deployment, monitoring, serving, and scaling. Essentially, MLOps refers to a set of practices that helps in deploying and maintaining machine learning models in production efficiently and reliably. It is a collaborative team function often comprising of data scientists and DevOps engineers.

In this course, you will learn:

  • The building blocks of MLOps
  • The best practices and tools that facilitate rapid, safe, and efficient development and operationalization of AI

Who this course is for:

  • Aspiring MLOps Professionals and Enthusiasts
  • Individuals interested in data and AI industry

Course content

What is Machine Learning (ML)? | Definition from Techopedia

What Does Machine Learning (ML) Mean?

Machine learning (ML) is the sub-category of artificial intelligence (AI) that builds algorithmic models to identify patterns and relationships in data. In this context, the word machine is a synonym for computer program and the word learning describes how ML algorithms become more accurate as they receive additional data.

The concept of machine learning is not new, but its practical application in business was not financially feasible until the advent of the internet and recent advances in big data analytics and cloud computing. That’s because training an ML algorithm to find patterns in data requires a lot of compute resources and access to big data.

The terms artificial intelligence and machine learning are sometimes used as synonyms because until recently, most AI initiatives have been narrow and most ML models were built to perform a single task, used supervised learning and required large, labeled data sets for training. Today, robotic process automation (RPA) can be used to automate the data pre-processing process and make training a machine learning algorithm much faster.

Techopedia Explains Machine Learning (ML)

High-quality machine learning models require high-quality training data and access to large data sets in order to extract features most relevant to specified business goals and reveal meaningful associations.

Machine Learning Models

A machine learning model is simply the output of an ML algorithm that has been run on data. The steps involved in building a machine learning model include the following:

  • Gather training data.
  • Prepare data for training.
  • Decide which learning algorithm to use.
  • Train the learning algorithm.
  • Evaluate the learning algorithm’s outputs.
  • If necessary, adjust the variables (hyperparameters) that govern the training process in order to improve output.

In a typical ML setting, supervised machine learning algorithms require a dataset comprised of examples where each example consists of an input and output. In such a setting, a typical objective of training a ML algorithm is to update the parameters of a predictive model to ensure the model’s decision trees consistently produces desired outcomes. This is where entropy comes in.

Entropy is a mathematical formula used to quantify the disorder and randomness in a closed system. In machine learning projects, an important goal is to make sure entropy remains as low as possible because this measure will determine how the model’s decision trees will choose to split data.

Training Machine Learning

There are three main types of algorithms used to train machine learning models: supervised learning, unsupervised learning and reinforcement learning.

  • Supervised learning – the algorithm is given labeled training data (input) and shown the correct answer (output). This type of learning algorithm uses outcomes from historical data sets to predict output values for new, incoming data.
  • Unsupervised learning – the algorithm is given training data that is not labeled. Instead of being asked to predict the correct output, this type of learning algorithm uses the training data to detect patterns that can then be applied to other groups of data that exhibit similar behavior. In some situations, it may be necessary to use a small amount of labeled data with a larger amount of unlabeled data during training. This type of training is often referred to as semi-supervised machine learning.
  • Reinforcement learning – instead of being given training data, the algorithm is given a reward signal and looks for patterns in data that will give the reward. This type of learning algorithm’s input is often derived from the learning algorithm’s interaction with a physical or digital environment.

What Causes Bias in Machine Learning?

There is a growing desire by the general public for artificial intelligence – and machine learning algorithms in particular — to be transparent and explainable, but algorithmic transparency for machine learning can be more complicated than just sharing which algorithm was used to make a particular prediction.

Many people who are new to ML are surprised to discover that it’s not the mathematical algorithms that are secret; in fact, most of the popular ML algorithms in use today are freely available. It’s the training data that has proprietary value, not the algorithm used.

Unfortunately, because the data used to train a learning algorithm is selected by a human being, it can inadvertently introduce bias to the ML model that’s being built. The iterative nature of learning algorithms can also make it difficult for ML engineers to go back and trace the logic behind a particular prediction.

When it is possible for a data scientist or ML engineer to explain how a specific prediction was made, an ML model may be referred to as explainable AI. When it is not possible to reveal how a specific prediction was made — either because the math becomes too complicated or the training data is proprietary — the ML model may be referred to as black box AI.

MLops

Machine learning projects are usually overseen by data scientists and machine learning engineers. The data scientist’s job typically involves creating an hypothesis and writing code that will hopefully prove the hypothesis to be true. An ML engineer’s job focuses on machine learning operations (MLOps).

Machine learning operations is an approach to managing the entire lifecycle of a machine learning model — including its training, tuning, everyday use in a production environment and eventual retirement. This is why ML engineers need to have a working knowledge of data modeling, feature engineering and programming — In addition to having a strong background in mathematics and statistics.

Ideally, data scientists and ML engineers in the same organization will collaborate when deciding which type of learning algorithm will work best to solve a particular business problem, but in some industries the ML engineer’s job is limited to deciding what data should be used for training and how machine learning model outcomes will be validated.