Databricks ML Engineer Skill Path

2f06907e-0c74-4ee9-b16d-219199af6778_3135f616-5436-41b3-adc1-310a6cfc4645_Databricks-data engineering.webp skill path cover image

End-to-end industry-focused skill path to help gain essential skills for running machine learning workloads on Databricks

Pre-Requisites

✅ Model Pre-procesisng

✅ Model building and Evaluation

✅ NLP tasks and RNN

✅ Vision tasks and CNN

Key Highlights

✅Mutiple raw and clean data sources for ingestion

✅ Aligned to Databricks ML Associate certificate

✅ Intricacies of Model serving and MOps

Skill Path

Masterclass

Introduction to Databricks

This Module details the need for a Unified Analytics platform like Databricks and how to utilize it to tackle Data + AI challenges. In this Module, we will look into Databricks architecture and how it can be created in Azure Databricks. We will also understand different types of clusters needed for various Analytical workloads

databricks

clusters

270 Minutes

Masterclass

Exploratory Analysis and data preparation using Databricks

This module emphasizes techniques for data exploration and visualization using Spark and Python within Databricks.

e-commerce

databricks

330 Minutes

Masterclass

AutoML using Databricks

This module explores Databricks' AutoML capabilities, which automate parts of the machine learning workflow.

e-commerce

databricks

270 Minutes

Masterclass

Feature Engineering using Databricks

This module focuses on the process of preparing data for machine learning, including handling missing values and encoding categorical features, engineering domain-specific features, and other data pre-processing techniques

e-commerce

databricks

375 Minutes

Masterclass

Feature Management with Feature stores

This module covers the Databricks Feature Store, which helps in storing and accessing features for machine learning pipelines.

e-commerce

databricks

120 Minutes

Masterclass

Feature Management with Unity catalog

This module explores the use of Unity Catalog for managing and sharing features across different teams and projects within Databricks.

e-commerce

databricks

unity-catalog

165 Minutes

Masterclass

Model Logging with MLFlow on Databricks

This module delves into MLflow, focusing on experiment tracking, model management, and deployment within Databricks.

e-commerce

databricks

150 Minutes

Scenario bundle

ML Model Building

This module covers the training of machine learning models, including hyperparameter tuning and parallelization techniques.

225 Minutes

Masterclass

Distributed Model training

This module discusses the challenges and techniques for scaling machine learning models using Spark.

e-commerce

databricks

PySpark

240 Minutes

Masterclass

Model Serving

This module covers the deployment and serving of machine learning models in Databricks, ensuring models are accessible for predictions in production environments.

e-commerce

databricks

150 Minutes

Masterclass

Model Drift Analysis

This module focuses on techniques and tools for monitoring and analyzing model drift in machine-learning applications. It covers the detection of changes in data and model performance over time to ensure ongoing model accuracy and reliability.

e-commerce

databricks

180 Minutes

Databricks ML Engineer Skill Path

Pre-Requisites

Key Highlights

Skill Path

Introduction to Databricks

6 Scenarios

Exploratory Analysis and data preparation using Databricks

3 Scenarios

AutoML using Databricks

5 Scenarios

Feature Engineering using Databricks

5 Scenarios

Feature Management with Feature stores

4 Scenarios

Feature Management with Unity catalog

4 Scenarios

Model Logging with MLFlow on Databricks

4 Scenarios

ML Model Building

4 Scenarios

Distributed Model training

3 Scenarios

Model Serving

4 Scenarios

Model Drift Analysis

4 Scenarios