
Learning Objectives
Overview
This module introduces data governance in Databricks, focusing on challenges in managing data access across multiple workspaces. It covers key governance aspects such as access control, user & role management, data lineage, data audit, and data discovery to ensure data security, compliance, and efficiency.
The module then explores Unity Catalog, Databricks' built-in governance solution that centralizes access control, auditing, and metadata management across workspaces. It explains the benefits of a unified governance framework and the hierarchical structure of Unity Catalog.
Finally, you will learn how to create a catalog in Unity Catalog, which serves as the top-level container for organizing datasets. The module covers catalog creation using Databricks UI and SQL commands, catalog types, and storage configuration for efficient data management.
Prerequisites
- Basic understanding of Databricks workspaces and how users access data.
- Familiarity with database concepts, including schemas, tables