Course

Implementing a Lakehouse with Microsoft Fabric – Intensive Training («DP601»)

This course builds your foundational data engineering skills on Microsoft Fabric, with a focus on the lakehouse concept.
Duration: 1 day
Price: 900.–
Course documents: Official Microsoft Courseware and Microsoft Learn

Course facts

  • Introduction to end-to-end analytics using Microsoft Fabric
  • Getting started with Lakehouses in Microsoft Fabric
  • Using Apache Spark in Microsoft Fabric
  • Working with Delta Lake tables in Microsoft Fabric
  • Ingesting Data with Dataflows Gen2 in Microsoft Fabric
  • Using Data Factory pipelines in Microsoft Fabric

The course explores the capabilities of Apache Spark for distributed data processing and covers essential techniques for efficient data management, versioning, and reliability with Delta Lake tables. It also covers data ingestion and orchestration using Dataflows Gen2 and Data Factory pipelines. A combination of lectures and hands-on exercises prepares you to work with lakehouses in Microsoft Fabric.

Course outline:

Module 1: Introduction to end-to-end analytics using Microsoft Fabric

  • Discover how Microsoft Fabric can meet your enterprise's analytics needs in one platform. Learn what Microsoft Fabric is, how it works, and how you can use it for your analytics needs.

Module 2: Get started with Lakehouses in Microsoft Fabric

  • Lakehouses merge data lake storage flexibility with data warehouse analytics. Microsoft Fabric offers a lakehouse solution for comprehensive analytics on a single SaaS platform.

Module 3: Use Apache Spark in Microsoft Fabric

  • Apache Spark is a core technology for large-scale data analytics. Microsoft Fabric provides support for Spark clusters, enabling you to analyze and process data in a lakehouse at scale.

Module 4: Work with Delta Lake tables in Microsoft Fabric

  • Tables in a Microsoft Fabric lakehouse are based on the Delta Lake storage format commonly used in Apache Spark. By using the enhanced capabilities of delta tables, you can create advanced analytics solutions.

Module 5: Ingest Data with Dataflows Gen2 in Microsoft Fabric

  • Data ingestion is crucial in analytics. Microsoft Fabric's Data Factory offers Dataflows Gen2 for visually building multi-step data ingestion and transformation processes using Power Query Online.

Module 6: Use Data Factory pipelines in Microsoft Fabric

  • Microsoft Fabric provides Data Factory capabilities, including the ability to create pipelines that orchestrate data ingestion and transformation tasks.

The primary audience for this course is data professionals who are familiar with data modeling, extraction, and analytics. It is designed for professionals who want to learn about lakehouse architecture, the Microsoft Fabric platform, and how to enable end-to-end analytics using these technologies.

You should be familiar with basic data concepts and terminology. We recommend attending the following course or having equivalent knowledge:
