Raj Cloud Technologies

Azure Data Engineering & Databricks Training

Ratings 4.7
4.6/5

(Rating based on 2K+ reviews)

UPDATE: Starting 17th January

Live: Instructor Led Training

This comprehensive training is designed to enhance your skills in both Azure Data Engineering and Big Data processing using Spark, helping you advance your career . Throughout the course, you’ll work on 5 real-world projects, gaining hands-on experience with Azure Data Engineering and Big Data technologies.We start from the basics, so no prior knowledge is required. In addition to mastering these technologies, you’ll receive guidance on completing projects, preparing for interviews, building a strong resume, and certification support for exams like the Azure Data Engineer Associate (DP-203) and Databricks Associate Certification.

+91 81052 96858

Have Queries? Ask our Experts.

Get More Info, Enquire Now!

We are available 24x7 for your queries.

Contact Form (Azure)

Our students were hired by:

Azure Data Engineering & Databricks Training

Technologies Taught

Course Unique Features

Certifications

Course Curriculum

Azure Data Engineering Curriculum

Azure Basics
  • Basics of Cloud
  • Public, Private and Hybrid Cloud
  • Iaas, Saas, and Paas
  • Azure Entra
  • Tenant
  • Subscription
  • Management Group
  • Resource Group
  • Azure Portal Overview
Azure Storage Fundamentals
  • Azure Storage Account
  • Azure Blob Storage
  • Azure Data Lake
  • ACL and Permissions

Azure Data Factory Components

  • Azure Data Factory Overview
  • Integration Run Time
  • Linked Service
  • Pipeline, Dataset and Activity
  • Source and Sink Configuration
  • Copy Data Activity
  • Recursive, Wildcard, File Listing, Mapping, User Properties.
  • Parameterization of Copy Data Activity
  • Sequential and Parallel Copy

Data Loading in Azure SQL Server

  • Upsert, Pre-Copy Script, and AutoCreate Table Options.
  • Handling Multiple Files with Lookup
  • Full Load Pipeline
  • New Watermark and Old watermark Concept
  • Delta Laod for Single File.
  • Delta Load for Multiple File

Pipeline Monitoring

  • Monitoring of Activity
  • Monitoring of Pipeline
  • Execute Pipeline Activity

Azure DevOps and Git

  • Azure DevOps Overview
  • Azure DevOps Integration with ADF
  • Azure Data Factory Integration with Git
  • Git Pull, Git Push
  • Cherry Pick, Git Revert.

Azure Key Vault and Security

  • Azure Key Vault
  • Managing Keys and Security
  • Azure Key Vault integration with ADF
  • Azure Key Vault Integration with Linked Service

Data Modelling and Design

  • Fact and Dimension Tables.
  • Star and Snowflake Schemas
  • SCD Types and Implementation
  • Creating Stored Procedures for Data Modelling
  • Trigger Stored Procedure from ADF

Azure Logic Apps and Notifications

  • Azure Logic App
  • Sending Email Alerts using Logic App

Project 1 – Migrating Data from MS SQL Server to Cloud

  • Full Load and Delta Load with Monitoring and Warehousing

Azure Data Factory Advance

  • Event-Based, Scheduled, and Tumbling Window Triggers
  • Data flow and Data Transformation Activity
  • If-Else, Metadata, and Web Activities
  • Global Parameter
  • Parameterizing Triggers

Azure Databricks

  • Overview of Databricks
  • Spark and Spark Architecture
  • Data Lake and Delta Lake
  • Delta Table and Features
  • Processing CSV, JSON AND XML file using PySpark.
  • Integration of Azure Key Vault with Databricks.
  • Secret Scope and JDBC connector
  • Writing Data to Azure SQL server.

Project 2 – KPI Dashboarding with Azure Databricks

  • Data processing using Pyspark and creating Delta Lake

Azure Synapse Analytics

  • Overview of Azure Synapse
  • Difference between Synapse and Data Factory
  • Synapse Dedicated Pool
  • Synapse Serverless Pool
  • Polybase Copy
  • Synapse Spark Pool

Project 3 – Dashboarding using Azure Synapse

  • Orchestrated the secure migration of data from on-premise servers to Azure Data Lake using Synapse Analytics pipelines

Big Data Processing Using Data Bricks and Spark

Introduction to Data Bricks and Spark Architecture

  • Overview of Databricks and Spark Architecture
  • Databricks Workspace Overview
  • Understanding Databricks Services and Features
  • Cluster Management: Creation, Autoscaling, and Administration

RDD and Data Frame Fundamentals

  • RDD (Resilient Distributed Dataset) Overview
  • Data Frame Spark API and Data Source API Fundamentals
  • Conversion between PySpark and Pandas
  • Common Transformation Techniques in PySpark
  • Transformations vs. Actions

Dbutils and Parameterization

  • Usage of Dbutils for File System Interaction
  • Parameterization Techniques in Databricks

Delta Lake and Delta Table

  • Table Manipulation with Delta Lake
  • Read and Process CSV, JSON, and XML Files
  • Types of Views in Databricks (Global, Local, Temporary)
  • Managed vs. Unmanaged Tables
  • Versioning and Time Travel in Delta Lake

Azure DevOps and Git Integration

  • Azure DevOps and Git Workflow Integration
  • Cherry Pick and Git Revert Commands
  • Secret Scope Creation and Management
  • JDBC Connector for SQL Server

Project 4 – Retail Dashboarding with Azure Databricks

  • Data processing using Pyspark and creating Delta Lake

Databricks CLI and Backup Process

  • Databricks CLI Overview and Installation
  • Backup Process Setup for Notebooks and Configurations.

Understanding Spark UI

  • Navigating the Spark UI for Job Monitoring
  • Understanding the Stages, Tasks, and Execution Plans

Unity Catalog and SCD (Slowly Changing Dimensions) Implementation.

  • Overview of Unity Catalog for Data Governance
  • SCD Types (Type 1, Type 2, Type 3) and Implementation

Lakehouse and Medallion Architecture

  • Introduction to Lakehouse Architecture
  • Medallion Architecture (Bronze, Silver, Gold Layers) Overview

Spark Optimization Techniques

  • User-Defined Functions (UDF) for Custom Transformations
  • Catalyst Optimizer and Data Frame. explain for Optimization
  • Directed Acyclic Graphs (DAG) and Adaptive Query Execution (AQE)
  • Predicate Pushdown and Projection Pushdown
  • Repartition, Coalesce, Cache, and Persist

Project 5 – Medallion Architecture with Unity Catalog

  • Process data using PySpark and create Delta Lake while applying Medallion Architecture for structured data layers.

Handling Complex Data and Advanced Joins

  • Handling Complex JSON, Struct, and Nested Data Types
  • Data Skew and Techniques for Handling Skewed Data
  • Sort Merge Join, Broadcast Join, and Optimizing Joins
  • Z-Ordering for Efficient Querying

Orchestration and Scheduling Techniques

  • Job Orchestration and Scheduling with Databricks Jobs API
  • Best Practices for Workflow Orchestration

Resume Preparation and Interview tips

  • Resume Preparation and Interview tips

What you’ll learn

Course Instructed By:

Mr.Abhishek Agrawal

A Microsoft Certified Azure Data Engineer, NIT Raipur graduate with 8+ yrs of experience. Specializing in Azure Data Factory, Databricks, Azure Synapse Analytics, PySpark and SparkSQL. Guided over 200+ globally into successful Azure Cloud careers. and provides real-world learning, along with dedicated support for career transitions and interview preparation. Approved trainer by Raj Cloud Technologies.

Like the curriculum? Enroll Now

Login or Sign up using your Google account for fast enrolment and easy access.

Azure Data Engineering & Databricks

Fees: 20,000 25,000

Live Session Timing: 7:30 PM, IST

Azure Data Engineering

Azure Data Eng. Certificate Program

Evening: 7:30 PM, IST
(Mon-Fri) 90 Minutes/Session (2 Months)

Part payment option is available. Contact us

Azure Databricks

With Spark

Azure Databricks Certificate Program

Evening: 7:30 PM, IST
(Mon-Fri) 90 Minutes/Session (2 Months)

Part payment option is available. Contact us

Data Engineering + Data Bricks

Full Azure Certified Program

Evening: 7:30 PM, IST
(Mon-Fri) 90 Minutes/Session (4 Months)

Part payment option is available. Contact us

What our students say?

Meghana R
Meghana R
@meghana-r
Read More
I just want to share my experience about Natraj sir training, it is one of the best training I had ever on informatica. I learned lots of real time concepts from Raj sir training and also they are very useful in my job. The training is based on Realtime scenarios so that you will get familiar with the concepts of informatica and Oracle and Unix. Thank you Raj sir for giving us such a nice training and so much of confidence...
Akash Dhus
Akash Dhus
Read More
It's a fantastic course for a beginner also. I could feel the effort that was put into to make sure people understood. Thank you Raj, when I become one the greatest, I will remember this beginning. A wonderful experience . The lecturers are great with a very nice way on interacting and lots of useful material. Thank you for all your cooperation. Hope to see more of you in future. Thank you once again.
Previous
Next

Enroll Now

Book your place for free session of Azure Data Engineering & Databricks Training

Enroll Form (Azure)

Download Course Curriculum (Syllabus)

Contact Form (Azure)
Log in/Sign up with Google account for password less login.

Or

Login with your email & password

Azure Data Engineering & Databricks

New Batch Starting,

17th January 2025

Interested To Join? fill this form below

Contact Form (Azure)

Registred Email:

- Not Updated -

Login to update/set a password or try "Forget Password"