Description
In this course, you will :
- Use the collaborative Databricks workspace and write SQL code that executes against a cluster of machines.
- Use Spark UI to analyze performance and identify bottlenecks.
- Create an end-to-end pipeline that reads data, transforms it, and saves the result.
- Build a linear regression model and make predictions using SparkSQL.
Syllabus :
- Introduction to Spark
- Spark Core Concepts
- Engineering Data Pipelines
- Data Lakes, Warehouses and Lakehouses