Description
In this course, you will :
-
Understand Apache Spark’s framework, execution and programming model for the development of Big Data Systems
-
Learn how to work with a free Cloud-based and a Desktop machine for Spark setup and configuration
-
Build Advanced Big Data applications for different types of data (volume, variety, veracity) through real case studies
-
Learn Advanced hands-on PySpark practices on structured, unstructured and semi-structured data using RDD, DataFrame and SQL
-
Investigate and optimize data skewness to tune spark performance