Description
In this course, you will :
- Introduces the exciting world of Big Data, as well as various concepts and frameworks for Big Data processing You will understand why Apache Spark is regarded as the best BigData framework.
- Discover Spark SQL, a Spark module for structured data processing.
- How Spark SQL enables the use of DataFrames in Python.
- Discover key Machine Learning algorithms.
Syllabus :
- Introduction to Big Data analysis with Spark
- Programming in PySpark RDD’s
- PySpark SQL & DataFrames
- Machine Learning with PySpark MLlib