Description
In this course, you will learn:
- How to program Spark using PySpark.
- How to identify the computational tradeoffs in a Spark application.
- How to load and clean data using Spark and Parquet.
- How to model data using statistical and machine learning methods.