DescriptionGet Hands-on Experience as to how they themselves can become Spark Application Developers. Become masters at working with Spark DataFrames, HiveQL, and Spark SQL. Understand how to control importing and exporting of Data in Spark through Apache Sqoop in the exact format that is needed. Learn all Spark RDDs Transformations and Actions needed to analyze Big Data. Become absolutely ready for the Cloudera Spark CCA 175 Certification Exam.
This course is designed to cover the end-to-end implementation of the major components of Spark. I will be giving you hands on experience and insight into how big data processing works and how it is applied in the real world. We will explore Spark RDDs, which are the most dynamic way of working with your data. They allow you to write powerful code in a matter of minutes and accomplish whatever tasks that might be required of you. They, like DataFrames, leverage the Spark Lazy Evaluation and Directed Acyclic Graphs (DAG) to give you 100x better functionality than MapReduce while writing less than a tenth of the code. You can execute all the Joins, Aggregations,Transformations and even Machine Learning you want on top of Spark RDDs. We will explore these in depth in the course and I will equip you with all the tools necessary to do anything you want with your data.