Description
In this course, you will learn :
- About ETL pipelines and how to use them to process and combine data from CSV, JSON, logs, APIs, and databases.
- Tokenize, lemmatize, and remove stop words from text data before analysing it. Using bag of words and tf-idf, transform and vectorize text data and build features with scikit-learn.
- Learn about the benefits of using machine learning pipelines to speed up the data preparation and modelling process.
- Learn how to break down data silos by gathering information from multiple sources and transforming it into usable forms for analysts and top-level decision makers.
Syllabus :
- ETL Pipelines
- Natural Language Processing
- Machine Learning Pipelines