Introduction to Data Mining #
Chapter Flow #
The flow of topics that we cover:
graph TB; A(What is Data Mining) --> B(Way of Working in AIML) B --> C(Data Pipelines) C --> D{Hands-on} D -->|Use Case| E(Uber ML Architecture) D -->|Your Development Environment| F(Colaboratory)
After reading this chapter you should feel familiar with the following ideas and concepts:
- The wider scope of AI and differences between data mining / machine learning and AI.
- The data science ecosystem.
- The functional decomposition of the four pipelines needed to power a data intensive business / application.
- The complexity behind a real world data pipeline and its tradeoffs (CAP theorem)
- The Python language basic APIs in data science projects and a tutorial path for you to dig deeper.