Table of Contents
1 Descriptive vs Inferential Statistics
2 Terms
3 Data Exploration
3.1 Univariate
3.2 Bivariate
3.3 Multivariate
1 Supervised
1.1 Regression
(Linear Regression, Polynomial Regression)
1.2 Classification
(Logistic Regression, KNN, Decision Trees, Random Forest, etc.
2 Unsupervised
2.1 Clustering
(K means)
2.2 Association
2.3 Dimensionality reduction
1 Understand Objective
2 Data Collection
3 Data Preparation
(Data Cleaning, EDA, Feature Engineering, Feature Selection, Train/Validation/Test Split)
4 Modeling
5 Evaluation
5.1 Classification
(Confusion Matrix, AUC-ROC, Lift Charts, Gain Chart, KS Statistic, F1 Score, etc.)
5.2 Regression
(R-squared (R2), RMSE, RSE, MAE, RAE)
6 Model Deployment
1 Educational
1.1 Classification: Titanic Survival Prediction
1.2 Regression: Boston Housing Prediction
2 Business
1.1 Click Through Rate Prediction
1.2 Fraud Detection
6 Components of Data Science Syllabus
1. PROGRAMMING
Start with most in demand programming languages/ tools.
2. STATISTICS
Statistics foundation is an import part of data science field. Cover all the important statistics topics required for data science.
3. MODELING
Most interesting part of Data Science is training models. Get in depth understanding of different models by examples, videos and code.
4. PROJECT LIFE CYCLE
6 steps approach of Data Science Project Lifecycle is followed in any end-to-end Data Science project.
5. CASE STUDIES
Curated hands-on case studies with all the required explanation. Educational case studies are best for beginners and explore data science field. Business case studies contains capstone projects used in actual companies, ranging from retail, finance, healthcare domains,…
6. INTERVIEWS
The last part of the puzzle is Interview Preparation, to be able to present and explain everything one has learned so far. This section will act as a guide and assist you in interview preparation.