Course Overview
This lecture introduces what data science is and outlines the course syllabus.
The course covers the following topics:
- Pandas and NumPy
- Relational Databases & SQL
- Exploratory Data Analysis
- Regular Expressions
- Visualization
  - matplotlib
  - Seaborn
  - plotly
- Sampling
- Probability and random variables
- Model design and loss formulation
- Linear Regression
- Feature Engineering
- Regularization, Bias-Variance Tradeoff, Cross-Validation
- Gradient Descent
- Logistic Regression
- Decision Trees and Random Forests
- PCA
The lecture then briefly walks through an example data analysis.