This course covers the applied/coding side of algorithmics in machine learning and deep learning, with a smidgen of evolutionary algorithms, focusing on hands-on coding experience in Python.
Syllabus: What is machine learning (ML), Python, applying ML, supervised learning, dimensionality reduction, unsupervised learning, deep networks, evolutionary algorithms
Here you can find short reads on various topics related to this course: machine learning, deep learning, evolutionary algorithms, artificial intelligence, and more.
(Marked entries link to my Colab notebooks)
1: Python, AI+ML Intro
2: ML Intro, Simple Example, KNN, Cross-Validation
3: Scikit-learn, Models, Decision Trees
4: Random Forest, Linear Regression, Logistic Regression
5: AdaBoost, Gradient Boosting
6: XGBoost, Comparing ML algos, Gradient Descent (from sklearn.datasets import load_boston -> racist data destruction?)
7: Choosing Model, SVM, Bayes, Metrics
8: Metrics, PCA, t-SNE, Clustering
9: Hyperparameter Tuning, p-vals, t-test, Permutation Test
10+11: Neural Networks
12+13: Evolutionary Algorithms
K-Nearest Neighbors (KNN)
✔ Simple, No training phase, No assumptions about the data, Easy to implement, New data can be added seamlessly, Only one hyperparameter
✖ Doesn't work well in high dimensions, Sensitive to noisy data and outliers, Doesn't deal well with missing values, Doesn't work well with large data sets (cost of calculating distance is high), Needs feature scaling, Doesn't work well on imbalanced data
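A minimal sketch with scikit-learn (the Iris data and n_neighbors=5 are illustrative choices, not course requirements); the pipeline handles the feature scaling KNN needs, and cross-validation (lecture 2) gives an honest accuracy estimate:
```python
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_iris(return_X_y=True)
# KNN is distance-based, so feature scaling matters: put the scaler in a pipeline
knn = make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5))
print(cross_val_score(knn, X, y, cv=5).mean())  # 5-fold CV accuracy
```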
Decision Tree
✔ Doesn't require standardization or normalization, Easy to implement, Can handle missing values, Automatic feature selection
✖ High variance, Higher training time, Can become complex, Can easily overfit
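For illustration, a small scikit-learn tree (max_depth=3 is an arbitrary example value); capping depth is the usual guard against the overfitting noted above:
```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
# limiting depth keeps the tree simple and reduces overfitting
tree = DecisionTreeClassifier(max_depth=3, random_state=0)
tree.fit(X, y)
print(tree.feature_importances_)  # automatic feature selection as importance scores
```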
Random Forest
✔ Left-out (out-of-bag) data can be used for testing, High accuracy, Provides feature importance estimates, Can handle missing values, Doesn't require feature scaling, Good performance on imbalanced datasets, Can handle large datasets, Outliers have little impact, Less overfitting
✖ Less interpretable, Requires more computational resources, High prediction time
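A sketch of the left-out-data point: with oob_score=True, scikit-learn scores each tree on the bootstrap samples it never saw, giving a built-in test estimate (the hyperparameter values here are illustrative):
```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

X, y = load_iris(return_X_y=True)
forest = RandomForestClassifier(n_estimators=200, oob_score=True, random_state=0)
forest.fit(X, y)
print(forest.oob_score_)            # accuracy on out-of-bag (left-out) data
print(forest.feature_importances_)  # feature importance estimates
```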
Linear Regression
✔ Simple, Interpretable, Easy to Implement
✖ Assumes linear relationship between features, Sensitive to outliers
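A tiny illustration on synthetic data (the true slope 3 and intercept 2 are made up for the example); the fitted coefficients are directly interpretable:
```python
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(100, 1))
y = 3.0 * X[:, 0] + 2.0 + rng.normal(scale=1.0, size=100)  # y = 3x + 2 + noise

model = LinearRegression().fit(X, y)
print(model.coef_, model.intercept_)  # slope ~ 3, intercept ~ 2
```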
Logistic Regression
✔ Doesn’t assume linear relationship between independent and dependent variables, Output can be interpreted as probability, Robust to noise
✖ Requires more data, Only effective when classes are linearly separable
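A sketch of the probability-output point, using scikit-learn's breast-cancer data purely as an example:
```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
clf = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
clf.fit(X, y)
# the output is a class probability, not just a hard label
print(clf.predict_proba(X[:3]))
```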
Lasso Regression (L1)
✔ Prevents overfitting, Selects features by shrinking coefficients to zero
✖ Selected features will be biased, Prediction can be worse than Ridge
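A sketch of feature selection by shrinkage, on synthetic data where only 3 of 10 features are informative (all numbers are illustrative):
```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso

# 10 features, only 3 of which actually carry signal
X, y = make_regression(n_samples=200, n_features=10, n_informative=3,
                       noise=5.0, random_state=0)
lasso = Lasso(alpha=1.0).fit(X, y)
print(np.round(lasso.coef_, 2))  # most coefficients shrink exactly to zero
```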
Ridge Regression (L2)
✔ Prevents overfitting
✖ Increases bias, Less interpretability
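The same setup with an L2 penalty (alpha=10.0 is an arbitrary example value); coefficients shrink toward zero but, unlike Lasso, rarely hit exactly zero:
```python
from sklearn.datasets import make_regression
from sklearn.linear_model import Ridge

X, y = make_regression(n_samples=200, n_features=10, noise=5.0, random_state=0)
# larger alpha -> stronger shrinkage -> more bias, less variance
ridge = Ridge(alpha=10.0).fit(X, y)
print(ridge.coef_)  # shrunk, but generally all nonzero
```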
AdaBoost
✔ Fast, Reduced bias, Little need to tune
✖ Vulnerable to noise, Can overfit
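A sketch of the little-need-to-tune point: scikit-learn's defaults plus an estimator count usually work out of the box (the dataset choice is illustrative):
```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)
# shallow trees boosted sequentially, each focusing on previous mistakes
ada = AdaBoostClassifier(n_estimators=100, random_state=0)
print(cross_val_score(ada, X, y, cv=5).mean())
```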
Gradient Boosting
✔ Good performance
✖ Harder to tune hyperparameters
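A sketch naming the main hyperparameters that make tuning harder (the values shown are common starting points, not recommendations):
```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)
# the main knobs: number of trees, learning rate, and tree depth interact
gb = GradientBoostingClassifier(n_estimators=100, learning_rate=0.1,
                                max_depth=3, random_state=0)
print(cross_val_score(gb, X, y, cv=5).mean())
```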
XGBoost
✔ Less feature engineering required, Outliers have little impact, Can output feature importance, Handles large datasets, Good model performance, Less prone to overfitting
✖ Difficult to interpret, Harder to tune as there are numerous hyperparameters
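A sketch assuming the separate xgboost package is installed; it exposes a scikit-learn-compatible API, and only three of its numerous hyperparameters are shown:
```python
# requires: pip install xgboost
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import cross_val_score
from xgboost import XGBClassifier

X, y = load_breast_cancer(return_X_y=True)
# a tiny subset of XGBoost's many hyperparameters
xgb = XGBClassifier(n_estimators=200, learning_rate=0.1, max_depth=4)
print(cross_val_score(xgb, X, y, cv=5).mean())
```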
SVM
✔ Performs well in higher dimensions, Excellent when classes are separable, Outliers have less impact
✖ Slow, Poor performance with overlapping classes, Selecting appropriate kernel functions can be tricky
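A sketch of the kernel choice: RBF is the common default, and scaling matters for SVMs (the parameter values are illustrative):
```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = load_breast_cancer(return_X_y=True)
# kernel and C are the tricky choices; RBF with C=1.0 is a common baseline
svm = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0))
print(cross_val_score(svm, X, y, cv=5).mean())
```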
Naïve Bayes
✔ Fast, Simple, Requires less training data, Scalable, Insensitive to irrelevant features, Good performance with high-dimensional data
✖ Assumes independence of features
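A sketch of the fast-and-simple point: GaussianNB has essentially nothing to tune (the dataset choice is illustrative):
```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB

X, y = load_breast_cancer(return_X_y=True)
# no hyperparameters to tune; fast even with many features
nb = GaussianNB()
print(cross_val_score(nb, X, y, cv=5).mean())
```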
Deep Learning
✔ Superb performance with unstructured data (images, video, audio, text)
✖ (Very) long training time, Many hyperparameters, Prone to overfitting
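For a first taste without a GPU framework, a small multilayer perceptron via scikit-learn (layer sizes and iteration count are illustrative; serious deep learning would use a dedicated framework, as covered in lectures 10+11):
```python
from sklearn.datasets import load_digits
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_digits(return_X_y=True)
# two hidden layers; note how many knobs there already are at this tiny scale
mlp = make_pipeline(StandardScaler(),
                    MLPClassifier(hidden_layer_sizes=(64, 32), max_iter=500,
                                  random_state=0))
print(cross_val_score(mlp, X, y, cv=3).mean())
```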
Resources: Evolutionary Algorithms, Machine Learning, Deep Learning
Vids
Basic Reads
Advanced Reads
Books (🡇 means free to download)
Software
Datasets