Lectures and Tutorials: Video lectures will be uploaded each week. There will be one live online class discussion each week, details to be announced.
Online discussion forum: Please sign up for the course on Piazza
Teaching assistants: Debjit Paria, Samarth Ramesh
Evaluation:
15% for quizzes on Moodle
35% for assignments
50% for final exam
Copying is fatal
Text and reference books:
Web Data Mining by Bing Liu.
Foundations of Data Science by Avrim Blum, John Hopcroft and Ravi Kannan
Machine Learning by Tom Mitchell.
C4.5: Programs for Machine Learning by Ross Quinlan.
Artificial Intelligence: A Modern Approach (3rd ed) by Stuart J Russell and Peter Norvig.
Hands-On Machine Learning with Scikit-Learn, Keras and Tensorflow (2nd ed) by Aurélien Géron
Here is a tentative list of topics.
Supervised learning: Frequent itemsets, association rules, regression, decision trees, naive Bayes, classifier evaluation, PAC learning, VC dimension, ensemble classifiers, expectation maximization, semi-supervised learning, linear classifiers, perceptrons, SVM, kernel methods, neural networks.
Unsupervised learning: Clustering, outlier detection, PCA, dimensionality reduction.
Other topics: Probabilistic graphical models, Bayesian networks, hidden Markov models, …
Lecture 5: Supervised Learning (Slides), (Video)
Lecture 6: Decision Trees (Slides), (Video)
Lecture 7: Impurity Measures for Decision Trees (Slides), (Video)
Lecture 8: Handling Numeric Attributes (Slides), (Video)
Lecture 9: Evaluating Classifiers (Slides), (Video)
Live session: 28 August 2020 (Video)
Lecture 10: Linear Regression (Slides), (Video)
Lecture 11: Regression, the non-linear case (Slides), (Video)
Lecture 12: Regression for Classification (Slides), (Video)
Lecture 13: Regression using decision trees (Slides), (Video)
Lecture 14: Handling overfitting in decision trees (Slides), (Video)
Live session: 12 September 2020 (Video)
Lecture 15: Naïve Bayes classifiers (Slides), (Video)
Lecture 16: Naïve Bayes text classification (Slides), (Video)
Live session: 18 September 2020 (Video)
Lecture 17: PAC Learning (Slides), (Video)
Live session: 25 September 2020 (Video)
Lecture 18: VC-Dimension (Slides), (Video)
Lecture 19: Shatter functions (Slides), (Video)
Lecture 20: Ensemble Classifiers — Bagging (Slides), (Video)
Lecture 21: Ensemble Classifiers — Boosting (Slides), (Video)
Live session: 16 October 2020 (Video)
Lecture 22: Expectation Maximization (Slides), (Video)
Lecture 23: Semi-supervised Learning (Slides), (Video)
Lecture 24: Theoretical foundations of EM (Slides), (Video)
Live class, October 30: Latent Dirichlet Allocation (Class Notes), (Video)
Live class, November 09: Clustering (Class Notes), (Video)
Live class, November 13: Outlier Detection, Applications of Unsupervised Learning, Dimensionality Reduction (Class Notes), (Video)
Live class, November 16: Linear Separators — Perceptrons, SVMs (Class Notes), (Video)
Live class, November 20: Kernel Methods, Neural Networks (Class Notes — Kernel Methods, Neural Networks ) (Video)
Kernel methods
Neural networks: Multilayer perceptrons, sigmoid neurons, network architecture, universality
Live class, November 23: Neural Networks: Learning parameters (Class notes ) (Video)
Neural networks: Backpropagation, cost functions
Live class, November 27: Probabilistic Graphical Models (Class notes ) (Video)
Bayesian networks: basic definitions, semantics, exact inference
Bayesian networks: Conditional independence, D-separation
Live class, November 30: Hidden Markov Models (Class notes ) (Video)
Hidden Markov Models: Filtering, prediction, smoothing, most likely explanation; Dynamic Bayesian Networks