Teaching assistants: Deepanshi, Devesh Bajaj, Srishti Lakhotia
Evaluation:
Assignments 30-40%, quizzes and midsemester exam 20-30%, final exam 40%
Copying is fatal
Course outline (tentative)
Supervised learning: Association rules, regression, decision trees, naive Bayes, classifier evaluation, expectation maximization, ensemble classifiers, SVM, neural networks.
Unsupervised learning: Clustering, outlier detection, dimensionality reduction.
Other topics (if time permits): Probabilistic graphical models, Bayesian networks, Markov models, information retrieval, ranking and social choice, …
Text and reference books:
Web Data Mining by Bing Liu, 2nd edition, Springer (2011).
Foundations of Data Science by Avrim Blum, John Hopcroft and Ravi Kannan
Machine Learning by Tom Mitchell.
C4.5: Programs for Machine Learning by Ross Quinlan.
Artificial Intelligence: A Modern Approach by Stuart J Russell and Peter Norvig, 3rd edition, Pearson (2016).
Hands-On Machine Learning with Scikit-Learn, Keras and Tensorflow by Aurélien Géron, 3rd edition, O'Reilly (2022)
Lecture 1: 6 Jan 2026
(Class Notes (pdf))
Introduction to supervised and unsupervised learning
Lecture 2: 8 Jan 2026
(Class Notes (pdf))
Market-basket analysis, frequent itemsets, Apriori algorithm
Lecture 3: 13 Jan 2026
(Class Notes (pdf),
Lecture Slides (pdf))
Association rules, class association rules
Supervised learning, decision trees, impurity
Lecture 4: 20 Jan 2026
(Class Notes (pdf),
Lecture Slides (pdf))
Decision trees: impurity measures (entropy, Gini index), information gain ratio, handling numeric attributes
Evaluating classifiers: training/test sets, cross validation
Lecture 5: 22 Jan 2025
(Class Notes (pdf),
Lecture Slides (pdf))
Evaluating classifiers: confusion matrix
Decision Trees in Python
Lecture 6: 27 Jan 2026
(Class Notes (pdf),
Lecture Slides (pdf))
Linear Regression: loss functions, normal equation, gradient descent
Lecture 7: 29 Jan 2026
(Class Notes (pdf),
Slides (pdf))
Linear Regression: probabilistic justification for SSE loss
Polynomial regression, regularization, the non-polynomial case
Lecture 8: 3 Feb 2026
(Class Notes (pdf),
Slides (pdf))
Regression using decision trees
Handling overfitting in decision trees
Regression for Classification
Lecture 9: 5 Feb 2026
Linear, polynomial and logistic regression in Python
Regression Trees in Python
Naïve Bayesian classifiers
(Class Notes (pdf), Slides (pdf))Lecture 10: 12 Feb 2026
(Class Notes (pdf),
Slides (pdf))
Naïve Bayes text classification
Ensemble Classifiers: Bagging
Lecture 11: 17 Feb 2026
(Class Notes (pdf),
Slides (pdf))
Ensemble Classifiers: Boosting