Teaching assistants: Avirup Das, Ameya Kamat, Anuja Pal, Tanuj Sur
Lectures and Tutorials: Two live online classes each week. Zoom link on Moodle page. Tutorials as needed.
Evaluation:
Assignments 30-40%, quizzes and midsemester exam 20-30%, final exam 40%
Copying is fatal
Text and reference books:
Web Data Mining by Bing Liu.
Foundations of Data Science by Avrim Blum, John Hopcroft and Ravi Kannan
Machine Learning by Tom Mitchell.
C4.5: Programs for Machine Learning by Ross Quinlan.
Artificial Intelligence: A Modern Approach by Stuart J Russell and Peter Norvig.
Hands-On Machine Learning with Scikit-Learn, Keras and Tensorflow by Aurélien Géron, O'Reilly, 2nd edition (2019)
Here is a tentative list of topics.
Supervised learning: Association rules, regression, decision trees, naive Bayes, SVM, classifier evaluation, expectation maximization, ensemble classifiers.
Unsupervised learning: Clustering, outlier detection, dimensionality reduction.
Text mining: Basic ideas from information retrieval, TF/IDF model, Page Rank, HITS
Other topics (if time permits): Probabilistic graphical models, Bayesian networks, Markov models, neural networks, ranking and social choice, …
Lecture 1: 24 Jan 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Introduction, market-basket analysis, frequent itemsets
Lecture 2: 27 Jan 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Apriori algorithm, association rules, class association rules
Lecture 3: 31 Jan 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Supervised learning, decision trees, impurity measures (entropy, Gini index)
Lecture 4: 03 Feb 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Decision trees: information gain ratio, handling numeric attributes
Evaluating classifiers: training/test sets, confusion matrix
Lecture 5: 07 Feb 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Decision Trees in Python
Linear Regression: loss functions, normal equation, gradient descent
Lecture 6: 10 Feb 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Linear Regression: gradient descent, probabilistic justification for SSE loss
Polynomial regression, regularization
Lecture 7: 14 Feb 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Regression: the non-polynomial case
Regression for Classification
Regression using decision trees
Lecture 8: 17 Feb 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Linear, polynomial and logistic regression in Python
Regression trees in Python
Handling overfitting in decision trees
Lecture 9: 21 Feb 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Naïve Bayesian classifiers
Naïve Bayes text classification
Lecture 10: 24 Feb 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Ensemble Classifiers: Bagging
Lecture 11: 28 Feb 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Ensemble Classifiers: Boosting
Lecture 12: 03 Mar 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Ensemble Classifiers: Gradient Boosting
Lecture 13: 07 Mar 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Unsupervised learning: Clustering — K-Means, Hierarchical, Density-based
Unsupervised learning: Local density based outlier detection
Lecture 14: 10 Mar 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Applications of unsupervised learning: semi-supervised learning, image segmentation
Lecture 15: 21 Mar 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Dimensionality reduction: PCA, manifold learning, locally linear embeddings
Lecture 16: 24 Mar 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Expectation maximization and applications
Lecture 17: 31 Mar 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Linear Separators — Perceptrons, SVMs
Lecture 18: 4 Apr 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Kernel methods
Lecture 19: 7 Apr 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Neural networks: Multilayer perceptrons, sigmoid neurons, network architecture, universality
Lecture 20: 11 Apr 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Neural networks: Backpropagation, cost functions
Lecture 21: 18 Apr 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Bayesian networks: basic definitions, semantics, exact inference
Lecture 22: 21 Apr 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Bayesian networks: Conditional independence, D-separation
Lecture 23: 28 Apr 2022
(Class Notes (pdf),
Slides (pdf),
Lecture video)
Bayesian networks: approximate inference, sampling
Introduction to Markov chains
Lecture 24: 02 May 2022
(Class Notes (pdf),
Lecture video)
Markov Chain Monte Carlo: Metropolis-Hastings, Gibbs Sampling