Madhavan Mukund



Data Mining and Machine Learning,
Jan-Apr 2023


Administrative details

  • Instructors: Pranabendu Misra, Madhavan Mukund

  • Teaching assistants: Shourjya Basu, Sampad Kumar Kar, Shankar Ram V

  • Evaluation:

    • Assignments 30-40%, quizzes and midsemester exam 20-30%, final exam 40%

    • Copying is fatal

  • Text and reference books:

    • Web Data Mining by Bing Liu.

    • Foundations of Data Science by Avrim Blum, John Hopcroft and Ravi Kannan.

    • Machine Learning by Tom Mitchell.

    • C4.5: Programs for Machine Learning by Ross Quinlan.

    • Artificial Intelligence: A Modern Approach by Stuart J Russell and Peter Norvig.

    • Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by Aurélien Géron, O'Reilly, 3rd edition (2022).


Course plan

Here is a tentative list of topics.

  • Supervised learning: Association rules, regression, decision trees, naive Bayes, SVM, classifier evaluation, expectation maximization, ensemble classifiers.

  • Unsupervised learning: Clustering, outlier detection, dimensionality reduction.

  • Text mining: Basic ideas from information retrieval, TF-IDF model, PageRank, HITS.

  • Other topics (if time permits): Probabilistic graphical models, Bayesian networks, Markov models, neural networks, ranking and social choice, …


Assignments


Lecture summary