Madhavan Mukund



Data Mining and Machine Learning,
Jan-Apr

Data Mining and Machine Learning

Jan-Apr, 2020


Administrative details

  • Teaching assistants: Debjit Paria, Kapil Pause

  • Evaluation:

    • Assignments 40%, midsemester exam 20%, final exam 40%

    • Copying is fatal

  • Text and reference books:

    • Web Data Mining by Bing Liu.

    • Foundations of Data Science by Avrim Blum, John Hopcroft and Ravi Kannan

    • Machine Learning by Tom Mitchell.

    • C4.5: Programs for Machine Learning by Ross Quinlan.

    • Artificial Intelligence: A Modern Approach by Stuart J Russell and Peter Norvig.

    • An Introduction to Information Retrieval by Christopher D Manning, Prabhakar Raghavan and Hinrich Schütze

    • Hands-On Machine Learning with Scikit-Learn, Keras and Tensorflow by Aurélien Géron, O'Reilly, 2nd edition (2019)


Course plan

Here is a tentative list of topics.

  • Supervised learning: Frequent itemsets, association rules, regression, decision trees, naive Bayes, SVM, classifier evaluation, expectation maximization, ensemble classifiers.

  • Unsupervised learning: Clustering, outlier detection.

  • Text mining: Basic ideas from information retrieval, TF/IDF model, Page Rank, HITS

  • Other topics: Probabilistic graphical models, Bayesian networks, Markov models, neural networks, ranking and social choice, …



Lecture summary