Instructors: Sourish Das, Madhavan Mukund
Evaluation:
Assignments TBA, midsemester exam TBA, final exam TBA
Copying is fatal
Text and reference books:
(To be expanded and updated. Highly subject to change.)
Lecture 1, 13 Aug 2019: Class notes
Overview of PAC learning, capacity, underfitting, overfitting, bias, variance
Lecture 2, 20 Aug 2019: Class notes
Feed-forward networks: nonlinearity, XOR example, cost functions, MLE and cross-entropy
Lecture 3, 22 Aug 2019: Class notes
Feed-forward networks: output layer activation functions; hidden layer activation functions; architecture; backpropagation and stochastic gradient descent; optimization – momentum; adaptive learning rates – delta-bar-delta, AdaGrad, RMSprop, Adam; initializing neural networks
Lecture 4, 27 Aug 2019: Class notes
Convolutional neural networks: meaning of convolution; pooling; sharing of parameters; padding;
Lecture 5, 29 Aug 2019: Class notes
Recurrent neural networks: processing sequential data, unfolding computational graphs, universality of RNNs, teacher forcing for output feedback, backpropagation through time, probabilistic graphical models, RNNs as directed graphical models
Lecture 6, 3 Sep 2019: Class notes
Recurrent neural networks: Providing context in RNNs; bidirectional RNNs; encoder-decoder architectures, long term dependencies – skip connections and leaky units; LSTM
Lecture 7, 5 Sep 2019: Madhavan's notes (use at your own risk)
Natural exponential families, Logistic regression, Poisson regression, Generalized Linear Models, Gaussian Process Regression
Reference: Gaussian Processes for Machine Learning, Carl Edward Rasmussen and Christopher K. I. Williams, Chatper 2: Regression
Lecture 8, 10 Sep 2019: Madhavan's notes (use at your own risk)
CNNs and RNNs in Tensorflow: Code for examples shown in class
Bayesian Optimization, Gaussian Process Regression
GPR using scikit-learn: Code for Gaussian Process Regression
Lecture 9, 17 Sep 2019: Madhavan's notes (use at your own risk)
GPR: Rasmussens' formulation (from Chapter 2 of the RW book)
Gaussian Process Regression example using scikit-learn, Google Colab notebook
GPR with large data sets through sampling
Lecture 10, 19 Sep 2019:
Bayesian Optimization
Reference: A Tutorial on Bayesian Optimization by Peter I. Frazier
Lecture 11, 01 Oct 2019: Madhavan's notes (use at your own risk)
Bayesian Optimization
Reference: A Tutorial on Bayesian Optimization by Peter I. Frazier
Lecture 12, 03 Oct 2019: Madhavan's notes (use at your own risk)
Autoencoders
Lecture 13, 10 Oct 2019:
Coding Bayesian optimization
Lecture 14, 15 Oct 2019: Class notes
Reinforcement learning: Multi-armed bandits
Lecture 15, 17 Oct 2019: Class notes
Reinforcement learning: Markov Decision Processes
Lecture 16, 22 Oct 2019: Class notes
Reinforcement learning: Dynamic programming — policy evaluation, policy iteration, value iteration
Lecture 17, 24 Oct 2019: Class notes
Reinforcement learning: Monte Carlo Methods
Lecture 18, 29 Oct 2019: Class notes
Reinforcement learning: Monte Carlo Methods – off-policy prediction; Temporal-Difference Learning – TD Prediction
Lecture 19, 5 Nov 2019: Class notes
Reinforcement learning: Temporal-Difference Learning – TD(0), SARSA, Q Learning, Expected Sarsa, Double Learning
Lecture 20, 7 Nov 2019: Class notes
Reinforcement learning: Temporal-Difference Learning – n-step Bootstrapping: TD(n), n-step SARSA, n-step off-policy learning, n-step tree backup algorithm