Course Information
Identification

CS 747: Foundations of Intelligent and Learning Agents
 
Description

Agency, intelligence, and learning
Exploration and multi-armed bandits
Markov Decision Problems and planning
Reinforcement learning
Search
Multi-agent systems and multi-agent learning
Case studies
 
References

Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, MIT Press, 1998. [Chapters 1, 2, 3, 4, 6, 8, and 9]
Dynamic Programming and Optimal Control, Volume II, Dimitri P. Bertsekas, 4th edition, Athena Scientific, 2012. [Chapter 2]
Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems, Sébastien Bubeck and Nicolò Cesa-Bianchi, Foundations and Trends in Machine Learning, Volume 5, Number 1, 2012. [Chapters 2 and 3]
Selected research papers
 
Home Page

http://www.cse.iitb.ac.in/~shivaram/teaching/cs747-a2018/
 
Prerequisites

N/A
 
Other Details

Duration: Full Semester
Total Credits: 6
Type: Theory
 
Autumn Semester 2019-20

Status: Offered
Instructor: Prof. Shivaram Kalyanakrishnan
 
Spring Semester 2019-20

Status: Not Offered
Instructor: ---




Last Modified Date: 15-Jul-2013
