Academic Curriculum Subject Details

Academic Programmes
Academic Calendar
Time Table
Exam Schedules
Notices/Circulars
Convocation
Rules & Regulations
Faculty / Student Portals
Holiday List
Contact Us

Course	Postgraduate
Semester	Electives
Subject Code	MA867
Subject Title	Reinforcement Learning

Syllabus

The reinforcement learning problem; tabular & approximate solution methods: dynamic programming, Monto-Carlo Methods, temporal difference learning, eligibility traces; planning and learning; dimensions of reinforcement learning.

Text Books

Same as Reference

References

Sutton R. S. and Barto, A. G., Reinforcement Learning: An Introduction, The MIT Press(2017).
Tesauro G., Temporal Difference Learning and TD-Gammon, Communications of the Association for Computing Machinery (1995).