Reputation: 269
This is a conceptual question. I come from a Computer Vision background where the Object Detection DNNs are trained using a predefined dataset such as COCO, NYU-D etc and then the DNN is able to predict the results for an input image based on the training.
However, in the case of Deep Reinforcement Learning I am unable to find a dataset that can train Deep RL networks. Rather I find resources that talk about environment for training.
So the questions is whether Deep RL networks are required to be trained using environments only or is it possible to train them similar to Object Detection DNNs i.e by using some sort of dataset ?
Upvotes: 1
Views: 555
Reputation: 96
This is a very common confusion in the AI community. Long story short, reinforcement learning (RL) method requires feedback (reward,state) from environment based on the action determined by RL. dataset is not able to provide that feedback. You can consider RL as a close-loop feedback system, whereas suerpervised learning (DNN) as the open-loop feedforward system.
To help you understand RL better. RL methods learn from the interaction with environment incrementally in the following steps:
I suggest you to briefly read the RL text book from Richard Sutton: Reinforcement Learning: An Introduction. You can download free from here: https://web.stanford.edu/class/psych209/Readings/SuttonBartoIPRLBook2ndEd.pdf
Upvotes: 2