Open
Labels
question (Further information is requested)
Description
❓ Question
I want to train a DQN agent for 10M timesteps. Because of the time limit on the cluster I'm using, I have to continue training by loading the last saved model. I tried to follow the documentation (see the command below), but it seems the model is not loaded properly: the last saved model achieves a score of 20, but when I continue training it starts at a score of -21, and other parameters (e.g. the exploration rate) do not continue annealing from where they left off.
I just wanted to make sure that I'm not missing something. Is this the right way to do it?
python train.py --algo dqn --env PongNoFrameskip-v4 -i agents/dqn/PongNoFrameskipv4_1/rl_model_60000_steps.zip -n 1000000
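To make the exploration-rate symptom concrete, here is a minimal sketch of what I expected versus what I seem to observe. It assumes a simple linear epsilon schedule; the function name `linear_epsilon` and the values for `anneal_steps`, `eps_start` and `eps_end` are illustrative assumptions of mine, not taken from SB3 or the RL Zoo hyperparameters. Only the 60000-step count comes from the checkpoint name in the command above.

```python
def linear_epsilon(step, anneal_steps=100_000, eps_start=1.0, eps_end=0.01):
    """Linearly anneal epsilon from eps_start to eps_end over anneal_steps.

    All default values are hypothetical and chosen only for illustration.
    """
    fraction = min(step / anneal_steps, 1.0)
    return eps_start + fraction * (eps_end - eps_start)


saved_step = 60_000  # step count of the checkpoint in the command above

# What I expected: the step counter is carried over, so annealing continues.
eps_resumed = linear_epsilon(saved_step)

# What it looks like: the schedule behaves as if the counter restarted at zero.
eps_restarted = linear_epsilon(0)

print(f"resumed: {eps_resumed:.3f}, restarted: {eps_restarted:.3f}")
```

If the schedule really does restart like this, the agent would act almost entirely at random at the beginning of the continued run, which might explain the -21 score even if the weights were loaded correctly. I have not confirmed that this is what happens.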
Checklist
- I have checked that there is no similar issue in the repo
- I have read the SB3 documentation
- I have read the RL Zoo documentation
- If code there is, it is minimal and working
- If code there is, it is formatted using the markdown code blocks for both code and stack traces.