DisCor in PyTorch

This is a PyTorch implementation of DisCor[1] and Soft Actor-Critic[2,3]. I tried to make it easy for readers to understand the algorithm. Please let me know if you have any questions.

Setup

If you are using Anaconda, first create the virtual environment.

conda create -n discor python=3.8 -y
conda activate discor

Then, you need to setup a MuJoCo license for your computer. Please follow the instruction in mujoco-py for help.

Finally, you can install Python liblaries using pip.

pip install --upgrade pip
pip install -r requirements.txt

If you're using other than CUDA 10.2, you need to install PyTorch for the proper version of CUDA. See instructions for more details.

Example

MetaWorld

First, I trained DisCor and SAC on hammer-v1 from MetaWorld tasks as below. Following the DisCor paper, I visualized success rate in addition to test return. These graphs correspond to Figure 7 and 16 in the paper.

python train.py --cuda --env_id hammer-v1 --config config/metaworld.yaml --num_steps 2000000 --algo discor

Gym

I trained DisCor and SAC on Walker2d-v2 from Gym tasks as below. A graph corresponds to Figure 17 in the paper.

python train.py --cuda --env_id Walker2d-v2 --config config/mujoco.yaml --algo discor

References

[1] Kumar, Aviral, Abhishek Gupta, and Sergey Levine. "Discor: Corrective feedback in reinforcement learning via distribution correction." arXiv preprint arXiv:2003.07305 (2020).

[2] Haarnoja, Tuomas, et al. "Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor." arXiv preprint arXiv:1801.01290 (2018).

[3] Haarnoja, Tuomas, et al. "Soft actor-critic algorithms and applications." arXiv preprint arXiv:1812.05905 (2018).

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
config		config
discor		discor
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DisCor in PyTorch

Setup

Example

MetaWorld

Gym

References

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

toshikwa/discor.pytorch

Folders and files

Latest commit

History

Repository files navigation

DisCor in PyTorch

Setup

Example

MetaWorld

Gym

References

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages