Skip to content

Unified implementation of backdoor attacks in Reinforcement Learning

License

Notifications You must be signed in to change notification settings

EthanRath/Backdoors-In-RL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Backdoors-In-RL

Unified implementation of backdoor attacks in Reinforcement Learning

Attacks Implemented

Works by us

"Sleepernets: Universal backdoor poisoning attacks against reinforcement learning agents" https://arxiv.org/abs/2405.20539

Q-Incept i.e. "Adversarial Inception Backdoor Attacks against Reinforcement Learning" https://arxiv.org/abs/2410.13995v1

Other Works

BadRL https://arxiv.org/abs/2312.12585

TrojDRL https://arxiv.org/abs/1903.06638

Setup

First install requirements for cleanrl atari, box2d, and mujoco https://docs.cleanrl.dev/

Ensure you're using the version with gymnasium==0.28.1

Install:

Running the Code

python ppo.py --attack_name <attack_name> --env_id <env_id>

Optionally include --track if you wish to track results with weights and biases.

See and change attack and env parameters in configs/attacks.yaml and configs/envs.yaml respectively.

Export results to CSV with python write_csv.py

About

Unified implementation of backdoor attacks in Reinforcement Learning

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages