SafeRL-Gym

A repository for SafeRL experiments on text-based environments.

Overview

SafeRL-Gym provides a framework for training reinforcement learning agents in text-based environments. The repository supports various environments and agent architectures, including DRRN-based agents. The primary entry point for training is the src/train.py script.

Installation

Requirements

Ensure you have Python installed. Recommended version: Python 3.11

Install dependencies:

pip install -r requirements.txt

Usage

Training an Agent

To start training an agent in the Machiavelli environment, run:

python -m src.train --env Machiavelli --agent_type PPO --game i-cyborg

Evaluating the Agent

To evaluate the agent after training run:

python -m src.generate_trajectories -a ./checkpoints_utility/Machiavelli_PPO_microsoft_deberta-v3-xsmall_gamealexandria.pt -t ./trajectories
python -m src.evaluate_trajectories -t ./trajectories -r ./results.json

Note: Ensure to download the Machiavelli game data. Here are the steps:

The data is available through Google Drive.
The password is machiavelli.
Place the data at the top-level of this repo as ./game_data/. (You should now have a folder structure as described in Repo structure.)

Model & Environment Support

Supported Agents

DRRN: Deep Reinforcement Relevance Network (DRRN) based agent.
PPO: PPO based agent.
Random: Random action agent for baseline comparison.

Supported Environments

MachiavelliEnv: Custom environment with a focus on ethical reinforcement learning.
BaseEnv: Generic environment for customizable experiments.

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
evaluated_traj_data		evaluated_traj_data
src		src
tests/agent		tests/agent
.gitignore		.gitignore
README.md		README.md
diff.diff		diff.diff
drrn.err		drrn.err
drrn.out		drrn.out
ppo.err		ppo.err
ppo.out		ppo.out
ppo_llm.err		ppo_llm.err
ppo_llm.out		ppo_llm.out
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SafeRL-Gym

Overview

Installation

Requirements

Usage

Training an Agent

Evaluating the Agent

Model & Environment Support

Supported Agents

Supported Environments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 6

Uh oh!

Languages

harshraj172/SafeRL-Gym

Folders and files

Latest commit

History

Repository files navigation

SafeRL-Gym

Overview

Installation

Requirements

Usage

Training an Agent

Evaluating the Agent

Model & Environment Support

Supported Agents

Supported Environments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 6

Uh oh!

Languages

Packages