feat: support discrete environment #286
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## main #286 +/- ##
==========================================
- Coverage 96.84% 96.70% -0.14%
==========================================
Files 138 140 +2
Lines 7000 7130 +130
==========================================
+ Hits 6779 6895 +116
- Misses 221 235 +14
Hi! I’m currently working on a safe RL project with a discrete action environment. I came across this PR and it looks like exactly what I need. I noticed it’s been open for a while. Is there any plan to merge it, or is something blocking progress?
Description
This pull request adds support for environments with discrete action spaces and discrete observation spaces. The implementation covers the Taxi-v3 and CartPole-v1 environments in Gymnasium. The relevant documentation, code checks, and standards have been updated. Currently, it supports the following on-policy algorithms:
to run in discrete environments.
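
For readers unfamiliar with what discrete support usually involves, here is a minimal, library-free sketch. It is not taken from this PR. It assumes the common approach of one-hot encoding a discrete observation and sampling actions from a categorical (softmax) distribution over logits. The helper names `one_hot`, `softmax`, and `sample_action` are hypothetical. The sizes used in the usage lines match Taxi-v3 in Gymnasium (500 discrete states, 6 discrete actions).

```python
import math
import random
from typing import List, Tuple


def one_hot(index: int, size: int) -> List[float]:
    """Encode a discrete observation id as a one-hot vector (assumed encoding)."""
    if not 0 <= index < size:
        raise ValueError(f"index {index} is outside [0, {size})")
    vec = [0.0] * size
    vec[index] = 1.0
    return vec


def softmax(logits: List[float]) -> List[float]:
    """Turn raw logits into action probabilities, shifting by the max for stability."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(logits: List[float], rng: random.Random) -> Tuple[int, float]:
    """Sample a discrete action and return it with its log-probability."""
    probs = softmax(logits)
    action = rng.choices(range(len(probs)), weights=probs, k=1)[0]
    # log-softmax computed directly from the logits to avoid log(0)
    m = max(logits)
    log_norm = m + math.log(sum(math.exp(x - m) for x in logits))
    return action, logits[action] - log_norm


# Usage with Taxi-v3-sized spaces: 500 states, 6 actions.
rng = random.Random(0)
obs_vec = one_hot(42, 500)              # network input for state id 42
action, log_prob = sample_action([0.0] * 6, rng)  # uniform logits
```

An on-policy algorithm would typically store `log_prob` alongside the transition so that the policy-gradient or ratio term can be computed later. How this PR wires that into its actor classes is not shown here.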

The performance curve below shows the correctness of our implementation.
Motivation and Context
Resolves #273 and #283.
Types of changes
What types of changes does your code introduce? Put an x in all the boxes that apply:
Checklist
Go over all the following points, and put an x in all the boxes that apply. If you are unsure about any of these, don't hesitate to ask. We are here to help!
make format. (required)
make lint. (required)
make test pass. (required)