The upper and lower limits of action space in a custom multi-agent environment

Hello! I found that the current custom environment does not seem to support discrete action space, so I changed the model to a one-dimensional continuous action space. But I found that my definition of action_space does not seem to work.
At the beginning, I noticed that the output of the action always hovered around 0.9 during training, and the value space I defined was [-20, 20]. To verify whether it is a question of randomness, I changed the range to [-0.1, 0.1], but the action of each input step() is still 0.9~1.1.
![Image](https://github.com/user-attachments/assets/bc3c9178-4bd9-4695-a786-fd210c0fd519)
![Image](https://github.com/user-attachments/assets/80b83b5b-0581-4eb8-a10c-901d0928bab1)
I noticed that the value of the upper and lower limits of the action space in "off_policy_marl.py" seems to be wrong, and only returns "NONE". I wonder if this is the root cause that affects the correctness of action space? Or is it that something went wrong when I defined the environment?
![Image](https://github.com/user-attachments/assets/1017ba5d-c0a9-42c8-8ae8-717a1435982f)
I checked the observation_space and state_space, and their values ​​are both normal.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

The upper and lower limits of action space in a custom multi-agent environment #112

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

The upper and lower limits of action space in a custom multi-agent environment #112

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions