feat: enable GRPO training with logprobs from offline trajectory data #811

Triggered via pull request November 29, 2025 02:37
Status Success
Total duration 1m 51s

ruff.yml

on: pull_request