feat: enable GRPO training with logprobs from offline trajectory data#467
Draft
JRMeyer wants to merge 19 commits intoOpenPipe:mainfrom
Draft
feat: enable GRPO training with logprobs from offline trajectory data#467JRMeyer wants to merge 19 commits intoOpenPipe:mainfrom
JRMeyer wants to merge 19 commits intoOpenPipe:mainfrom
Commits
Commits on Dec 5, 2025
- committed
- andcommitted
- committed
- committed
- committed
- andcommitted
- andcommitted
- andcommitted
Commits on Dec 6, 2025
Commits on Dec 7, 2025
Commits on Dec 9, 2025
Commits on Dec 10, 2025
- committed
- committed
- andcommitted
- andcommitted
- committed