Align terminal reward with the last trainable token and add ALFWorld Evaluation #50
Annotations
3 errors
codespell (3.10)
Process completed with exit code 65.

codespell (3.9)
The strategy configuration was canceled because "codespell._3_10" failed

codespell (3.9)
The operation was canceled.