This repository was archived by the owner on Apr 1, 2026. It is now read-only.
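In the training loop, the TRM model uses `q_head` to introduce halting, so it stops before reaching $N_{sup}=16$ supervision steps. In evaluation, however, there is no halting, so the model runs the full $N_{sup}$ steps. Evaluation should match the number of steps at which the model converges in training; otherwise the model will overthink.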
TinyRecursiveModels/models/recursive_reasoning/trm.py
Line 275 in 7de0d20
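To make the mismatch concrete, here is a minimal sketch of a supervision loop with an optional halting check. It is not the code in `trm.py`: the names `run_supervision_steps`, `toy_step`, `toy_q_halt`, and the flag `use_halting` are hypothetical, and the rule "halt when the halting score is positive" is an assumption made for illustration.

```python
def run_supervision_steps(step, q_halt, state, n_sup=16, use_halting=True):
    """Run up to n_sup refinement steps and return (state, steps_taken).

    Assumption: the model halts as soon as q_halt(state) > 0.
    With use_halting=False the loop always runs all n_sup steps.
    """
    steps_taken = 0
    for _ in range(n_sup):
        state = step(state)
        steps_taken += 1
        if use_halting and q_halt(state) > 0:
            break
    return state, steps_taken


# Toy stand-ins for the recursive update and the q_head halting score.
def toy_step(x):
    return x * 0.5


def toy_q_halt(x):
    # Positive once the state has shrunk below 0.1.
    return 0.1 - x


_, steps_with_halting = run_supervision_steps(toy_step, toy_q_halt, 1.0)
_, steps_without_halting = run_supervision_steps(
    toy_step, toy_q_halt, 1.0, use_halting=False
)
print(steps_with_halting, steps_without_halting)
```

In this toy setup the halting run stops after 4 steps while the non-halting run takes all 16, which is the gap between training and evaluation described above.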