
Fix scheduler stepping and label dtype handling in training loop#173

Open
likitha487 wants to merge 1 commit into ML4SCI:main from likitha487:fix-issue-152

Conversation

@likitha487

Fixes issue #152

Changes made:

  1. Replaced scheduler.step(loss) with scheduler.step(), since schedulers such as CosineAnnealingWarmRestarts do not take a loss value; their step(epoch=None) signature would misinterpret the loss as an epoch index. Only ReduceLROnPlateau expects a metric argument.
  2. Updated the label conversion from labels.type(torch.LongTensor).to(device) to labels.long().to(device), which converts the dtype in place on the source device and avoids creating an intermediate CPU tensor.

This improves training stability and prevents unnecessary tensor transfers.
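A minimal sketch of the corrected training-loop pattern described above. The model, data, and hyperparameters here are placeholders for illustration, not the project's actual code:

```python
import torch
import torch.nn as nn
from torch.optim.lr_scheduler import CosineAnnealingWarmRestarts

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

model = nn.Linear(4, 3).to(device)  # placeholder model
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
scheduler = CosineAnnealingWarmRestarts(optimizer, T_0=10)

inputs = torch.randn(8, 4)
# Labels often arrive as int32 from a data pipeline; placeholder data here.
labels = torch.randint(0, 3, (8,), dtype=torch.int32)

for epoch in range(2):
    optimizer.zero_grad()
    # Fix 2: convert dtype directly, then move once to the target device,
    # instead of labels.type(torch.LongTensor).to(device), which forces
    # an intermediate CPU tensor.
    targets = labels.long().to(device)
    loss = criterion(model(inputs.to(device)), targets)
    loss.backward()
    optimizer.step()
    # Fix 1: CosineAnnealingWarmRestarts.step() takes no loss argument;
    # passing the loss would be misread as an epoch index.
    scheduler.step()
```

CrossEntropyLoss requires int64 targets, which is why the labels.long() conversion matters here.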

