2025-12-16 20:57:43 | ============================================================
TRAINING STARTED
============================================================
Output: ./outputs/smolvlm-atari
Epochs: 1
Batch size: 8
Grad accum: 16
Effective batch: 128
Learning rate: 2e-05
Total steps: 310
============================================================
2025-12-16 21:14:36 | [ 50/310] ( 16.1%) | loss: 5.9426 | lr: 1.94e-05 | grad: 4.4062 | elapsed: 0:16:52 | ETA: 1.5h
2025-12-16 21:29:54 | [ 100/310] ( 32.3%) | loss: 0.0324 | lr: 1.63e-05 | grad: 0.2197 | elapsed: 0:32:10 | ETA: 1.1h
2025-12-16 21:45:12 | [ 150/310] ( 48.4%) | loss: 0.0179 | lr: 1.15e-05 | grad: 0.1406 | elapsed: 0:47:28 | ETA: 51m
2025-12-16 22:00:29 | [ 200/310] ( 64.5%) | loss: 0.0170 | lr: 6.25e-06 | grad: 0.1367 | elapsed: 1:02:45 | ETA: 35m
2025-12-16 22:15:46 | [ 250/310] ( 80.6%) | loss: 0.0169 | lr: 2.05e-06 | grad: 0.1553 | elapsed: 1:18:02 | ETA: 19m
2025-12-16 22:31:04 | [ 300/310] ( 96.8%) | loss: 0.0168 | lr: 6.90e-08 | grad: 0.1475 | elapsed: 1:33:21 | ETA: 3m
2025-12-16 22:34:57 | >>> CHECKPOINT SAVED at step 310 -> ./outputs/smolvlm-atari
2025-12-16 22:34:57 | [ 310/310] (100.0%) | loss: 0.9753 | lr: N/A | grad: N/A | elapsed: 1:37:13 | ETA: 0m
2025-12-16 22:34:57 | ============================================================
TRAINING COMPLETE
============================================================
Total time: 1:37:13
Final step: 310
Output: ./outputs/smolvlm-atari
============================================================