Commit 7030255
Add downstream task evaluation script for llama3_native_te recipe
Adds eval_downstream.py that runs lm-eval benchmarks (arc_challenge,
arc_easy, boolq, copa, hellaswag, piqa, winogrande) on trained Lingua
1B checkpoints. Supports safetensors, distributed FSDP2 (DCP), and DDP
checkpoint formats. Self-contained checkpoint loading avoids TE version
compatibility issues with checkpoint.py imports.
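
The following is a minimal sketch of how a script like this might pick a loader for the three checkpoint formats and assemble an lm-eval run. It is not the code from eval_downstream.py. The function names `detect_checkpoint_format` and `build_lm_eval_command` are hypothetical, and so are the on-disk conventions: a `.metadata` file marking a DCP directory, and `.pt`/`.pth` files for DDP checkpoints. The sketch also assumes the checkpoint has already been exported to a Hugging Face-style directory before evaluation. Only the task list comes from the commit message.

```python
from pathlib import Path

# Benchmarks named in the commit message.
TASKS = [
    "arc_challenge", "arc_easy", "boolq", "copa",
    "hellaswag", "piqa", "winogrande",
]


def detect_checkpoint_format(path: str) -> str:
    """Guess the checkpoint format from its on-disk layout.

    Assumed conventions (not confirmed by the commit):
      - a *.safetensors file, or a directory containing one -> "safetensors"
      - a directory containing a ".metadata" file            -> "fsdp2_dcp"
      - a *.pt / *.pth file, or a directory containing one   -> "ddp"
    """
    p = Path(path)
    if p.is_file():
        if p.suffix == ".safetensors":
            return "safetensors"
        if p.suffix in {".pt", ".pth"}:
            return "ddp"
    elif p.is_dir():
        # Check the DCP marker first: a sharded directory may hold other files.
        if (p / ".metadata").exists():
            return "fsdp2_dcp"
        if any(p.glob("*.safetensors")):
            return "safetensors"
        if any(p.glob("*.pt")) or any(p.glob("*.pth")):
            return "ddp"
    raise ValueError(f"Unrecognized checkpoint layout: {path}")


def build_lm_eval_command(hf_model_dir: str, tasks=TASKS, batch_size: int = 8):
    """Build an lm-eval CLI invocation for an exported HF-style model directory.

    Returns the argument list without running it, so it can be passed to
    subprocess.run() or simply logged.
    """
    return [
        "lm_eval",
        "--model", "hf",
        "--model_args", f"pretrained={hf_model_dir}",
        "--tasks", ",".join(tasks),
        "--batch_size", str(batch_size),
    ]
```

Keeping the format detection independent of the training code reflects the commit's stated goal of avoiding imports from checkpoint.py, though the real script may detect formats differently or call the lm-eval Python API instead of the CLI.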
Made-with: Cursor
1 parent a047ec3 commit 7030255
1 file changed
Lines changed: 562 additions & 0 deletions