Tag: LLM generalization

Benchmark Transfer After Fine-Tuning: How LLMs Generalize Across Tasks

Learn how LLMs maintain general intelligence after specialization. Explore benchmark transfer, PEFT, LoRA, and strategies to prevent catastrophic forgetting.

How Training Duration and Token Counts Affect LLM Generalization

Training duration and token counts don't guarantee better LLM generalization. What matters is how sequence lengths are structured during training. Learn why variable-length training beats raw scale and how to avoid common pitfalls.