What is Catastrophic Forgetting?

Catastrophic Forgetting is the loss or degradation of previously learned capabilities when a model is trained or fine-tuned on new data.

How It Works

Catastrophic forgetting is a central risk in fine-tuning. A model adapted too aggressively to a narrow dataset may become better at the target examples while losing broader language ability, safety behavior, reasoning, multilingual performance, or formatting skills. It can be caused by high learning rates, too many training steps, narrow data, poor mixing, or full-parameter updates that overwrite useful representations. Teams mitigate it with validation suites, smaller updates, PEFT, data mixing, regularization, and regression tests against baseline capabilities.

Key Characteristics

  • Degrades previous capabilities after training on new data
  • Often appears when fine-tuning data is narrow or training is too aggressive
  • Can affect safety, reasoning, multilingual ability, formatting, or domain knowledge
  • May be hidden if evaluation only tests the new target task
  • Requires regression testing against baseline and holdout capabilities

Common Use Cases

  1. Evaluating whether SFT harmed general instruction following
  2. Checking if domain tuning reduced safety refusals
  3. Comparing full fine-tuning with LoRA or other PEFT methods
  4. Running regression suites before shipping a tuned model
  5. Designing data mixtures that preserve general capability

Example

loading...
Loading code...

Frequently Asked Questions

How do you detect catastrophic forgetting?

Run regression evaluations on baseline capabilities, safety behavior, formatting, and domain tasks before and after fine-tuning.

Does PEFT prevent catastrophic forgetting?

It can reduce risk because fewer base weights change, but it does not eliminate forgetting or behavior regression.

What causes catastrophic forgetting?

Common causes include narrow datasets, high learning rates, too many steps, poor data mixing, and lack of regression checks.

How can forgetting be mitigated?

Use curated mixed data, smaller updates, PEFT, regularization, early stopping, and broad validation suites.

Related Tools

Related Terms

Related Articles