What is Catastrophic Forgetting?

Catastrophic Forgetting is the loss or degradation of previously learned capabilities when a model is trained or fine-tuned on new data.

How It Works

Catastrophic forgetting is a central risk in fine-tuning. A model adapted too aggressively to a narrow dataset may improve on the target examples while losing broader language ability, safety behavior, reasoning, multilingual performance, or formatting skills. Common causes are high learning rates, too many training steps, narrow data, poor data mixing, and full-parameter updates that overwrite useful representations. Teams detect it with validation suites and regression tests against baseline capabilities, and mitigate it with smaller updates, PEFT, data mixing, and regularization.
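The effect can be reproduced on a toy problem. The sketch below is a minimal illustration, not a model of any real training run: a two-weight linear model is fit to a hypothetical task A, then fine-tuned on a conflicting task B, once gently (low learning rate, few steps) and once aggressively (high learning rate, many steps). All function names, task weights, and hyperparameters are assumptions chosen for illustration.

```python
import random

def make_task(true_w, n, rng):
    """Build a noiseless regression task y = true_w . x with n random inputs."""
    xs = [[rng.uniform(-1.0, 1.0) for _ in true_w] for _ in range(n)]
    ys = [sum(w * x for w, x in zip(true_w, row)) for row in xs]
    return xs, ys

def predict(w, row):
    return sum(wi * xi for wi, xi in zip(w, row))

def mse(w, task):
    """Mean squared error of weights w on a task."""
    xs, ys = task
    return sum((predict(w, row) - y) ** 2 for row, y in zip(xs, ys)) / len(ys)

def train(w, task, lr, steps):
    """Full-batch gradient descent on mean squared error; returns new weights."""
    xs, ys = task
    w = list(w)
    for _ in range(steps):
        grad = [0.0] * len(w)
        for row, y in zip(xs, ys):
            err = predict(w, row) - y
            for j, xj in enumerate(row):
                grad[j] += 2.0 * err * xj / len(ys)
        w = [wi - lr * g for wi, g in zip(w, grad)]
    return w

rng = random.Random(0)
task_a = make_task([1.0, 2.0], 50, rng)   # hypothetical "general" capability
task_b = make_task([3.0, -1.0], 50, rng)  # hypothetical narrow target task

base = train([0.0, 0.0], task_a, lr=0.5, steps=200)   # "pre-training" on task A
gentle = train(base, task_b, lr=0.05, steps=5)        # small update
aggressive = train(base, task_b, lr=0.5, steps=200)   # large update

for name, w in [("base", base), ("gentle", gentle), ("aggressive", aggressive)]:
    print(f"{name:<10} task A loss={mse(w, task_a):.4f}  task B loss={mse(w, task_b):.4f}")
```

In this toy setup the aggressive run fits task B best and loses the most on task A, while the gentle run trades less task B improvement for better retention. Real models are far more complex, but the same tension between adapting and retaining applies.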

Key Characteristics

Degrades previous capabilities after training on new data
Often appears when fine-tuning data is narrow or training is too aggressive
Can affect safety, reasoning, multilingual ability, formatting, or domain knowledge
May be hidden if evaluation only tests the new target task
Requires regression testing against baseline and holdout capabilities

Common Use Cases

Evaluating whether SFT harmed general instruction following
Checking if domain tuning reduced safety refusals
Comparing full fine-tuning with LoRA or other PEFT methods
Running regression suites before shipping a tuned model
Designing data mixtures that preserve general capability
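For the data-mixture use case, one common pattern is to blend general-purpose examples into the domain fine-tuning set. The following is a minimal sketch under stated assumptions: `mix_datasets` and its `general_fraction` parameter are hypothetical names, the examples are placeholder strings, and the 20% ratio is arbitrary rather than a recommended value.

```python
import random

def mix_datasets(domain, general, general_fraction, seed=0):
    """Return all domain examples plus sampled general examples, shuffled,
    so that general examples make up roughly general_fraction of the result."""
    if not 0.0 <= general_fraction < 1.0:
        raise ValueError("general_fraction must be in [0, 1)")
    rng = random.Random(seed)
    # Solve g / (len(domain) + g) = general_fraction for g.
    n_general = round(len(domain) * general_fraction / (1.0 - general_fraction))
    n_general = min(n_general, len(general))  # cannot sample more than exist
    mixed = list(domain) + rng.sample(general, n_general)
    rng.shuffle(mixed)
    return mixed

# Placeholder data standing in for real training examples.
domain = [f"domain-{i}" for i in range(80)]
general = [f"general-{i}" for i in range(1000)]

mixed = mix_datasets(domain, general, general_fraction=0.2)
n_general = sum(1 for ex in mixed if ex.startswith("general-"))
print(len(mixed), n_general)
```

The right fraction depends on the model and data, so it is usually chosen by checking regression results at a few candidate ratios.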

Example

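A regression check compares per-suite scores from the base model and the tuned model and flags any suite that dropped by more than a tolerance. This is a minimal sketch: `find_regressions`, the suite names, the scores, and the 0.02 tolerance are all hypothetical, made up for illustration rather than taken from any real evaluation.

```python
def find_regressions(baseline, tuned, tolerance=0.02):
    """Return {suite: drop} for suites where the tuned score fell
    more than `tolerance` below the baseline score."""
    regressions = {}
    for suite, base_score in baseline.items():
        if suite not in tuned:
            raise KeyError(f"tuned model has no score for suite {suite!r}")
        drop = base_score - tuned[suite]
        if drop > tolerance:
            regressions[suite] = round(drop, 4)
    return regressions

# Hypothetical scores in [0, 1]; illustrative numbers only.
baseline_scores = {
    "target_task": 0.61,
    "instruction_following": 0.88,
    "safety_refusals": 0.97,
    "multilingual": 0.74,
    "json_formatting": 0.95,
}
tuned_scores = {
    "target_task": 0.83,
    "instruction_following": 0.87,
    "safety_refusals": 0.81,
    "multilingual": 0.62,
    "json_formatting": 0.94,
}

regressions = find_regressions(baseline_scores, tuned_scores)
for suite, drop in regressions.items():
    print(f"REGRESSION {suite}: dropped by {drop}")
```

With these made-up numbers the target task improves, yet the safety and multilingual suites are flagged. Evaluating only the target task would have hidden both drops.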

Frequently Asked Questions

How do you detect catastrophic forgetting?

Run regression evaluations on baseline capabilities, safety behavior, formatting, and domain tasks before and after fine-tuning.

Does PEFT prevent catastrophic forgetting?

It can reduce risk because most or all base weights stay frozen and only a small number of parameters are trained, but it does not eliminate forgetting or behavior regression.
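To make the size difference concrete, the sketch below counts trainable parameters for a single weight matrix under full fine-tuning and under LoRA, where the update is factored into two low-rank matrices. The layer shape and rank are hypothetical values picked for illustration.

```python
def full_update_params(d_out, d_in):
    """Trainable parameters when the whole d_out x d_in matrix is updated."""
    return d_out * d_in

def lora_params(d_out, d_in, rank):
    """Trainable parameters for a LoRA adapter on the same matrix:
    B is d_out x rank, A is rank x d_in; the original matrix stays frozen."""
    return rank * (d_out + d_in)

d_out, d_in, rank = 4096, 4096, 8  # hypothetical layer shape and rank

full = full_update_params(d_out, d_in)
lora = lora_params(d_out, d_in, rank)
print(f"full: {full:,}  LoRA: {lora:,}  ratio: {lora / full:.4%}")
```

The adapter is small, but the effective weight is still the frozen matrix plus the product of B and A, so model behavior can still shift. Regression tests remain necessary.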

What causes catastrophic forgetting?

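Common causes include narrow datasets, high learning rates, too many steps, poor data mixing, and full-parameter updates that overwrite useful representations. A lack of regression checks does not cause forgetting, but it lets forgetting go unnoticed.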

How can forgetting be mitigated?

Use curated mixed data, smaller updates, PEFT, regularization, early stopping, and broad validation suites.
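Early stopping can be framed as checkpoint selection: among saved checkpoints, keep the one with the best target score whose general-capability score stays within an allowed drop from the baseline. The sketch below assumes a hypothetical `pick_checkpoint` helper and made-up scores; the 0.02 threshold is an arbitrary example.

```python
def pick_checkpoint(history, baseline_general, max_drop=0.02):
    """history: list of (step, target_score, general_score) tuples, one per checkpoint.
    Returns the tuple with the best target_score among checkpoints whose
    general_score is within max_drop of baseline_general, or None if none qualify."""
    allowed = [h for h in history if baseline_general - h[2] <= max_drop]
    if not allowed:
        return None
    return max(allowed, key=lambda h: h[1])

# Hypothetical evaluation history; illustrative numbers only.
history = [
    (100, 0.66, 0.90),
    (200, 0.74, 0.89),
    (300, 0.80, 0.885),
    (400, 0.84, 0.85),
    (500, 0.86, 0.79),
]

best = pick_checkpoint(history, baseline_general=0.90)
print(best)
```

Here the later checkpoints score higher on the target task but fall outside the allowed drop, so an earlier checkpoint is selected.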

Related Terms

Fine-tuning

Fine-tuning is a transfer learning technique that adapts a pre-trained machine learning model to a specific task or domain by continuing the training process on a smaller, task-specific dataset. This approach leverages the general knowledge already captured in the pre-trained model while customizing its behavior for specialized applications.

SFT

SFT (Supervised Fine-Tuning) is a supervised training stage that fine-tunes a pretrained language model on curated prompt-response examples.

PEFT

PEFT (Parameter-Efficient Fine-Tuning) is a family of techniques that adapt large pre-trained models to downstream tasks by training only a small subset of parameters, dramatically reducing computational requirements while maintaining competitive performance.