ML//Training//parameter freezing

Choosing which model parameters to update during fine-tuning and which to lock (freeze)


Choosing which model parameters to update during fine-tuning and which to lock (freeze)

In standard fine-tuning and DPO: nothing is frozen by default — all parameters (attention + MLP) get updated.

In LoRA: base model frozen, small adapter matrices trained. Typically applied to W_Q, W_K, W_V, W_O — irónicamente las de attention, no MLP — because behavior changes most with fewest params there.

The tokenizer is almost never touched in fine-tuning — changing it means changing the embedding matrix, which is basically starting over.

Trainable vs frozen is a spectrum: full fine-tuning (0% frozen) → LoRA (~99% frozen) → prompt tuning (100% frozen, only input modified)