ML//Transformer//logits

2026-02-28

Raw scores over the entire vocabulary before normalization: Wu × normalized_output = logits.

Raw scores over the entire vocabulary before normalization: Wu × normalized_output = logits.

Each value = dot product between the context vector and one row of the LM head: measures "how aligned is this output with each possible next token".

Not probabilities: can be negative, arbitrarily large. Softmax converts them to a proper distribution.

Temperature divides logits before softmax: logits/T. Low T = peaky distribution (confident), high T = flat (creative)

In DPO: the model computes logits for both preferred and rejected outputs, widening the gap between them.