ML//Transformer//logits
Raw scores over the entire vocabulary before normalization: Wu × normalized_output = logits.
Raw scores over the entire vocabulary before normalization: Wu × normalized_output = logits.
Each value = dot product between the context vector and one row of the LM head — measures "how aligned is this output with each possible next token".
Not probabilities — can be negative, arbitrarily large. Softmax converts them to a proper distribution.
Temperature divides logits before softmax: logits/T. Low T = peaky distribution (confident), high T = flat (creative)
In DPO: the model computes logits for both preferred and rejected outputs, widening the gap between them.