LLM

Basically a "next token predictor".

Predicting following tokens in sequence.

Training

Main objective of ML training is to minimize loss (hence, making more accurate predictions) over time.

Value/Loss

Value = Data's contribution to model's output. Loss = Measuring how wrong the model's outputs are.

Loss Functions (Objective Function)

Loss quantifies the difference between a model’s predictions and actual outcomes.

measures how wrong the model’s predictions are.

Loss function directly influence the effectiveness of model prediction.

accuracy of next token
guided learning
- losses are consumed as a feedback mechanism for models to adjust internal weights and biases
performance optimization
- efficient loss minimization enhances model performance, reduces overfitting, and improves generalization to unseen data.

Standard Loss Functions