HomeAbout

LLM

Basically a "next token predictor".

  • Predicting following tokens in sequence.

Training

Main objective of ML training is to minimize loss (hence, making more accurate predictions) over time.

Value/Loss

Value = Data's contribution to model's output. Loss = Measuring how wrong the model's outputs are.

Loss Functions (Objective Function)

Loss quantifies the difference between a model’s predictions and actual outcomes.

  • measures how wrong the model’s predictions are.

Loss function directly influence the effectiveness of model prediction.

  • accuracy of next token
  • guided learning
    • losses are consumed as a feedback mechanism for models to adjust internal weights and biases
  • performance optimization
    • efficient loss minimization enhances model performance, reduces overfitting, and improves generalization to unseen data.

Standard Loss Functions

AboutContact