Large Language Models (LLMs)
Created: `=dateformat(this.file.ctime,"dd MMM yyyy, hh:mm a")` | Modified: `=dateformat(this.file.mtime,"dd MMM yyyy, hh:mm a")`
Tags: knowledge
Overview
Related fields
Introduction
Perplexity Metric (PPL)
- metric that measures how well a language model predicts the next token; the lower the perplexity, the better the model
- the metric applies specifically to classical language models (sometimes called autoregressive or causal language models) and is not well defined for masked language models like BERT
- defined as the exponentiated average negative log-likelihood of a sequence (see the formula after this list)
- Intuitively, it can be thought of as an evaluation of the model’s ability to predict uniformly among the set of specified tokens in a corpus.
- the tokenization procedure has a direct impact on a model’s perplexity which should always be taken into consideration when comparing different models.
- Evaluation Metrics for Language Modeling
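A minimal sketch of the definition above, in standard notation (the sequence $X$ and model distribution $p_\theta$ are assumed symbols, not names from this note): for a tokenized sequence $X = (x_0, x_1, \dots, x_t)$ scored by a causal language model,

```latex
% Perplexity as the exponentiated average negative log-likelihood of the sequence
\mathrm{PPL}(X) = \exp\!\left( -\frac{1}{t} \sum_{i=1}^{t} \log p_\theta\!\left(x_i \mid x_{<i}\right) \right)
```

A model that spread probability uniformly over a vocabulary of size $|V|$ would score $\mathrm{PPL} = |V|$, which is the intuition behind the "predict uniformly among the set of specified tokens" phrasing above.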
Theoretical References
Papers
Articles
Courses
- GitHub - mlabonne/llm-course: Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
- quite a few resources and blog posts covering the following topics:

Code References
Methods
- GitHub - haotian-liu/LLaVA: [NeurIPS’23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
- GitHub - SkunkworksAI/BakLLaVA
- microsoft/phi-2 · Hugging Face
- mistralai/Mistral-7B-v0.1 · Hugging Face
- mistralai/Mixtral-8x7B-v0.1 · Hugging Face
- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B · Hugging Face
- SmolVLM2: Bringing Video Understanding to Every Device
Tools, Frameworks
- OpenAccess-AI-Collective/axolotl
- User-friendly and powerful fine-tuning tool used to train many state-of-the-art open-source models.
- GitHub - unslothai/unsloth: Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥
- finetuning
- exo
- Determine model size requirements (memory needed for a given parameter count and precision); a rough sketch follows below
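A minimal sketch of the arithmetic behind such a sizing check (the function name and the 20% overhead factor are assumptions for illustration, not taken from any specific tool): weight memory is roughly parameter count times bytes per parameter.

```python
# Rough estimate of the memory needed to hold a model's weights in RAM/VRAM.
# bytes_per_param: 4 for fp32, 2 for fp16/bf16, ~0.5 for 4-bit quantization.
# The 20% overhead for activations / KV cache is an assumed ballpark, not a rule.
def estimate_model_memory_gb(num_params: float,
                             bytes_per_param: float = 2,
                             overhead: float = 1.2) -> float:
    return num_params * bytes_per_param * overhead / 1e9


# Example: a 7B-parameter model in bf16 is ~14 GB of weights,
# ~16.8 GB once the assumed overhead is included.
print(f"{estimate_model_memory_gb(7e9):.1f} GB")
```

The KV cache at inference time grows with context length and batch size, so the real requirement can be noticeably higher than this weight-only estimate.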