arxiv.org - Emergent and Predictable Memorization in Large Language Models

Emergent and Predictable Memorization in Large Language Models - arxiv.org ![rw-book-cover|200x400](https://readwise-assets.s3.amazonaws.com/static/images/article0.00998d930354.png) ## Metadata - Author: **arxiv.org** - Full Title: Emergent and Predictable Memorization in Large Language Models - Category: #articles - Tags: #ai #llm - URL: https://arxiv.org/abs/2304.11158 ## Highlights - The paper "Emergent and Predictable Memorization in Large Language Models" by Stella Biderman et al. studies the problem of memorization in large language models and proposes a method to predict which sequences will be memorized before full training of the model, based on extrapolation of memorization behavior from lower-compute trial runs, and provides novel insights on the distribution of memorization scores across models and data. Key insights and lessons learned from the paper: Memorization is a key concern for deploying large language models safely, particularly for sensitive datapoints such as PII. Intermediate checkpoints are better predictors of memorization behavior than smaller fully-trained models. Memorization scores follow a power-law distribution across models and data, with some datapoints being more prone to memorization than others. Fine-tuning can mitigate memorization to some extent, but not completely.