LoRA: Low-Rank Adaptation of Large Language Models

作者： Edward J. Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen (2021)

arXiv： 2106.09685

领域

对齐

TLDR（中文）

LoRA 通过冻结预训练模型权重，只训练两个低秩矩阵的乘积（秩 r 远小于原始维度），把微调的可训练参数量降低了 10000 倍。这使得在消费级 GPU 上微调大模型成为可能，几乎成为当今最主流的参数高效微调（PEFT）方法。

TLDR (English)

LoRA freezes pretrained weights and only trains the product of two low-rank matrices (rank r much smaller than original dimensions), reducing trainable parameters by up to 10,000x. This makes fine-tuning large models on consumer GPUs feasible and has become the dominant parameter-efficient fine-tuning (PEFT) method.

出现在这些文章里

微调与对齐：让模型听指令、守规矩
Fine-Tuning and Alignment: Making Models Follow Instructions

同被引用

这些论文与本文出现在同一篇文章中

LoRA: Low-Rank Adaptation of Large Language Models

领域

TLDR（中文）

TLDR (English)

出现在这些文章里

同被引用

相关论文