microsoft/LoRA: Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

This repo contains the source code of the Python package loralib and several examples of how to integrate it with PyTorch models, such as those in HuggingFace. We only support PyTorch for now. See our paper for a detailed description of LoRA.
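For concreteness, here is a minimal sketch of the integration pattern, using loralib's documented Linear replacement and its mark_only_lora_as_trainable helper; the layer sizes and the rank r=16 are arbitrary values chosen for the example:

```python
import torch.nn as nn
import loralib as lora

# Replace a regular nn.Linear with its LoRA counterpart.
# Before: layer = nn.Linear(1024, 1024)
layer = lora.Linear(1024, 1024, r=16)  # r is the rank of the update matrices

# After building the full model, freeze everything except the LoRA parameters.
model = nn.Sequential(layer, nn.ReLU(), lora.Linear(1024, 10, r=16))
lora.mark_only_lora_as_trainable(model)

# Only the low-rank matrices receive gradients during training.
trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"training {trainable} of {total} parameters")
```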

LoRA: Low-Rank Adaptation of Large Language Models
Edward J. Hu*, Yelong Shen*, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen
Paper: https://arxiv.org/abs/2106.09685

LoRA reduces the number of trainable parameters by learning pairs of rank-decomposition matrices while freezing the original weights. This vastly reduces the storage requirement for large language models adapted to specific tasks and enables efficient task-switching during deployment, all without introducing inference latency. LoRA also outperforms several other adaptation methods, including adapters, prefix-tuning, and fine-tuning.
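In equation form, instead of learning a dense update ΔW of shape d×k for a frozen weight W0, LoRA learns B (d×r) and A (r×k) with rank r much smaller than d and k, so the adapted forward pass computes h = W0·x + B·A·x. The self-contained sketch below illustrates the idea; it is a simplified stand-in rather than loralib's actual implementation, with the zero-initialization of B and the alpha/r scaling following the paper's description:

```python
import torch
import torch.nn as nn

class LoRALinearSketch(nn.Module):
    """Illustrative LoRA layer: frozen W0 plus a trainable low-rank update B @ A."""
    def __init__(self, in_features: int, out_features: int, r: int = 8, alpha: float = 16.0):
        super().__init__()
        # Frozen pretrained weight W0 (random here, for illustration only).
        self.weight = nn.Parameter(torch.randn(out_features, in_features), requires_grad=False)
        self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)  # trainable, r x k
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))        # trainable, d x r; zero-init so the update starts at 0
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # h = W0 x + (B A) x * scaling; the dense d x k update is never materialized.
        return x @ self.weight.T + (x @ self.lora_A.T @ self.lora_B.T) * self.scaling

layer = LoRALinearSketch(1024, 1024, r=8)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # 2 * 8 * 1024 = 16384, versus 1024 * 1024 for a dense update
```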

We obtain results comparable or superior to full fine-tuning on the GLUE benchmark using RoBERTa (Liu et al., 2019) base and large and DeBERTa (He et al., 2020) XXL (1.5B parameters), while training and storing only a fraction of the parameters. RoBERTa and DeBERTa LoRA checkpoints are available for download from the repository.
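Because only the low-rank matrices are task-specific, a deployment can keep a single copy of the pretrained weights and swap in a small per-task file. A sketch of that pattern using loralib's lora_state_dict helper; the file names are placeholders for this example:

```python
import torch
import torch.nn as nn
import loralib as lora

model = nn.Sequential(lora.Linear(1024, 1024, r=16), nn.ReLU(), lora.Linear(1024, 10, r=16))

# Save only the LoRA matrices: a small file per task instead of a full model copy.
torch.save(lora.lora_state_dict(model), "task_A_lora.pt")

# Task switching at deployment: the shared pretrained weights stay loaded, and the
# per-task LoRA weights are swapped in; strict=False tolerates the missing base keys.
model.load_state_dict(torch.load("task_A_lora.pt"), strict=False)
```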
