Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

submitted by

Style Pass

2025-07-28 10:30:05

Despite the strong zero-shot competence endowed by pre-training, Large Language Models (LLMs) still require task-specific customization for real-world applications. Parameter-Efficient Fine-Tuning (PEFT) methods, such as LoRA, address this by introducing a small set of trainable parameters while keeping the original weights frozen. However, PEFT can only alleviate, not eliminate, the cost of per-task tuning: every new task still needs its own optimization run, which creates a major bottleneck for large-scale deployment.
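To make the "small set of trainable parameters" concrete, here is a minimal sketch of a LoRA-style low-rank update in plain Python. It is an illustration, not code from the paper: the layer sizes, the rank, and the helper names matmul and lora_effective_weight are assumptions chosen for readability. The zero initialization of B follows the standard LoRA setup, so the adapted layer starts out identical to the frozen one.

```python
import random

def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def lora_effective_weight(W, A, B, scale=1.0):
    """Return W + scale * (B @ A) without modifying the frozen W."""
    delta = matmul(B, A)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, delta)]

# Illustrative sizes only (assumption); real LLM layers are far larger.
d_out, d_in, rank = 8, 8, 2
rng = random.Random(0)

W = [[rng.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]    # frozen base weights
A = [[rng.gauss(0, 0.01) for _ in range(d_in)] for _ in range(rank)]  # trainable, rank x d_in
B = [[0.0] * rank for _ in range(d_out)]                              # trainable, d_out x rank, zero init

W_eff = lora_effective_weight(W, A, B)

full_params = d_out * d_in           # parameters touched by full fine-tuning
lora_params = rank * (d_in + d_out)  # parameters trained by the adapter
print(full_params, lora_params)
```

Only A and B are trained, so the adapter grows with rank * (d_in + d_out) rather than d_in * d_out. Each task still needs its own A and B, which is the per-task cost described above.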

We observe that a LoRA adapter is nothing more than a function of its training data: gradient descent “drags” the base weights towards a task-specific optimum. If that mapping from prompts to weights can be learned directly, we could bypass gradient descent altogether.

Using fine-tuned LoRAs as training data, DnD learns the connection between input data prompts and model parameters. We test DnD's zero-shot ability by feeding it prompts from datasets unseen in training and instructing it to generate parameters for these novel datasets. On zero-shot test sets, our method shows substantial improvement over the average of the training LoRAs, generalizes to multiple real-world tasks, and scales to various LLM sizes.
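The data flow described above (a batch of prompts in, LoRA parameters out, no gradient descent at inference time) can be sketched as follows. This is a toy under stated assumptions, not the DnD architecture, which this text does not specify: embed_prompt is a hypothetical hashed bag-of-words stand-in for a real text encoder, ToyWeightGenerator is a hypothetical single linear map, and its matrix is random here, whereas a real generator would be trained on pairs of prompt batches and fine-tuned LoRA checkpoints.

```python
import hashlib
import random

# Illustrative sizes only (assumption).
EMBED_DIM, D_IN, D_OUT, RANK = 16, 4, 4, 2
N_LORA = RANK * D_IN + D_OUT * RANK  # flattened size of A plus B

def embed_prompt(prompt):
    """Hypothetical stand-in for a text encoder: hashed bag of words."""
    vec = [0.0] * EMBED_DIM
    for token in prompt.lower().split():
        idx = int(hashlib.md5(token.encode()).hexdigest(), 16) % EMBED_DIM
        vec[idx] += 1.0
    return vec

def embed_batch(prompts):
    """Average the embeddings of a batch of prompts drawn from one dataset."""
    vecs = [embed_prompt(p) for p in prompts]
    return [sum(col) / len(vecs) for col in zip(*vecs)]

class ToyWeightGenerator:
    """Linear map from a prompt-batch embedding to flattened LoRA factors."""

    def __init__(self, seed=0):
        rng = random.Random(seed)
        # Untrained random matrix; a real generator would be fit to LoRA checkpoints.
        self.M = [[rng.gauss(0, 0.1) for _ in range(EMBED_DIM)] for _ in range(N_LORA)]

    def __call__(self, prompts):
        e = embed_batch(prompts)
        flat = [sum(m * x for m, x in zip(row, e)) for row in self.M]
        split = RANK * D_IN
        A = [flat[i * D_IN:(i + 1) * D_IN] for i in range(RANK)]                  # rank x d_in
        B = [flat[split + i * RANK:split + (i + 1) * RANK] for i in range(D_OUT)]  # d_out x rank
        return A, B

generator = ToyWeightGenerator()
prompts = ["Which gas do plants absorb?", "What is the boiling point of water?"]
A, B = generator(prompts)
print(len(A), len(A[0]), len(B), len(B[0]))
```

Generating an adapter is a single forward pass through the generator, so an unseen dataset costs one call rather than a fine-tuning run.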

