One of the simplest ways of encoding relative positional information in attention is to add a scalar to each of the attention logits, with a value that depends on the distance between the corresponding query and key positions (e.g. learned per-distance values in T5, fixed values decreasing with distance in ALiBi).
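As a rough sketch of this additive-bias approach, the snippet below computes single-head attention logits and subtracts an ALiBi-style penalty proportional to the query-key distance. The single-head setup and the `slope` value are illustrative assumptions, not any particular model's configuration.

```python
import torch

def additive_position_bias_scores(q, k, slope=0.1):
    """Attention logits with an ALiBi-style additive distance penalty.

    q, k: (seq_len, d) query and key matrices for one head.
    slope: per-head slope; a single illustrative value here.
    """
    seq_len, d = q.shape
    logits = q @ k.T / d ** 0.5                      # content-based scores
    pos = torch.arange(seq_len)
    distance = (pos[:, None] - pos[None, :]).abs()   # |i - j| for query i, key j
    return logits - slope * distance                 # bias decreases with distance
```

Note that the bias depends only on the positions, not on the content of the query or key, which is exactly the limitation discussed next.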
However, this makes it difficult for a query to attend to any specific (key, relative position) pair. In particular, the query must have a component pointing in the direction of the desired key, but this raises the attention score for every token whose key points in that direction, regardless of its position.
Rotary positional embeddings (RoPE) are an elegant solution to this problem. Essentially, the query and key vectors for every token are rotated by an angle proportional to the token's 1-d coordinate position. Because rotations compose, the dot product between a rotated query and a rotated key depends only on the difference between their positions, together with the original vectors.
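The sketch below illustrates the idea, assuming the common convention of rotating consecutive dimension pairs (2i, 2i+1) with frequencies base**(-2i/d); the function name `rotate_pairs` and the toy shapes are just for illustration, not a reference implementation.

```python
import torch

def rotate_pairs(x, positions, base=10000.0):
    """Apply a RoPE-style rotation to x of shape (seq_len, d), with d even.

    Each consecutive pair of dimensions (2i, 2i+1) is rotated by an angle
    positions * base**(-2i/d), i.e. proportional to the token's position.
    """
    seq_len, d = x.shape
    half = d // 2
    freqs = base ** (-torch.arange(half, dtype=torch.float32) * 2 / d)  # (half,)
    angles = positions[:, None].float() * freqs[None, :]                # (seq_len, half)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[:, 0::2], x[:, 1::2]                                     # split into pairs
    out = torch.empty_like(x)
    out[:, 0::2] = x1 * cos - x2 * sin
    out[:, 1::2] = x1 * sin + x2 * cos
    return out

# After rotating, q_rot[i] @ k_rot[j] depends only on the offset (i - j)
# and the original q, k vectors: the relative-position property of RoPE.
positions = torch.arange(6)
q, k = torch.randn(6, 8), torch.randn(6, 8)
q_rot, k_rot = rotate_pairs(q, positions), rotate_pairs(k, positions)
```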