
Why was Mamba rejected?

submitted by
Style Pass
2024-05-08 18:30:07

Recently, the International Conference on Learning Representations (ICLR) announced its final decisions for the 2024 conference, drawing significant attention to a particular submission: the Mamba model. This model, initially seen as a major contender against the well-known Transformer architecture for language modeling tasks, was ultimately rejected despite a promising review profile, with scores of 8, 8, 6, and 3 from its reviewers.

Mamba’s rejection raises questions, especially considering its innovative approach as a selective state space model that scales linearly with context length and could potentially outperform the Transformer in certain aspects. Yet, upon closer examination of the reviewers’ feedback, it becomes evident that the concerns were primarily about the evaluation methodology.
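To make the linear-scaling claim concrete, here is a minimal sketch in plain NumPy. The gating scheme, parameter names, and dimensions are illustrative assumptions, not Mamba's actual parameterization or code; the point is only that a selective state-space recurrence updates a fixed-size hidden state once per token, so the total work grows linearly with context length rather than quadratically as with full attention.

```python
# Toy selective state-space recurrence (illustrative only, not Mamba's real code).
# Each token performs one input-dependent update of a fixed-size hidden state,
# so the cost of processing a sequence is O(seq_len), not O(seq_len^2).
import numpy as np

def selective_ssm_scan(x, state_dim=16, seed=0):
    """x: array of shape (seq_len, d_model). Returns an array of the same shape.

    All weight matrices below are random placeholders standing in for learned
    parameters; the names (W_decay, W_input, W_out) are hypothetical.
    """
    rng = np.random.default_rng(seed)
    seq_len, d_model = x.shape
    W_decay = rng.standard_normal((d_model, state_dim)) * 0.1
    W_input = rng.standard_normal((d_model, state_dim)) * 0.1
    W_out = rng.standard_normal((state_dim, d_model)) * 0.1

    h = np.zeros(state_dim)              # fixed-size hidden state
    outputs = np.empty_like(x)
    for t in range(seq_len):             # single pass over the sequence
        decay = 1.0 / (1.0 + np.exp(-(x[t] @ W_decay)))  # input-dependent ("selective") gate
        drive = x[t] @ W_input                            # input-dependent state update
        h = decay * h + (1.0 - decay) * drive             # recurrence over constant-size state
        outputs[t] = h @ W_out
    return outputs

# Doubling the context length roughly doubles the work, since each step
# touches only a constant-size state.
y = selective_ssm_scan(np.random.randn(1024, 32))
print(y.shape)  # (1024, 32)
```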

1. Missing LRA Results: The submission did not report results on the Long Range Arena (LRA), a standard benchmark for evaluating long-sequence models and a conventional point of comparison in similar research, so reviewers saw its omission as a notable gap.
