apple / OpenELM

Submitted by Style Pass, 2024-04-24 05:00:03

Sachin Mehta, Mohammad Hossein Sekhavat, Qingqing Cao, Maxwell Horton, Yanzi Jin, Chenfan Sun, Iman Mirzadeh, Mahyar Najibi, Dmitry Belenko, Peter Zatloukal, Mohammad Rastegari

We introduce OpenELM, a family of Open-source Efficient Language Models. OpenELM uses a layer-wise scaling strategy to efficiently allocate parameters within each layer of the transformer model, leading to enhanced accuracy. We pretrained OpenELM models using the CoreNet library. We release both pretrained and instruction-tuned models with 270M, 450M, 1.1B, and 3B parameters.
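As a rough illustration of what layer-wise scaling can look like, the sketch below varies the number of attention heads and the feed-forward width from the first layer to the last instead of using one fixed size everywhere. It assumes a simple linear interpolation between a minimum and a maximum multiplier. The function name, the `LayerConfig` type, and every numeric value are hypothetical and are not taken from the released OpenELM configurations.

```python
from dataclasses import dataclass


@dataclass
class LayerConfig:
    num_heads: int  # attention heads in this layer
    ffn_dim: int    # hidden width of this layer's feed-forward block


def layerwise_scaling(num_layers, d_model, head_dim,
                      alpha_min, alpha_max, beta_min, beta_max):
    """Linearly interpolate per-layer width multipliers across depth.

    alpha scales the attention width (and so the head count);
    beta scales the feed-forward width. Both are illustrative knobs.
    """
    configs = []
    for i in range(num_layers):
        # t runs from 0.0 at the first layer to 1.0 at the last layer.
        t = i / (num_layers - 1) if num_layers > 1 else 0.0
        alpha = alpha_min + (alpha_max - alpha_min) * t
        beta = beta_min + (beta_max - beta_min) * t
        num_heads = max(1, round(alpha * d_model / head_dim))
        ffn_dim = round(beta * d_model)
        configs.append(LayerConfig(num_heads, ffn_dim))
    return configs


if __name__ == "__main__":
    # Made-up sizes, chosen only to show the trend across layers.
    for idx, cfg in enumerate(layerwise_scaling(4, 1024, 64, 0.5, 1.0, 1.0, 4.0)):
        print(idx, cfg)
```

With these made-up settings, early layers get fewer heads and narrower feed-forward blocks than later layers, so a fixed parameter budget is spread unevenly across depth rather than uniformly.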

Our pre-training dataset contains RefinedWeb, deduplicated PILE, a subset of RedPajama, and a subset of Dolma v1.6, totaling approximately 1.8 trillion tokens. Please check license agreements and terms of these datasets before using them.

We provide an example function in generate_openelm.py for generating output from OpenELM models loaded via the Hugging Face Hub.
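For orientation, loading a Hub-hosted model and generating text with the Hugging Face transformers library might look roughly like the sketch below. This is not the contents of generate_openelm.py, whose interface is not shown here. The function name and the default model and tokenizer identifiers are assumptions for illustration; the tokenizer repository may require separately granted access, and running the function downloads model weights.

```python
def generate_text(prompt,
                  model_id="apple/OpenELM-270M",            # assumed Hub model ID
                  tokenizer_id="meta-llama/Llama-2-7b-hf",  # assumed tokenizer ID
                  max_new_tokens=64):
    """Generate a continuation of `prompt` with a causal LM from the Hub."""
    # Imported lazily so that defining this function does not require
    # transformers (or a network connection) to be available.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(tokenizer_id)
    # trust_remote_code lets the Hub repository supply its own model class;
    # only enable it for repositories you trust.
    model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(inputs["input_ids"], max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A call such as `generate_text("Once upon a time there was")` would then return the prompt followed by the model's continuation, provided the identifiers above resolve and access has been granted.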
