llama.cpp guide - Running LLMs locally, on any hardware, from scratch

…and it’s pretty fun. I was very skeptical about the AI/LLM “boom” back when it started. I thought, like many other people, that they were mostly just making stuff up and generating uncanny-valley-tier nonsense. Boy, was I wrong. I’ve used ChatGPT once or twice to test the waters - it made a pretty good first impression, despite hallucinating a bit. That was back when GPT-3.5 was the top model. We’ve come a pretty long way since then.

However, despite ChatGPT not disappointing me, I was still skeptical. Everything I wrote, and every piece of the response, was fully available to OpenAI, or whatever other provider I’d want to use. This is not a big deal, but it rubs me the wrong way, and it also means I can’t use LLMs for any work-related, non-open-source stuff. Also, ChatGPT is free only to some degree - if I wanted to go all-in on AI, I’d probably have to start paying. Which, obviously, I’d rather avoid.

At some point I started looking at open-source models. I had no idea how to use them, but the moment I saw the sizes of “small” models like Llama 2 7B, I realized that my RTX 2070 Super, with a mere 8GB of VRAM, would probably have issues running them (I was wrong about that too!), and that running them on the CPU would probably yield very bad performance. And then I bought a new GPU - an RX 7900 XT with 20GB of VRAM, which is definitely more than enough to run small-to-medium LLMs. Yay!
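For a rough sense of why 8GB looked like too little (and why it turned out to be enough): at FP16, a model’s weights take about 2 bytes per parameter, while quantized formats like llama.cpp’s Q8_0 or Q4_K_M shrink that considerably. Here’s a minimal back-of-the-envelope sketch - the bytes-per-weight figures are approximations, and real usage also needs room for the KV cache and runtime overhead:

```python
# Rough VRAM estimate for the weights of a 7B-parameter model.
# Bytes-per-weight values are approximate; actual llama.cpp memory
# usage also includes the KV cache and some runtime overhead.
PARAMS = 7e9  # Llama 2 7B

formats = {
    "FP16 (unquantized)": 2.0,    # 16 bits per weight
    "Q8_0 (8-bit quant)": 1.0,    # ~8 bits per weight
    "Q4_K_M (4-bit quant)": 0.5,  # ~4-5 bits per weight, rounded down
}

for name, bytes_per_weight in formats.items():
    gib = PARAMS * bytes_per_weight / 1024**3
    print(f"{name}: ~{gib:.1f} GiB")

# FP16 (unquantized):   ~13.0 GiB -> doesn't fit in 8GB
# Q8_0 (8-bit quant):    ~6.5 GiB -> tight, but fits
# Q4_K_M (4-bit quant):  ~3.3 GiB -> fits comfortably
```

This is why quantization matters so much for local inference - and why that 8GB card turned out to be more capable than I expected.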
