Gwern Branwen - How an Anonymous Researcher Predicted AI's Trajectory

submitted by
Style Pass
2024-11-14 09:00:02

Gwern is a pseudonymous researcher and writer. He was one of the first people to see LLM scaling coming. If you've read his blog, you know he's one of the most interesting polymathic thinkers alive.

In order to protect Gwern's anonymity, I proposed interviewing him in person, and having my friend Chris Painter voice over his words after. This amused him enough that he agreed.

After the episode, I convinced Gwern to create a donation page where people can help sustain what he’s up to. Please go here to contribute.

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Read the full transcript here. Follow me on Twitter for updates on future episodes.

Jane Street is looking to hire their next generation of leaders. Their deep learning team is looking for ML researchers, FPGA programmers, and CUDA programmers. Summer internships are open - if you want to stand out, take a crack at their new Kaggle competition. To learn more, go here.

Turing provides complete post-training services for leading AI labs like OpenAI, Anthropic, Meta, and Gemini. They specialize in model evaluation, SFT, RLHF, and DPO to enhance models’ reasoning, coding, and multimodal capabilities. Learn more here.
