A recent conversation with a friend in the robotics industry naturally led to a discussion of how the industry as a whole can build more generalizable

Foundation models, internet-scale data, and the path to generalized robots

submited by

Style Pass

2024-02-13 05:00:03

A recent conversation with a friend in the robotics industry naturally led to a discussion of how the industry as a whole can build more generalizable robotic systems: systems that can perform well in novel tasks with little new data, systems that are robust to task perturbations, and systems that can adapt to different robot morphologies. We discussed prospects for leveraging large datasets of heterogeneous robotics data and the lessons we can take from the astonishing success of LLMs/other foundation models. This post explores that topic further.

Below I summarize some of the areas of intense general robotics research over the last few years. It focuses on the attempts to leverage LLMs/VLMs and related techniques for robotics and what techniques might prove fruitful for extracting the most value out of large robotics datasets. These techniques are being applied all across the robotics field to varying success. Considering my active work on driverless vehicles, I avoid analyzing related advances in autonomous driving. Instead, I focus this review mostly on grasping and manipulation robots, as that is an area of particular research emphasis. I think the lessons learned in this robotics sub-domain extend to most other areas of robotics.

As a preliminary historical note, the idea of using data from other robots or models trained on non-robotics data to build better robots is still a novel one. The hand-engineered components in conventional robotics systems (pre-deep learning) struggled to leverage robot data at all except to scrutinize directly for debugging purposes. To be clear, model based approaches can still produce powerful results, including the notable robot parkour exhibited by Boston Dynamics robots. However, this post assumes a more modern architecture that includes data driven components that are architected to enable learning from data, whether generated directly by the target platform or otherwise.

HERO Robots | Robot Workshop

Comment

A Foosball Playing Robot That Can Beat Most Humans at the Game

Comment

It's all about Quality of Service with M1 Macs compared to Intel models.

Comment

The Great Hargeisa Goat Bubble - Julian Gough's website

Comment

How Pinterest Reduced Costs and Improved Data Consistency with a NewSQL Database | PingCAP

Comment

Using AI to help organizations detect and report child sexual abuse material online

Comment

Paging Doctor Cloud! Amazon HealthLake Is Now Generally Available

Comment

Now in private preview: optimize your data distribution with hierarchical partition keys

Comment

Deploying Machine Learning Models (is still terrible)

Comment

Debunking the Mechanical Turk Helped Set Edgar Allan Poe on the Path to Mystery Writing

Comment

Foundation models, internet-scale data, and the path to generalized robots

Leave a Comment

Related Posts

HERO Robots | Robot Workshop

A Foosball Playing Robot That Can Beat Most Humans at the Game

It's all about Quality of Service with M1 Macs compared to Intel models.

The Great Hargeisa Goat Bubble - Julian Gough's website

How Pinterest Reduced Costs and Improved Data Consistency with a NewSQL Database | PingCAP

Using AI to help organizations detect and report child sexual abuse material online

Paging Doctor Cloud! Amazon HealthLake Is Now Generally Available

Now in private preview: optimize your data distribution with hierarchical partition keys

Deploying Machine Learning Models (is still terrible)

Debunking the Mechanical Turk Helped Set Edgar Allan Poe on the Path to Mystery Writing

Recent Posts

Eleven Predictions: Here's What AI Does Next - by Ted Gioia

When Life Feels Too Busy for Friendship

Startups are getting fined, or sometimes banned, by individual states

Importance, Value, and Causal Impact

Being a middle manager is getting more and more toxic

Cheaper Snapdragon X Is Here! First Look at Lenovo's 8-Core X Plus Arm Laptops

USC study confirms the rotation of Earth’s inner core has slowed

Thoughts on Simplicity and Its Quiet Strength

Cismela

A top 'engineer' faked his degrees and only had a high-school education. He got away with it for years.

Portable VR Welding Simulator

If Trump wins the election, it will doom our efforts to slow climate disaster

Islands are engines of language diversity

Steve Wozniak Reunites With the Historic Homebrew Computer Club

Search code, repositories, users, issues, pull requests...

Saving Southeast Asia’s Sunken Warships

Generate Videos in One Place (Beta)*

Nature’s ghosts: how reviving medieval farming offers wildlife an unexpected haven

Google says replacing C/C++ in firmware with Rust is easy

Death Valley National Park has its hottest summer on record