Chaitanya K. Joshi's excellent blog post on his PhD work inspired me to write down some thoughts on symmetries and invariances in neural network models—particularly in the context of molecular data. Discussions around removing architectural biases can lean a little absolutist ("Bitter Lesson, GPUs go brrr!"), whereas the reality is more nuanced. I am actually not so opposed to the idea of encoding physical priors, but I am interested in when such symmetries should be embedded directly into the model architecture, and when they might be better handled implicitly: through data augmentation, regularization or inference-time tricks.
This post represents some of my current thinking on the topic, through the lens of two modeling domains in which the approaches to encoding symmetries are very different: neural network potentials and diffusion models.
When modeling image data, there are many desirable invariances one can imagine (translation being the predominant one, baked into the most common image architecture, the CNN). Some of these invariances have questionable utility in practice, or do not match the statistics of a natural data distribution. Rotation of images fits this mold well: in theory, a classifier's ability to accurately predict an image attribute should not depend strongly on the image's orientation. But requiring this invariance property removes a lot of information about the spatial distribution of colours and objects in images (skies are typically blue, and typically at the top of an image). If a model learns this bias from the data, is it incorrect?
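To make the "implicit" route from above concrete, here is a minimal sketch (not from the original post) of how rotation invariance is typically encouraged via data augmentation rather than baked into the architecture, assuming a standard PyTorch/torchvision setup; the rotation range is an arbitrary illustrative choice:

```python
import torch
import torchvision.transforms as T
from PIL import Image

# Hypothetical augmentation pipeline: each training image is rotated by a random
# angle, so the classifier sees many orientations of the same content and is
# encouraged (but never hard-constrained) to become approximately invariant.
train_transform = T.Compose([
    T.RandomRotation(degrees=30),  # sample an angle in [-30, 30] degrees per image
    T.ToTensor(),                  # convert to a (C, H, W) float tensor in [0, 1]
])

# Dummy image to show the transform is runnable end to end.
img = Image.new("RGB", (64, 64))
x = train_transform(img)  # randomly rotated tensor of shape (3, 64, 64)
```

Note that the rotation range is a dial: a modest range nudges the model toward invariance while still letting it exploit statistics like "skies sit at the top of the image", whereas a full 360° range pushes it toward the stricter invariance that an equivariant architecture would enforce by construction.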