Representing filesystems in databases efficiently with Hierarchical Ordering

submitted by

Style Pass

2024-11-16 18:30:04

The problem is that the tree structure and the file path listing just below it are totally different ways of sorting. Filesystems walk the tree level by level, whereas databases walk their B-trees in key order (and constantly rebalance them).
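To make the difference concrete, here is a minimal sketch comparing the two orderings. The sample paths and the level_order helper are made up for illustration; a real filesystem walk or database index is more involved than this.

```python
from collections import defaultdict

# Hypothetical sample paths, purely for illustration.
paths = ["b/y.txt", "a/x.txt", "c.txt", "a/sub/z.txt", "b/w.txt"]

# A B-tree index scans its keys in sorted (lexicographic) order.
sorted_order = sorted(paths)


def level_order(paths):
    """Group paths by depth, roughly how a level-by-level tree walk sees them."""
    by_depth = defaultdict(list)
    for p in paths:
        by_depth[p.count("/")].append(p)
    return [p for depth in sorted(by_depth) for p in sorted(by_depth[depth])]


print(sorted_order)
# ['a/sub/z.txt', 'a/x.txt', 'b/w.txt', 'b/y.txt', 'c.txt']
print(level_order(paths))
# ['c.txt', 'a/x.txt', 'b/w.txt', 'b/y.txt', 'a/sub/z.txt']
```

In sorted order, everything under a/ (including nested entries) comes before anything under b/, while the level-order walk reaches every top-level entry first.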

S3 isn’t just blobs of your files; a key-value database keeps track of your files as well, and it’s the thing you hit first every time you make a request to S3.

The problem is that if we wanted to treat S3 as a filesystem and ran ls /, we would potentially be asking our S3 client to make an unbounded number of requests.

S3 lists keys in lexicographic order. This means that every descendant of the first “directory” has to be paged through before the listing reaches the next one.

Or in other words, if a/ has 1 million files beneath it, then we have to make 1,000 ListObjectsV2 requests (each returns at most 1,000 keys) before we even see the b/ paths.
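The arithmetic can be checked with a small in-memory simulation. This is only a sketch: list_page is a hypothetical stand-in for a single listing request over a sorted key space, not the real S3 client, and the key names are invented.

```python
import bisect

PAGE_SIZE = 1000  # ListObjectsV2 returns at most 1,000 keys per request


def list_page(sorted_keys, start_after=""):
    """Hypothetical stand-in for one listing request: the next page of keys
    that sort strictly after start_after."""
    i = bisect.bisect_right(sorted_keys, start_after)
    return sorted_keys[i:i + PAGE_SIZE]


def requests_until_prefix(sorted_keys, prefix):
    """Count the requests a naive full scan makes until a key with the
    given prefix appears in a page (or the listing runs out)."""
    requests, start_after = 0, ""
    while True:
        page = list_page(sorted_keys, start_after)
        requests += 1
        if not page or any(k.startswith(prefix) for k in page):
            return requests
        start_after = page[-1]


# One million made-up keys under a/, plus a single key under b/.
keys = sorted([f"a/{i:07d}" for i in range(1_000_000)] + ["b/first.txt"])
print(requests_until_prefix(keys, "b/"))  # 1001
```

The first 1,000 pages contain nothing but a/ keys; the b/ key only shows up on request 1,001.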

If you’re trying to mount S3 as a filesystem, this could result in atrocious performance (among the other shortfalls of using S3 as a filesystem).
