When to dump json? | by Piotr Zakrzewski | plotwise | Sep, 2021 | Medium

Disclaimer: This is a fairly specific benchmark; it addresses only the serialization performance of a single big Python object. That was the problem we were solving at the time at Plotwise. It might not be relevant to any other case, but you can probably adapt our benchmark code to test a different scenario.

You may instinctively know that JSON is not the most efficient format. It is human-readable, which makes it pretty redundant as far as efficient encoding goes (think, for instance, of all the keys that need repeating, or the need for UTF-8 encoding). Let’s say you are dealing with a single, big structured data object (more specifically, a Python class instance). Is it worth using a binary encoding? Or is it better to dump it into JSON? What about pickling? How much speed can you gain, and at what cost in code complexity?
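To make the point about repeated keys concrete, here is a minimal sketch that encodes the same values twice: once as a list of records, where every record repeats its keys, and once with the keys stated a single time in a header row. The field names and the data are invented for illustration and are not taken from our planning model.

```python
import json

# Hypothetical delivery events; the field names are made up for illustration.
events = [
    {"event_id": i, "vehicle_id": i % 10, "arrival_ts": 1_632_816_000 + 60 * i}
    for i in range(1_000)
]

# Usual layout: every record repeats all three keys.
with_keys = json.dumps(events).encode("utf-8")

# Same values, but the keys appear only once, in a header row.
header = list(events[0])
rows = [[event[key] for key in header] for event in events]
keys_once = json.dumps({"fields": header, "rows": rows}).encode("utf-8")

print("keys repeated per record:", len(with_keys), "bytes")
print("keys stated once:        ", len(keys_once), "bytes")
```

The second layout is still JSON and still readable, but it is smaller simply because the keys are no longer repeated. Binary formats can go further by also encoding the numbers themselves more compactly.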

The use case we have at Plotwise is storing a snapshot of a route planning. These plannings grow with the number of delivery events, vehicles and driver shifts. They also change dynamically as they are optimized over time. It is important for us to be able to quickly serialize (…dump) the planning and persist it, but also to be able to restore it from persistence. The planning is a nested, multi-field JSON document with mostly numeric fields and some low-cardinality string fields. A typical planning is anywhere between 200 kB and 1.5 MB. At this size the benefits of switching to a more compact format should start to become apparent. But how big would the gain be exactly? Is it worth it?
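A rough way to explore these questions is to time a dump and a load of a synthetic planning with the standard-library `json` and `pickle` modules and compare the encoded sizes. The sketch below is not our actual benchmark code: `make_planning`, `best_time`, `benchmark` and all field names are hypothetical, and the object is a plain nested dict rather than a class instance. It only mimics the shape described above (nested, mostly numeric, a few low-cardinality strings).

```python
import json
import pickle
import time


def make_planning(n_events=5_000):
    """Build a synthetic, hypothetical planning: nested and mostly numeric."""
    return {
        "planning_id": 1,
        "vehicles": [{"vehicle_id": v, "capacity": 100.0} for v in range(20)],
        "events": [
            {
                "event_id": i,
                "vehicle_id": i % 20,
                "kind": "delivery" if i % 2 else "pickup",  # low-cardinality string
                "lat": 52.0 + i * 1e-5,
                "lon": 4.0 + i * 1e-5,
                "eta": 1_632_816_000 + 60 * i,
            }
            for i in range(n_events)
        ],
    }


def best_time(func, repeats=5):
    """Return the fastest of `repeats` wall-clock timings of func(), in seconds."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        func()
        best = min(best, time.perf_counter() - start)
    return best


def benchmark(obj):
    """Measure encoded size, dump time and load time for each codec."""
    codecs = {
        "json": (lambda o: json.dumps(o).encode("utf-8"), json.loads),
        "pickle": (
            lambda o: pickle.dumps(o, protocol=pickle.HIGHEST_PROTOCOL),
            pickle.loads,
        ),
    }
    results = {}
    for name, (dump, load) in codecs.items():
        blob = dump(obj)
        # Sanity check: the object must survive a round trip unchanged.
        assert load(blob) == obj
        results[name] = {
            "bytes": len(blob),
            "dump_s": best_time(lambda: dump(obj)),
            "load_s": best_time(lambda: load(blob)),
        }
    return results


if __name__ == "__main__":
    for name, stats in benchmark(make_planning()).items():
        print(name, stats)
```

Adjust `n_events` until the JSON size falls in the range you care about. Other codecs can be added to the `codecs` dict as further `(dump, load)` pairs. Keep in mind that unpickling data from an untrusted source can execute arbitrary code, so pickle is only an option when you control both ends of the persistence layer.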
