Datasets: informatiker / 20-million-bluesky-posts

Submitted by
Style Pass
2024-11-29 20:00:17

The data is anonymized (VERY IMPORTANT POINT). The full dataset could lead to legal issues. If you want your posts removed, I do not know how to do that, as I can't map DID -> Posts.

This dataset consists of 20 million public (AS IN OPENLY AVAILABLE) posts from Bluesky Social, collected through the platform's firehose API.
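To illustrate what "anonymized" can mean for a firehose record, here is a minimal sketch in Python. It is not the code used to build this dataset. The field names (`did`, `uri`, `handle`, `text`, `created_at`) and the `anonymize` helper are assumptions made for the example, since the actual schema is not described here.

```python
from typing import Any

# Hypothetical names of fields that identify an account; the real schema may differ.
IDENTITY_FIELDS = ("did", "uri", "handle")


def anonymize(record: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of the record without the account-identifying fields."""
    return {key: value for key, value in record.items() if key not in IDENTITY_FIELDS}


# Made-up example record, shaped loosely like a post event.
raw = {
    "did": "did:plc:example",
    "text": "hello world",
    "created_at": "2024-11-27T00:00:00Z",
}
clean = anonymize(raw)
```

Once the identifier is dropped like this, nothing in the stored record links the post back to its author, which is why a DID -> Posts lookup is not possible afterwards.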

This is not intended to hurt anyone, and I created it because I love making funny graphs. I do not intend to do any other NLP on this data.
