The images were generated with a collection of models available under the Apache-2.0 or creativeml-openrail-m licenses. To generate this dataset we us

Datasets: ninamoss / sleeetview_agentic_ai_dataset like 0

submited by
Style Pass
2025-01-15 03:00:02

The images were generated with a collection of models available under the Apache-2.0 or creativeml-openrail-m licenses. To generate this dataset we used our own agentic implementation given the goal of creating a dataset that can be used to research synthetic content detection. As pioneers in the synthetic content detection realm, we think having a varied sampling of synthetic data is important to determine detection efficiency.

Also included in the dataset are segmentation masks and metadata generated with the DETR panoptic model: https://huggingface.co/facebook/detr-resnet-101-panoptic

An in depth explanation of our approach to agentic generation of synthetic content can be found here: https://menditai.substack.com/p/the-night-the-dataset-appeared-an We opted for a local setup using Ollama and Falcon3 as the LLM powering the agent. Based on our experience with this process we find:

Segmentation masks and metadata were automatically generated using the DETR panoptic model: https://huggingface.co/facebook/detr-resnet-101-panoptic

Leave a Comment
Related Posts