Simulacra Aesthetic Captions

Images | CC0 1.0 Universal Public Domain Dedication | Introduced 2022-07-03

Simulacra Aesthetic Captions is a dataset of over 238,000 synthetic images generated with AI models such as CompVis latent GLIDE and Stable Diffusion from over forty thousand user-submitted prompts. Users rated the images on their aesthetic value from 1 to 10, yielding caption, image, and rating triplets. In addition, each user agreed to release all of their work with the bot (prompts, outputs, and ratings) into the public domain under the CC0 1.0 Universal Public Domain Dedication. The result is a high-quality, royalty-free dataset with over 176,000 ratings that can be used for projects such as:

  • Filtering Datasets
  • Guiding Generative Models
  • Training A Prompt Generator
  • Extracting vitamin phrases ("trending on artstation", etc.)
  • Alignment Research
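
As a rough illustration of the dataset-filtering use case, the sketch below averages the 1 to 10 ratings per image and keeps only images whose mean rating meets a threshold. The record layout, the sample values, and the names `SAMPLE_TRIPLETS`, `mean_rating_per_image`, and `filter_by_rating` are hypothetical and chosen for this example; they are not the dataset's actual schema or tooling.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical (caption, image_id, rating) triplets; not real dataset rows.
SAMPLE_TRIPLETS = [
    ("a castle at dusk, trending on artstation", "img_001", 8),
    ("a castle at dusk, trending on artstation", "img_001", 9),
    ("a castle at dusk", "img_002", 4),
    ("portrait of a robot", "img_003", 7),
    ("portrait of a robot", "img_003", 5),
]


def mean_rating_per_image(triplets):
    """Group ratings by image and return {image_id: (caption, mean rating)}."""
    ratings = defaultdict(list)
    captions = {}
    for caption, image_id, rating in triplets:
        # Ratings are described as running from 1 to 10.
        if not 1 <= rating <= 10:
            raise ValueError(f"rating out of range for {image_id}: {rating}")
        ratings[image_id].append(rating)
        captions[image_id] = caption
    return {
        image_id: (captions[image_id], mean(values))
        for image_id, values in ratings.items()
    }


def filter_by_rating(triplets, min_mean=7.0):
    """Return the sorted ids of images whose mean rating is at least min_mean."""
    scores = mean_rating_per_image(triplets)
    return sorted(
        image_id for image_id, (_, score) in scores.items() if score >= min_mean
    )


if __name__ == "__main__":
    print(filter_by_rating(SAMPLE_TRIPLETS))
```

The threshold of 7.0 is arbitrary; a real pipeline would likely also require a minimum number of ratings per image before trusting the mean.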

Description from: https://github.com/JD-P/simulacra-aesthetic-captions