enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. Flux (text-to-image model) - Wikipedia

    en.wikipedia.org/wiki/Flux_(text-to-image_model)

    Flux (also known as FLUX.1) is a text-to-image model developed by Black Forest Labs, based in Freiburg im Breisgau, Germany. Black Forest Labs were founded by former employees of Stability AI. As with other text-to-image models, Flux generates images from natural language descriptions, called prompts.

  3. List of datasets in computer vision and image processing

    en.wikipedia.org/wiki/List_of_datasets_in...

    Aerial Image Segmentation Dataset 80 high-resolution aerial images with spatial resolution ranging from 0.3 to 1.0. Images manually segmented. 80 Images Aerial Classification, object detection 2013 [160] [161] J. Yuan et al. KIT AIS Data Set Multiple labeled training and evaluation datasets of aerial images of crowds.

  4. Hugging Face - Wikipedia

    en.wikipedia.org/wiki/Hugging_Face

    datasets, mainly in text, images, and audio; web applications ("spaces" and "widgets"), intended for small-scale demos of machine learning applications. There are numerous pre-trained models that support common tasks in different modalities, such as:

  5. Stable Diffusion - Wikipedia

    en.wikipedia.org/wiki/Stable_Diffusion

    Stable Diffusion was trained on pairs of images and captions taken from LAION-5B, a publicly available dataset derived from Common Crawl data scraped from the web, where 5 billion image-text pairs were classified based on language and filtered into separate datasets by resolution, a predicted likelihood of containing a watermark, and predicted ...

  6. List of datasets for machine-learning research - Wikipedia

    en.wikipedia.org/wiki/List_of_datasets_for...

    Dataset of legal contracts with rich expert annotations ~13,000 labels CSV and PDF Natural language processing, QnA 2021 The Atticus Project: Vietnamese Image Captioning Dataset (UIT-ViIC) Vietnamese Image Captioning Dataset 19,250 captions for 3,850 images CSV and PDF Natural language processing, Computer vision 2020 [113] Lam et al.

  7. GPT-2 - Wikipedia

    en.wikipedia.org/wiki/GPT-2

    GPT-2 was pre-trained on a dataset of 8 million web pages. [2] It was partially released in February 2019, followed by full release of the 1.5-billion-parameter model on November 5, 2019. [3] [4] [5] GPT-2 was created as a "direct scale-up" of GPT-1 [6] with a ten-fold increase in both its parameter count and the size of its training dataset. [5]

  8. The Pile (dataset) - Wikipedia

    en.wikipedia.org/wiki/The_Pile_(dataset)

    The Pile is an 886.03 GB diverse, open-source dataset of English text created as a training dataset for large language models (LLMs). It was constructed by EleutherAI in 2020 and publicly released on December 31 of that year. [1] [2] It is composed of 22 smaller datasets, including 14 new ones. [1]

  9. Generative artificial intelligence - Wikipedia

    en.wikipedia.org/wiki/Generative_artificial...

    Similarly, an image model prompted with the text "a photo of a CEO" might disproportionately generate images of white male CEOs, [128] if trained on a racially biased data set. A number of methods for mitigating bias have been attempted, such as altering input prompts [ 129 ] and reweighting training data.