enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. List of datasets for machine-learning research - Wikipedia

    en.wikipedia.org/wiki/List_of_datasets_for...

    The datasets are classified, based on the licenses, as Open data and Non-Open data. The datasets from various governmental-bodies are presented in List of open government data sites. The datasets are ported on open data portals. They are made available for searching, depositing and accessing through interfaces like Open API. The datasets are ...

  3. List of datasets in computer vision and image processing

    en.wikipedia.org/wiki/List_of_datasets_in...

    Overhead Imagery Research Data Set: Annotated overhead imagery. Images with multiple objects. Over 30 annotations and over 60 statistics that describe the target within the context of the image. 1000 Images, text Classification 2009 [170] [171] F. Tanner et al. SpaceNet SpaceNet is a corpus of commercial satellite imagery and labeled training data.

  4. List of neuroscience databases - Wikipedia

    en.wikipedia.org/wiki/List_of_neuroscience_databases

    A Virtual Library for Behavioral Performance in Standard Conditions – Rodent Spontaneous Activity in an Open Field during Repeated Testing and after Treatment with Drugs or Brain Lesions Research using an animal model of obsessive-compulsive disorder employed a standardized paradigm where the behavior of rats in a large open field was video ...

  5. List of GIS data sources - Wikipedia

    en.wikipedia.org/wiki/List_of_GIS_data_sources

    Kentucky Open Data Portal: The Kentucky Open Data Portal is a site for exploring, accessing and downloading Kentucky-specific GIS data and discovering mapping apps. You can analyze and combine datasets using maps, as well as develop new web and mobile applications. [13] KyFromAbove

  6. ImageNet - Wikipedia

    en.wikipedia.org/wiki/ImageNet

    The ImageNet project is a large visual database designed for use in visual object recognition software research. More than 14 million [1] [2] images have been hand-annotated by the project to indicate what objects are pictured and in at least one million of the images, bounding boxes are also provided. [3]

  7. List of biological databases - Wikipedia

    en.wikipedia.org/wiki/List_of_biological_databases

    open-source database for molecular interactions Protein-protein and other molecular interactions String: an open source molecular interaction database to study interactions between proteins Protein-protein and other molecular interactions Human Protein Atlas: Human Protein Atlas: aims at mapping all the human proteins in cells, tissues and organs

  8. The Pile (dataset) - Wikipedia

    en.wikipedia.org/wiki/The_Pile_(dataset)

    The Pile is an 886.03 GB diverse, open-source dataset of English text created as a training dataset for large language models (LLMs). It was constructed by EleutherAI in 2020 and publicly released on December 31 of that year. [1] [2] It is composed of 22 smaller datasets, including 14 new ones. [1]

  9. List of text corpora - Wikipedia

    en.wikipedia.org/wiki/List_of_text_corpora

    Text corpora (singular: text corpus) are large and structured sets of texts, which have been systematically collected.Text corpora are used by both AI developers to train large language models and corpus linguists and within other branches of linguistics for statistical analysis, hypothesis testing, finding patterns of language use, investigating language change and variation, and teaching ...