enow.com Web Search

Search results

  2. List of datasets for machine-learning research - Wikipedia

    en.wikipedia.org/wiki/List_of_datasets_for...

    The datasets are classified, based on the licenses, as Open data and Non-Open data. The datasets from various governmental-bodies are presented in List of open government data sites. The datasets are ported on open data portals. They are made available for searching, depositing and accessing through interfaces like Open API. The datasets are ...

  3. List of datasets in computer vision and image processing

    en.wikipedia.org/wiki/List_of_datasets_in...

    The dataset is labeled with semantic labels for 32 semantic classes. Over 700 images. Images. Object recognition and classification. 2008. [56] [57] [58] Gabriel J. Brostow, Jamie Shotton, Julien Fauqueur, Roberto Cipolla. RailSem19: RailSem19 is a dataset for understanding scenes for vision systems on railways. The dataset is labeled semantically and ...

  4. BookCorpus - Wikipedia

    en.wikipedia.org/wiki/BookCorpus

    The dataset was initially hosted on a University of Toronto webpage. [4] An official version of the original dataset is no longer publicly available, though at least one substitute, BookCorpusOpen, has been created. [1] Though not documented in the original 2015 paper, the site from which the corpus's books were scraped is now known to be ...

  5. Over 1,200 (and growing) books published by the Metropolitan Museum of Art, New York, up to c. 2009, fully available to download as PDFs (though content is still copyrighted) from the Thomas J. Watson Library at the MMA. Exhibition and collection catalogues, many very large and well-illustrated, and much else.

  6. The best books of 2024, according to Goodreads - AOL

    www.aol.com/lifestyle/the-best-books-of-2024...

    The annual Goodreads Choice Awards are the only major book awards chosen by readers for readers, and this year over 6.2 million votes were cast by book lovers for their favorite page-turners of ...

  7. The Pile (dataset) - Wikipedia

    en.wikipedia.org/wiki/The_Pile_(dataset)

    The Pile is an 886.03 GB diverse, open-source dataset of English text created as a training dataset for large language models (LLMs). It was constructed by EleutherAI in 2020 and publicly released on December 31 of that year. [1] [2] It is composed of 22 smaller datasets, including 14 new ones. [1]

  8. Open Library - Wikipedia

    en.wikipedia.org/wiki/Open_Library

    Open Library is an online project intended to create "one web page for every book ever published". Created by Aaron Swartz, [3] [4] Brewster Kahle, [5] Alexis Rossi, [6] Anand Chitipothu, [6] and Rebecca Hargrave Malamud, [6] Open Library is a project of the Internet Archive, a nonprofit organization.

  9. Zenodo - Wikipedia

    en.wikipedia.org/wiki/Zenodo

    Zenodo is a general-purpose open repository developed under the European OpenAIRE program and operated by CERN. [1] [2] [3] It allows researchers to deposit research papers, data sets, research software, reports, and any other research related digital artefacts.