enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. List of datasets for machine-learning research - Wikipedia

    en.wikipedia.org/wiki/List_of_datasets_for...

    Dataset HF card, and project's GitHub repository. [393] Diggelmann et al. Climate News dataset A dataset for NLP and climate change media researchers The dataset is made up of a number of data artifacts (JSON, JSONL & CSV text files & SQLite database) Climate news DB, Project's GitHub repository [394] ADGEfficiency Climatext

  3. Data Version Control (software) - Wikipedia

    en.wikipedia.org/wiki/Data_Version_Control...

    There are several open source projects that provide similar data version control capabilities to DVC, [52] such as: Git LFS, Dolt, Nessie, and lakeFS. These projects vary in their fit to the different needs of data engineers and data scientists such as: scalability, supported file formats, support in tabular data and unstructured data, volume ...

  4. Figshare - Wikipedia

    en.wikipedia.org/wiki/Figshare

    Figshare is an online open access repository where researchers can preserve and share their research outputs, including figures, datasets, images, and videos. [1] It is free to upload content and free to access, in adherence to the principle of open data.

  5. Data build tool - Wikipedia

    en.wikipedia.org/wiki/Data_build_tool

    Dbt enables analytics engineers to transform data in their warehouses by writing select statements, and turns these select statements into tables and views. Dbt does the transformation (T) in extract, load, transform (ELT) processes – it does not extract or load data, but is designed to be performant at transforming data already inside of a ...

  6. Fashion MNIST - Wikipedia

    en.wikipedia.org/wiki/Fashion_MNIST

    The Fashion MNIST dataset is a large freely available database of fashion images that is commonly used for training and testing various machine learning systems. [1] [2] Fashion-MNIST was intended to serve as a replacement for the original MNIST database for benchmarking machine learning algorithms, as it shares the same image size, data format and the structure of training and testing splits.

  7. List of volunteer computing projects - Wikipedia

    en.wikipedia.org/wiki/List_of_volunteer...

    Trying to solve Sierpinski / Riesel Bases up to 1030, the project is Conjecture 'R Us [111] Yes 1,267 (Mar 2023) [112] 617.109 (Mar 2023) [112] TN-Grid: 2014-05-01 [113] Research Area of Trento of the National Research Council of Italy, University of Trento: Genetics: Gene@home is a scientific project belonging to the infrastructure TrentoGrid ...

  8. CloudSim - Wikipedia

    en.wikipedia.org/wiki/CloudSim

    CloudSim is a framework for modeling and simulation of cloud computing infrastructures and services. [1] Originally built primarily at the Cloud Computing and Distributed Systems (CLOUDS) Laboratory, [2] the University of Melbourne, Australia, CloudSim has become one of the most popular open source [citation needed] cloud simulators in the research and academia.

  9. CatBoost - Wikipedia

    en.wikipedia.org/wiki/Catboost

    The source code is licensed under Apache License and available on GitHub. [6] InfoWorld magazine awarded the library "The best machine learning tools" in 2017. [11] along with TensorFlow, Pytorch, XGBoost and 8 other libraries. Kaggle listed CatBoost as one of the most frequently used machine learning (ML) frameworks in the world.