enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. List of datasets for machine-learning research - Wikipedia

    en.wikipedia.org/wiki/List_of_datasets_for...

    GitHub repository of the project: Dynatrace This data is not pre-processed AIOps Challenge 2020 Data This data is not pre-processed GitHub repository of the project: Loghub This data is not pre-processed List of repositories: HTML Pages This data is not pre-processed List of HTML pages: Opensift ebooks This data is not pre-processed [409]

  3. Data Version Control (software) - Wikipedia

    en.wikipedia.org/wiki/Data_Version_Control...

    There are several open source projects that provide similar data version control capabilities to DVC, [52] such as: Git LFS, Dolt, Nessie, and lakeFS. These projects vary in their fit to the different needs of data engineers and data scientists such as: scalability, supported file formats, support in tabular data and unstructured data, volume ...

  4. Data build tool - Wikipedia

    en.wikipedia.org/wiki/Data_build_tool

    Dbt enables analytics engineers to transform data in their warehouses by writing select statements, and turns these select statements into tables and views. Dbt does the transformation (T) in extract, load, transform (ELT) processes – it does not extract or load data, but is designed to be performant at transforming data already inside of a ...

  5. Project Jupyter - Wikipedia

    en.wikipedia.org/wiki/Project_Jupyter

    Project Jupyter (/ ˈ dʒ uː p ɪ t ər / ⓘ) is a project to develop open-source software, open standards, and services for interactive computing across multiple programming languages. It was spun off from IPython in 2014 by Fernando Pérez and Brian Granger.

  6. Caffe (software) - Wikipedia

    en.wikipedia.org/wiki/Caffe_(software)

    Caffe is being used in academic research projects, startup prototypes, and even large-scale industrial applications in vision, speech, and multimedia. Yahoo! has also integrated Caffe with Apache Spark to create CaffeOnSpark, a distributed deep learning framework.

  7. OpenFDA - Wikipedia

    en.wikipedia.org/wiki/OpenFDA

    OpenFDA is a project indexing and formatting Food and Drug Administration (FDA) data, and making it accessible to the public. The ultimate goal of enabling the data accessibility is to educate people and save lives. The currently provided API of accessing data is under beta version. The project is open source and the code is available from GitHub.

  8. CatBoost - Wikipedia

    en.wikipedia.org/wiki/Catboost

    The source code is licensed under Apache License and available on GitHub. [6] InfoWorld magazine awarded the library "The best machine learning tools" in 2017. [11] along with TensorFlow, Pytorch, XGBoost and 8 other libraries. Kaggle listed CatBoost as one of the most frequently used machine learning (ML) frameworks in the world.

  9. DuckDB - Wikipedia

    en.wikipedia.org/wiki/DuckDB

    DuckDB is an open-source column-oriented relational database management system (RDBMS). [1] It is designed to provide high performance on complex queries against large databases in embedded configuration, [2] such as combining tables with hundreds of columns and billions of rows.