enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. Apache Superset - Wikipedia

    en.wikipedia.org/wiki/Apache_Superset

    Apache Superset is an open-source software application for data exploration and data visualization able to handle data at petabyte scale. The application started as a hack-a-thon project by Maxime Beauchemin (creator of Apache Airflow) while working at Airbnb and entered the Apache Incubator program in 2017. [1]

  3. Big data - Wikipedia

    en.wikipedia.org/wiki/Big_data

    Big data "size" is a constantly moving target; as of 2012 ranging from a few dozen terabytes to many zettabytes of data. [26] Big data requires a set of techniques and technologies with new forms of integration to reveal insights from data-sets that are diverse, complex, and of a massive scale. [27]

  4. pytest - Wikipedia

    en.wikipedia.org/wiki/Pytest

    Pytest is a Python testing framework that originated from the PyPy project. It can be used to write various types of software tests, including unit tests, integration tests, end-to-end tests, and functional tests. Its features include parametrized testing, fixtures, and assert re-writing.

  5. Wes McKinney - Wikipedia

    en.wikipedia.org/wiki/Wes_McKinney

    Wes McKinney is an American software developer and businessman. He is the creator and "Benevolent Dictator for Life" (BDFL) of the open-source pandas package for data analysis in the Python programming language, and has also authored three versions of the reference book Python for Data Analysis.

  6. Voldemort (distributed data store) - Wikipedia

    en.wikipedia.org/wiki/Voldemort_(distributed...

    Voldemort does not try to satisfy arbitrary relations and the ACID properties, but rather is a big, distributed, persistent hash table. [2] A 2012 study comparing systems for storing application performance management data reported that Voldemort, Apache Cassandra, and HBase all offered linear scalability in most cases, with Voldemort having the lowest latency and Cassandra having the highest ...

  7. Apache Hadoop - Wikipedia

    en.wikipedia.org/wiki/Apache_Hadoop

    Apache Hadoop (/həˈduːp/) is a collection of open-source software utilities for reliable, scalable, distributed computing. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model.

  8. Spyder (software) - Wikipedia

    en.wikipedia.org/wiki/Spyder_(software)

    Spyder is extensible with first-party and third-party plugins, [8] and includes support for interactive tools for data inspection and embeds Python-specific code quality assurance and introspection instruments, such as Pyflakes, Pylint [9] and Rope. [10] [11] Spyder uses Qt for its GUI and is designed to use either of the PyQt or PySide Python ...

  9. Data-driven programming - Wikipedia

    en.wikipedia.org/wiki/Data-driven_programming

    Standard examples of data-driven languages are the text-processing languages sed and AWK, [1] and the document transformation language XSLT, where the data is a sequence of lines in an input stream – these are thus also known as line-oriented languages – and pattern matching is primarily done via regular expressions or line numbers.