enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. Apache Kylin - Wikipedia

    en.wikipedia.org/wiki/Apache_Kylin

    Apache Kylin is an open source distributed analytics engine designed to provide a SQL interface and multi-dimensional analysis (OLAP) on Hadoop and Alluxio supporting extremely large datasets. It was originally developed by eBay , and is now a project of the Apache Software Foundation .

  3. AMPLab - Wikipedia

    en.wikipedia.org/wiki/AMPLab

    AMPLAB was a University of California, Berkeley lab focused on big data analytics located in Soda Hall. The name stands for the Algorithms, Machines and People Lab. [1] [2] It has been publishing papers since 2008 [3] and was officially launched in 2011. [4]

  4. Alluxio - Wikipedia

    en.wikipedia.org/wiki/Alluxio

    Initially as research project "Tachyon", Alluxio was created at the University of California, Berkeley's AMPLab as Haoyuan Li's Ph.D. Thesis, [2] advised by Professor Scott Shenker & Professor Ion Stoica. Alluxio sits between computation and storage in the big data analytics stack. It provides a data abstraction layer for computation frameworks ...

  5. Apache Spark - Wikipedia

    en.wikipedia.org/wiki/Apache_Spark

    Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming clusters with implicit data parallelism and fault tolerance . Originally developed at the University of California, Berkeley 's AMPLab , the Spark codebase was later donated to the Apache Software Foundation ...

  6. JanusGraph - Wikipedia

    en.wikipedia.org/wiki/JanusGraph

    The project is supported by IBM, Google, Hortonworks and Grakn Labs. [4] JanusGraph supports various storage backends (Apache Cassandra, Apache HBase, Google Cloud Bigtable, Oracle BerkeleyDB, ScyllaDB). [5] [6] The Scalability of JanusGraph depends on the underlying technologies, which are used with JanusGraph. For example, by using Apache ...

  7. List of Apache Software Foundation projects - Wikipedia

    en.wikipedia.org/wiki/List_of_Apache_Software...

    CarbonData: an indexed columnar data format for fast analytics on big data platform, e.g., Apache Hadoop, Apache Spark, etc; Cassandra: highly scalable second-generation distributed database; Causeway(formerly Isis): a framework for rapidly developing domain-driven apps in Java; Cayenne: Java ORM framework

  8. Fluentd - Wikipedia

    en.wikipedia.org/wiki/Fluentd

    Fluentd was positioned for "big data," semi- or un-structured data sets.It analyzes event logs, application logs, and clickstreams. [3] According to Suonsyrjä and Mikkonen, the "core idea of Fluentd is to be the unifying layer between different types of log inputs and outputs.", [4] Fluentd is available on Linux, macOS, and Windows.

  9. Apache Hadoop - Wikipedia

    en.wikipedia.org/wiki/Apache_Hadoop

    Apache Hadoop (/ h ə ˈ d uː p /) is a collection of open-source software utilities for reliable, scalable, distributed computing.It provides a software framework for distributed storage and processing of big data using the MapReduce programming model.