enow.com Web Search

Search results

  1. Results from the WOW.Com Content Network
  2. Apache Airflow - Wikipedia

    en.wikipedia.org/wiki/Apache_Airflow

    Apache Airflow is an open-source workflow management platform for data engineering pipelines. It started at Airbnb in October 2014 [2] as a solution to manage the company's increasingly complex workflows. Creating Airflow allowed Airbnb to programmatically author and schedule their workflows and monitor them via the built-in Airflow user ...

  3. Apache NiFi - Wikipedia

    en.wikipedia.org/wiki/Apache_NiFi

    Apache NiFi is a software project from the Apache Software Foundation designed to automate the flow of data between software systems.Leveraging the concept of extract, transform, load (ETL), it is based on the "NiagaraFiles" software previously developed by the US National Security Agency (NSA), which is also the source of a part of its present name – NiFi.

  4. Apache Superset - Wikipedia

    en.wikipedia.org/wiki/Apache_Superset

    Apache Superset is an open-source software application for data exploration and data visualization able to handle data at petabyte scale . The application started as a hack-a-thon project by Maxime Beauchemin (creator of Apache Airflow ) while working at Airbnb and entered the Apache Incubator program in 2017. [ 1 ]

  5. Google Cloud Dataflow - Wikipedia

    en.wikipedia.org/wiki/Google_Cloud_Dataflow

    Google Cloud Dataflow. Google Cloud Dataflow is a fully managed service for executing Apache Beam pipelines within the Google Cloud Platform ecosystem. Dataflow provides a fully managed service for executing Apache Beam pipelines, offering features like autoscaling, dynamic work rebalancing, and a managed execution environment. [1]

  6. Apache Flink - Wikipedia

    en.wikipedia.org/wiki/Apache_Flink

    Apache Flink. Apache Flink is an open-source, unified stream-processing and batch-processing framework developed by the Apache Software Foundation. The core of Apache Flink is a distributed streaming data-flow engine written in Java and Scala. [3][4] Flink executes arbitrary dataflow programs in a data-parallel and pipelined (hence task ...

  7. Apache Hadoop - Wikipedia

    en.wikipedia.org/wiki/Apache_Hadoop

    Apache Hadoop ( / həˈduːp /) is a collection of open-source software utilities that facilitates using a network of many computers to solve problems involving massive amounts of data and computation. [vague] It provides a software framework for distributed storage and processing of big data using the MapReduce programming model.

  8. Apache HBase - Wikipedia

    en.wikipedia.org/wiki/Apache_HBase

    Apache HBase. HBase is an open-source non-relational distributed database modeled after Google's Bigtable and written in Java. It is developed as part of Apache Software Foundation 's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File System) or Alluxio, providing Bigtable-like capabilities for Hadoop.

  9. Apache Velocity - Wikipedia

    en.wikipedia.org/wiki/Apache_Velocity

    Apache Velocity first released in April 2001, is a Java -based template engine that provides a template language to reference objects defined in Java code. It aims to ensure clean separation between the presentation tier and business tiers in a Web application (the model–view–controller design pattern). Velocity is an open source software ...