Since its establishment in 1999, the Apache Software Foundation has successfully established its own strong ecosystem. Many excellent open source projects have emerged in its community, and more and more domestic and foreign projects are coming to this international open source community for incubation.
It is reported thatall current Apache projects need to be incubated by incubators and meet a series of quality requirements before they can graduate. Projects that graduate from the incubator either become top-level projects independently or become sub-projects of other top-level projects.
To help everyone understand the standards of Apache incubation, this article counts several top-level projects that were successfully incubated and independently managed by Apache from January 1, 2016 to January 19, 2017.
1. Apache Beam
Apache Beam is an incubation project contributed by Google to the Apache Foundation on February 1, 2016. It was officially launched on January 10, 2017. Announced graduation and upgraded to Apache top-level project.
The main goal of Apache Beam is to unify the programming paradigms of batch and stream processing and provide a simple, flexible, feature-rich and expressive SDK for unlimited, out-of-order, web-scale data set processing. The project focuses on the programming paradigm and interface definition of data processing, and does not involve the implementation of specific execution engines. Apache Beam hopes that data processing programs developed based on Beam can be executed on any distributed computing engine.
2. Apache Eagle
Apache Eagle originated from eBay and was first used to solve the monitoring problem of large-scale Hadoop clusters. It was submitted to Apache on October 26, 2015. Incubated, it was officially announced to graduate as an Apache top-level project on January 10, 2017.
Apache Eagle is an open source monitoring and alerting solution for intelligently identifying security and performance issues on big data platforms in real-time, such as Apache Hadoop, Apache Spark, and more. Apache Eagle mainly includes: high scalability, high scalability, low latency, dynamic collaboration and other features. It supports real-time monitoring of data behavior, can immediately detect access to sensitive data or malicious operations, and take immediate countermeasures.
3. Apache Geode
Apache Geode was originally developed by Gemstone Systems as a commercial product. It was widely used in the financial field in the early days as a transactional, low-latency The data engine is used in Wall Street trading platforms. The code was submitted to the Apache Incubator on April 27, 2015, and graduated as an Apache top-level project on November 21, 2016.
Apache Geode is a data management platform that provides real-time, consistent access to data-critical applications across the entire cloud architecture. It uses dynamic data replication and partitioning technology to achieve high availability, high performance, high scalability, and fault tolerance. In addition, for a distributed data container, Apache Geode is a memory-based data management system that provides reliable asynchronous event notification and reliable message delivery.
4. Apache Twill
Apache Twill submitted the code to the Apache Incubator on November 14, 2013, and announced its graduation on July 27, 2016. Apache top-level project.
Apache Twill provides rich built-in features for common distributed applications for development, deployment, and management, greatly simplifying Hadoop cluster operations and management. It has become a key component behind the Cask Data Application Platform (CDAP), using YARN containers and Java threads as abstractions. CDAP is an open source integration and application platform that enables developers and organizations to easily build, deploy and manage data applications on Hadoop and Spark.
5. Apache Kudu
Apache Kudu is a data storage system developed by Cloudera. It became an Apache incubation project on December 3, 2015 and became an Apache incubation project in July 2016. Graduation was officially announced on May 25th and upgraded to an Apache top-level project.
Apache Kudu is an open source column storage engine built for the Hadoop ecosystem and designed to enable flexible, high-performance analytics pipelines. It supports many operations found in traditional databases, including real-time inserts, updates, and deletes. It is currently used by different companies and organizations in many industries, including retail, online service delivery, risk management, digital advertising, etc., and the more familiar one is Xiaomi.
6, Apache Bahir
The code of Apache Bahir was originally extracted from the Apache Spark project, and later provided as an independent project and released in 2016 It was announced as an Apache top-level project on June 29, 2016.
Apache Bahir expands the coverage of the analysis platform by providing diversified streaming connectors and SQL data sources. Initially it only provides extensions for Apache Spark, and currently also provides Apache Flink, and may also provide extensions for Apache in the future. Expansion services are available on Beam and more platforms.
For more Apache related technical articles, please visit theApache Usage Tutorialcolumn to learn!
The above is the detailed content of What are the top projects of apache. For more information, please follow other related articles on the PHP Chinese website!