Apache Kudu
{{Short description|Open-source column-oriented data store}}
{{Infobox software
| name = Apache Kudu
| logo = Apache Kudu logo.svg
| screenshot =
| caption = Apache Kudu
| other_names = Kudu
| developer = {{URL|https://kudu.apache.org/committers.html|Apache Kudu Committers and PMC Members}}
| latest release version = 1.16.0
| latest release date = {{Start date and age|2022|04|11|df=yes}}{{cite web|title=Apache Kudu - Releases|url=https://kudu.apache.org/releases/|access-date=27 September 2022|quote=Kudu 1.16.0 was released on Apr 11, 2022.}}
| latest preview version =
| latest preview date =
| operating system = Linux, macOS
| repo = {{URL|https://gitbox.apache.org/repos/asf?p{{=}}kudu.git|Kudu Repository}}
| programming language = C++
| genre = Database management system, Distributed data store
| license = Apache License 2.0{{Cite web|url=https://kudu.apache.org/faq.html#project-status|title=Project Status|date=2017-05-21|language=en-US|access-date=2017-05-21|archive-url=https://web.archive.org/web/20170521110002/https://kudu.apache.org/faq.html|archive-date=2017-05-21|url-status=live|quote=Is Kudu open source? Yes, Kudu is open source and licensed under the Apache Software License, version 2.0. Apache Kudu is a top level project (TLP) under the umbrella of the Apache Software Foundation.}}
}}
Apache Kudu is a free and open source column-oriented data store of the Apache Hadoop ecosystem. It is compatible with most of the data processing frameworks in the Hadoop environment. It provides completeness to Hadoop's storage layer to enable fast analytics on fast data.{{cite web |url=https://kudu.apache.org/ |title=Home |website=kudu.apache.org}}
The open source project to build Apache Kudu began as internal project at Cloudera.{{Cite web|url=https://kudu.apache.org/faq.html#project-status|title=Why was Kudu developed internally at Cloudera before its release?|date=2017-05-21|language=en-US|access-date=2017-05-21}} The first version Apache Kudu 1.0 was released 19 September 2016.{{Cite web|url=https://kudu.apache.org/releases/|title=Apache Kudu releases|date=2017-05-21|language=en-US|access-date=2017-05-21|archive-url=https://web.archive.org/web/20170521105214/https://kudu.apache.org/releases/|archive-date=2017-05-21|url-status=live|quote=Kudu 1.0.0 was released on September 19, 2016. It is the first release not considered "beta". [...] Kudu 0.5.0 (beta) was released on Sep 28, 2015. It was the first public version of Kudu.}}
Comparison with other storage engines
Kudu was designed and optimized for OLAP workloads. Like HBase, it is a real-time store that supports key-indexed record lookup and mutation.{{Cite web|url=https://kudu.apache.org/faq.html#project-motivation|title=Why build a new storage engine? Why not just improve Apache HBase to increase its scan speed?|date=2017-05-21|language=en-US|access-date=2017-05-21|archive-url=https://web.archive.org/web/20170521110002/https://kudu.apache.org/faq.html|archive-date=2017-05-21|url-status=live}} Kudu differs from HBase since Kudu's datamodel is a more traditional relational model, while HBase is schemaless. Kudu's "on-disk representation is truly columnar and follows an entirely different storage design than HBase/Bigtable".
See also
{{Portal|Free and open-source software}}
References
{{Reflist|30em}}
External links
- {{Official website|//kudu.apache.org/}}
- [https://github.com/apache/kudu Apache Kudu GitHub repository]
{{Apache Software Foundation}}
{{DEFAULTSORT:Kudu}}
Category:Software using the Apache license
Category:Free software programmed in C++
Category:Free database management systems
Category:Online analytical processing
Category:Data warehousing products
Category:Data analysis software