Apache Kafka

Apache Kafka[1]
Original author(s): LinkedIn
Developer(s): Apache Software Foundation
Initial release: January 2011[2]
Stable release: 3.8.0 (26 July 2024)[3]
Written in: Scala, Java
Operating system: Cross-platform
Type: Stream processing, message broker
License: Apache License 2.0
Website: kafka.apache.org

Apache Kafka is a distributed event store and stream-processing platform. It is an open-source system, written in Java and Scala, developed by the Apache Software Foundation. The project aims to provide a unified, high-throughput, low-latency platform for handling real-time data feeds. Kafka can connect to external systems (for data import and export) via Kafka Connect, and provides the Kafka Streams libraries for stream-processing applications. Kafka uses a binary TCP-based protocol that is optimized for efficiency and relies on a "message set" abstraction that naturally groups messages together to reduce the overhead of the network round trip. This "leads to larger network packets, larger sequential disk operations, contiguous memory blocks [...] which allows Kafka to turn a bursty stream of random message writes into linear writes."[4]
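The effect of grouping messages can be illustrated with a small, self-contained sketch. It is not Kafka's actual wire format or client API: the 4-byte length-prefix framing and the function names `encode_message_set`, `decode_message_set`, and `batch` are assumptions made purely for illustration. The sketch only shows how several small messages can be packed into one contiguous buffer, so that one write replaces many.

```python
import struct

# Illustrative only: a toy "message set", not Kafka's real record-batch format.

def encode_message_set(messages):
    """Frame each message as a 4-byte big-endian length followed by its bytes,
    then concatenate all frames into one contiguous buffer."""
    return b"".join(struct.pack(">I", len(m)) + m for m in messages)

def decode_message_set(buffer):
    """Inverse of encode_message_set: walk the buffer frame by frame."""
    messages, offset = [], 0
    while offset < len(buffer):
        (length,) = struct.unpack_from(">I", buffer, offset)
        offset += 4
        messages.append(buffer[offset:offset + length])
        offset += length
    return messages

def batch(messages, max_batch_size):
    """Split a list of messages into groups of at most max_batch_size."""
    return [messages[i:i + max_batch_size]
            for i in range(0, len(messages), max_batch_size)]

# Ten small messages, grouped four at a time (hypothetical batch size).
messages = [f"event-{i}".encode() for i in range(10)]
payloads = [encode_message_set(group) for group in batch(messages, max_batch_size=4)]

# Sent individually, the messages would need one write each;
# grouped, they need only one write per payload.
print(len(messages), "messages packed into", len(payloads), "payloads")
```

In this toy version, each payload could be handed to a single socket send or a single sequential file append, which is the kind of larger, linear operation the quoted passage describes.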

  1. "Apache Kafka at GitHub". github.com. Archived from the original on 16 January 2023. Retrieved 5 March 2018.
  2. "Open-sourcing Kafka, LinkedIn's distributed message queue". Archived from the original on 26 December 2022. Retrieved 27 October 2016.
  3. "Release 3.8.0". 26 July 2024. Retrieved 20 August 2024.
  4. "Efficiency". kafka.apache.org. Retrieved 19 September 2019.