Tag: Apache Kafka

Message Queues vs. Streaming Systems: Key Differences and Use Cases
Message Queues vs. Streaming Systems: Key Differences and Use Cases

In the world of data processing and messaging systems, terms like “queue” and “streaming” often come up. While they may sound similar, they serve distinct purposes and can significantly impact how systems handle data. Let’s break down their differences in a straightforward way. What Are Message Queues? Imagine a coffee shop where customers place orders […]

Read More
What is Event Streaming in Apache Kafka?
What is Event Streaming in Apache Kafka?

Event streaming is a powerful data processing paradigm where events—small, immutable pieces of data—are continuously produced, captured, and processed in real time. Apache Kafka, an open-source distributed event streaming platform, has become the go-to solution for implementing event streaming in modern systems. Understanding Events and Streams An event is a record of an occurrence, such as a […]

Read More
Reducing Kafka Lag: Optimizing Kafka Performance
Reducing Kafka Lag: Optimizing Kafka Performance

Apache Kafka is a powerful distributed messaging system, but like any system, it can face performance bottlenecks. One of the most common challenges is Kafka lag—the delay between producing and consuming messages. Addressing Kafka lag is crucial for maintaining real-time data pipelines and ensuring optimal performance. In this article, we explore practical strategies to reduce […]

Read More
KRaft: Apache Kafka Without ZooKeeper
KRaft: Apache Kafka Without ZooKeeper

Apache Kafka has been a cornerstone of modern event streaming architectures, enabling reliable and scalable data pipelines for businesses worldwide. Traditionally, Kafka has relied on ZooKeeper for managing metadata, configurations, and cluster coordination. However, the introduction of KRaft (Kafka Raft) marks a significant shift in Kafka’s architecture, eliminating the need for ZooKeeper and simplifying cluster management. What […]

Read More
Understanding Basics of Apache Kafka
Understanding Basics of Apache Kafka

Apache Kafka is an open-source platform designed for building real-time data pipelines and streaming applications. Initially developed by LinkedIn and later donated to the Apache Software Foundation, Kafka has become a cornerstone for handling large-scale, high-throughput, and low-latency data streams. At its core, Kafka operates as a distributed messaging system. It allows systems to publish […]

Read More
Filebeat: Receiving Docker logs in Kafka
Filebeat: Receiving Docker logs in Kafka

To receive logs from your containers in Kafka topic, we have to do these steps: Install Filebeat echo “deb https://artifacts.elastic.co/packages/8.x/apt stable main” | sudo tee -a /etc/apt/sources.list.d/elastic-8.x.list sudo apt-get update && sudo apt-get install filebeat   2. Edit the configuration file /etc/filebeat/filebeat.yml with these lines  3. Enable Filebeat service sudo systemctl enable filebeat 4. Restart Filebeat.

Read More