Kafka Commands

How to start ZooKeeper?
> zkServer.sh start

How to start a Kafka broker?
> kafka-server-start.sh /home/npntraining/opt/kafka_2.11-0.10.2.1/config/server.properties

How to create a topic?
> kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic Hello-Kafka
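
Once the topic exists, it can be useful to confirm that it was created. The following is a minimal sketch, not part of the original post: it assumes ZooKeeper is reachable at localhost:2181 as in the commands above and that the Kafka bin directory is on the PATH. The DRY_RUN variable and the run helper are hypothetical names introduced here so the script only prints the commands unless DRY_RUN=0 is set.

```shell
#!/bin/sh
# Assumptions: ZooKeeper at localhost:2181 and Kafka's bin directory on PATH.
ZK="localhost:2181"
TOPIC="Hello-Kafka"
DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands, 0 = execute them

# Hypothetical helper: print the command in dry-run mode, otherwise run it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# List all topics known to the cluster.
run kafka-topics.sh --list --zookeeper "$ZK"

# Show partition and replica details for the new topic.
run kafka-topics.sh --describe --zookeeper "$ZK" --topic "$TOPIC"
```

Running the script with DRY_RUN=0 against a live broker would execute both commands instead of printing them.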

Message Retention in Kafka

The retention period of records in Kafka is configurable, and the default is 7 days. Retention can be set per topic, so each topic in the cluster can have its own retention period. The broker-wide default is defined in the server.properties file of the Apache Kafka distribution as log.retention.hours=168, and a topic-level retention.ms setting overrides it for an individual topic. Let's say the …
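
The relationship between the 7-day default and the value 168 can be checked with a short shell sketch. The per-topic override printed at the end is an illustration rather than something taken from the post: it assumes the kafka-configs.sh tool with the topic-level retention.ms setting, ZooKeeper at localhost:2181, and a topic named Hello-Kafka. The command is only echoed, not executed.

```shell
#!/bin/sh
# Convert a retention period in days to the units Kafka settings expect.
retention_days=7
retention_hours=$((retention_days * 24))
retention_ms=$((retention_hours * 60 * 60 * 1000))

# Broker-wide default, as it would appear in server.properties.
echo "log.retention.hours=${retention_hours}"

# Assumed per-topic override (printed only; topic name is illustrative).
echo "kafka-configs.sh --zookeeper localhost:2181 --alter --entity-type topics --entity-name Hello-Kafka --add-config retention.ms=${retention_ms}"
```

Changing retention_days to another value gives the matching hours and milliseconds for a shorter or longer retention period.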

What is a Distributed System

A distributed system is a model in which components located on networked computers communicate and coordinate their actions by passing messages. Some important points: Distributed systems are designed to spread the load across the system and process it simultaneously. To achieve simultaneous processing, the load …

Apache Spark Use Cases

Apache Spark is an analytics engine that can process huge data volumes much faster than MapReduce, largely because it can keep intermediate data in memory instead of writing it to disk between processing steps. That is why it has been catching the attention of both professionals and the press. It was first developed at the AMPLab of the University of California, …