Apache Kafka Interview Questions and Answers



Q1). What is Apache Kafka?
Ans: Apache Kafka is a publish-subscribe messaging system maintained by the Apache Software Foundation and written in Scala and Java. It is a distributed, partitioned and replicated commit log service.
Q2). What are the traditional methods of message transfer?
Ans: Traditional messaging follows one of two models:
  •  Queuing: A pool of consumers reads from the server, and each message goes to exactly one of them.
  •  Publish-Subscribe: Each message is broadcast to all consumers.
Kafka offers a single consumer abstraction that generalizes both models: the consumer group. Each message published to a topic is delivered to one consumer within each subscribing consumer group.
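To make the consumer group idea concrete, here is a minimal toy sketch in plain Python. It is not the Kafka client API: the `deliver` function and the group and consumer names are hypothetical, and real Kafka assigns whole partitions to consumers rather than handing out individual messages round-robin.

```python
from collections import defaultdict
from itertools import cycle

def deliver(messages, groups):
    """Toy model: every group sees every message; within a group,
    each message goes to exactly one consumer (round-robin here)."""
    received = defaultdict(list)
    # One round-robin iterator per group (an assumption for illustration).
    pickers = {name: cycle(members) for name, members in groups.items()}
    for msg in messages:
        for name in groups:
            consumer = next(pickers[name])
            received[consumer].append(msg)
    return dict(received)

messages = ["m1", "m2", "m3", "m4"]

# One group with two consumers behaves like a queue.
queue_like = deliver(messages, {"g1": ["a", "b"]})

# Two groups with one consumer each behave like publish-subscribe.
pubsub_like = deliver(messages, {"g1": ["a"], "g2": ["b"]})
```

With all consumers in one group the messages are split between them; with each consumer in its own group every consumer receives every message.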
Q3). What are the benefits of Apache Kafka over traditional techniques?
Ans: Apache Kafka has the following benefits over traditional messaging techniques:
  •  Fast: A single Kafka broker can serve thousands of clients, handling megabytes of reads and writes per second.
  •  Scalable: Data is partitioned and spread over a cluster of machines, so a topic can hold more data than fits on any single machine.
  •  Durable: Messages are persisted on disk and replicated within the cluster to prevent data loss.
  •  Distributed by Design: It provides fault tolerance guarantees and durability.
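The "Scalable" and "Durable" points can be illustrated with a small sketch of how keys might map to partitions and how partition replicas might be spread over brokers. This is an illustration under stated assumptions, not Kafka's actual algorithm: the function names are hypothetical, the hash is CRC32 (Kafka's default partitioner uses a different hash), and the replica placement is a simple round-robin.

```python
import zlib

def partition_for(key: str, num_partitions: int) -> int:
    """Map a message key to a partition with a stable hash (CRC32 here)."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions

def assign_replicas(num_partitions, brokers, replication_factor):
    """Place each partition's replicas on consecutive brokers, round-robin."""
    n = len(brokers)
    return {
        p: [brokers[(p + i) % n] for i in range(replication_factor)]
        for p in range(num_partitions)
    }

# Three partitions, three brokers, two copies of every partition.
layout = assign_replicas(3, ["b0", "b1", "b2"], 2)

# The same key always lands in the same partition.
same = partition_for("user-42", 3) == partition_for("user-42", 3)
```

In this layout every partition lives on two different brokers, so losing any single broker still leaves one copy of each partition.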
Q4). What is a broker in Kafka?
Ans: In a Kafka cluster, the term broker refers to a server.
Q5). Compare Kafka and Flume.
Ans:
Criteria            | Kafka                                     | Flume
Data flow           | Pull                                      | Push
Hadoop integration  | Loose                                     | Tight
Functionality       | Publish-subscribe model messaging system  | System for data collection, aggregation & movement
Q6). What role does ZooKeeper play in a Kafka cluster?
Ans: Kafka is an open-source distributed system that was built to use ZooKeeper. ZooKeeper's basic responsibility is to coordinate the different nodes in a cluster. Offsets are also committed periodically to ZooKeeper (this was the behavior of older Kafka consumers), so that if a node fails, consumption can resume from the last committed offset.
ZooKeeper is also responsible for configuration management, leader detection, detecting when a node leaves or joins the cluster, synchronization, etc.
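The recovery-from-committed-offset idea can be sketched with a toy model. Everything here is hypothetical: `consume` is not a Kafka or ZooKeeper call, and the plain dictionary `store` stands in for wherever offsets are kept.

```python
def consume(log, store, group, commit_every, crash_at=None):
    """Process `log` starting from the group's last committed offset.

    Commits the next offset to `store` every `commit_every` messages.
    If `crash_at` is given, stop just before processing that offset
    to simulate a node failure.
    """
    start = store.get(group, 0)
    processed = []
    for offset in range(start, len(log)):
        if crash_at is not None and offset == crash_at:
            return processed  # simulated failure, no final commit
        processed.append(log[offset])
        if (offset + 1) % commit_every == 0:
            store[group] = offset + 1
    store[group] = len(log)
    return processed

log = ["a", "b", "c", "d", "e"]
store = {}

# First run crashes before offset 3; only offset 2 was committed.
first = consume(log, store, "g", commit_every=2, crash_at=3)

# Second run resumes from the committed offset, so "c" is processed again.
second = consume(log, store, "g", commit_every=2)
```

Because commits are periodic, a message processed after the last commit is seen again after recovery, which is why this style of offset tracking gives at-least-once processing.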
Q7). What is Kafka?
Ans: Kafka is a message broker project written in Scala. It was originally developed at LinkedIn and released as open source in early 2011. The goal of the project is to provide a unified platform for handling real-time data feeds.
Q8). Why is replication considered critical in Kafka?
Ans: Replication ensures that published messages are not lost and can still be consumed in the event of a machine error, a program fault or frequent software upgrades.
Q9). What major role does the Kafka Producer API play?
Ans: It wraps the two producers, kafka.producer.SyncProducer and kafka.producer.async.AsyncProducer. The goal is to expose all producer functionality to the client through a single API.
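The idea of one API in front of a synchronous and an asynchronous producer can be sketched as follows. This is a toy model, not the Kafka producer: `ToyProducer`, its parameters and the `transport` callable are all hypothetical names chosen for illustration.

```python
class ToyProducer:
    """Single send() entry point over two delivery modes.

    sync=True  : each message is handed to the transport immediately.
    sync=False : messages are buffered and sent in batches.
    """

    def __init__(self, transport, sync=True, batch_size=3):
        self.transport = transport  # callable that receives a list of messages
        self.sync = sync
        self.batch_size = batch_size
        self.buffer = []

    def send(self, message):
        if self.sync:
            self.transport([message])
        else:
            self.buffer.append(message)
            if len(self.buffer) >= self.batch_size:
                self.flush()

    def flush(self):
        """Send whatever is still buffered (no-op in sync mode)."""
        if self.buffer:
            self.transport(list(self.buffer))
            self.buffer.clear()

# Synchronous mode: one transport call per message.
sync_calls = []
p = ToyProducer(sync_calls.append, sync=True)
p.send("a")
p.send("b")

# Asynchronous mode: messages go out in batches of two.
async_calls = []
q = ToyProducer(async_calls.append, sync=False, batch_size=2)
for m in ["x", "y", "z"]:
    q.send(m)
q.flush()
```

The caller uses the same `send` call in both modes; only the constructor argument decides whether messages are delivered one at a time or in batches.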
Q10). Distinguish between Kafka and Flume.
Ans: Flume's major use case is ingesting data into Hadoop. Flume is integrated with Hadoop's monitoring system, file formats, file system and utilities such as Morphlines. Flume's design of sources, channels and sinks means that it can also move data between other systems flexibly, but its main feature is its Hadoop integration.
Flume is the best option when you have non-relational data sources, for example a long log file to stream into Hadoop.
Kafka's major use case is as a distributed publish-subscribe messaging system. Kafka was not developed specifically for Hadoop, and using Kafka to read and write data to Hadoop is considerably trickier than it is with Flume.
Kafka is a good choice when you need a highly reliable and scalable enterprise messaging system to connect multiple systems, Hadoop among them.
