Apache Kafka Connect Training Course
Kafka Connect serves as an API designed to facilitate the movement of extensive data collections between Apache Kafka and various other systems.
This instructor-led, live training (available online or onsite) targets developers aiming to integrate Apache Kafka with existing databases and applications for tasks such as processing and analysis.
Upon completion of this training, participants will be capable of:
- Utilizing Kafka Connect to ingest substantial volumes of data from a database into Kafka topics.
- Ingesting log data generated by application servers into Kafka topics.
- Making collected data accessible for stream processing.
- Exporting data from Kafka topics to secondary systems for storage and analysis.
Format of the Course
- Interactive lecture and discussion.
- Extensive exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to arrange.
Course Outline
Introduction
Overview of the Apache Kafka Ecosystem (Zookeeper, Streams, Connect, etc.)
The Components of a Kafka Connect Cluster
Installing and Running Apache Kafka
Configuring the Connectors
Ingesting Database Data into Apache Kafka
Applying Transformations
Ingesting Data from a Web Server Log
Validating the Connection
Managing Connections with the REST API
Ingesting Real-time Data from the Web
Case Study: Reading and Transforming Data from Twitter
Processing and Analyzing Data with Kafka Streams
Deploying Kafka Connect
Writing Your Own Connector
Defining Dynamic Input/Output Streams
Monitoring and Managing Kafka Connect in Production
Troubleshooting
Summary and Conclusion
Requirements
- Experience with Apache Kafka.
- Java programming experience.
Audience
- Developers
Open Training Courses require 5+ participants.
Apache Kafka Connect Training Course - Booking
Apache Kafka Connect Training Course - Enquiry
Apache Kafka Connect - Consultancy Enquiry
Testimonials (2)
Possibility to perform independent exercises in the training environment.
Tomasz - PKO Zycie Towarzystwo Ubezpieczen S.A.
Course - Kafka for Administrators
The trainer tried to make the most complicated topics , explain it in simpler way
Calvin Raj Antony - SICPA SA
Course - Administration of Kafka Message Queue
Upcoming Courses
Related Courses
Administration of Confluent Apache Kafka
21 HoursConfluent Apache Kafka is a distributed event streaming platform engineered for high-throughput, fault-tolerant data pipelines and real-time analytics.
This instructor-led, live training (available online or onsite) is designed for intermediate-level system administrators and DevOps professionals who aim to install, configure, monitor, and troubleshoot Confluent Apache Kafka clusters.
Upon completion of this training, participants will be able to:
- Grasp the components and architecture of Confluent Kafka.
- Deploy and manage Kafka brokers, Zookeeper quorums, and essential services.
- Configure advanced features such as security, replication, and performance tuning.
- Utilize management tools to monitor and maintain Kafka clusters.
Format of the Course
- Interactive lecture and discussion.
- Extensive exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to make arrangements.
Confluent Apache Kafka: Cluster Operations and Configuration
16 HoursConfluent Apache Kafka is an enterprise-grade distributed event streaming platform built on Apache Kafka. It supports high-throughput, fault-tolerant data pipelines and real-time streaming applications.
This instructor-led, live training (online or onsite) is aimed at intermediate-level engineers and administrators who wish to deploy, configure, and optimize Confluent Kafka clusters in production environments.
By the end of this training, participants will be able to:
- Install, configure, and operate Confluent Kafka clusters with multiple brokers.
- Design high-availability setups using Zookeeper and replication techniques.
- Tune performance, monitor metrics, and apply recovery strategies.
- Secure, scale, and integrate Kafka with enterprise environments.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Building Kafka Solutions with Confluent
14 HoursThis instructor-led, live training (available online or onsite) is designed for engineers who want to leverage Confluent (a distribution of Kafka) to build and manage a real-time data processing platform for their applications.
Upon completion of this training, participants will be able to:
- Install and configure the Confluent Platform.
- Utilize Confluent’s management tools and services to simplify Kafka operations.
- Store and process incoming stream data effectively.
- Optimize and manage Kafka clusters.
- Secure data streams.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- This course is based on the open-source version of Confluent: Confluent Open Source.
- To request customized training for this course, please contact us to arrange it.
Building Data Pipelines with Apache Kafka
7 HoursApache Kafka serves as a distributed streaming platform and has become the de facto standard for developing data pipelines. It addresses a wide array of data processing scenarios, functioning effectively as a message queue, a distributed log, a stream processor, and more.
This course begins by exploring the theoretical foundations of data pipelines in general, followed by an in-depth look at the core concepts underlying Kafka. We will also examine essential components such as Kafka Streams and Kafka Connect.
Distributed Messaging with Apache Kafka
14 HoursDesigned for enterprise architects, developers, system administrators, and anyone eager to master high-throughput distributed messaging, this course provides comprehensive insights into Apache Kafka. If your focus is narrower (such as exclusively system administration), the curriculum can be customized to align with your specific objectives.
Kafka for Administrators
21 HoursThis live, instructor-led training in Czech Republic (online or onsite) is aimed at beginner-level, intermediate-level, and advanced-level system administrators and operations engineers who wish to use Apache Kafka to deploy, secure, monitor, and troubleshoot Kafka clusters.
By the end of this training, participants will be able to explain Kafka architecture and KRaft mode, operate and secure Kafka clusters, monitor performance and reliability, and resolve common production issues.
Apache Kafka for Developers
21 HoursThis instructor-led, live training in Czech Republic (online or onsite) is aimed at intermediate-level developers who wish to develop big data applications with Apache Kafka.
By the end of this training, participants will be able to:
- Develop Kafka producers and consumers to send and read data from Kafka.
- Integrate Kafka with external systems using Kafka Connect.
- Write streaming applications with Kafka Streams & ksqlDB.
- Integrate a Kafka client application with Confluent Cloud for cloud-based Kafka deployments.
- Gain practical experience through hands-on exercises and real-world use cases.
Apache Kafka for Python Programmers
7 HoursThis instructor-led live training, offered Czech Republic (online or onsite), is aimed at data engineers, data scientists, and developers who wish to utilize Apache Kafka features for data streaming with Python.
By the end of this training, participants will be able to use Apache Kafka to monitor and manage conditions in continuous data streams using Python programming.
Kafka Fundamentals for Java Developers
14 HoursThis instructor-led, live training in Czech Republic (online or onsite) is aimed at intermediate-level Java developers who wish to integrate Apache Kafka into their applications for reliable, scalable, and high-throughput messaging.
By the end of this training, participants will be able to:
- Understand the architecture and core components of Kafka.
- Set up and configure a Kafka cluster.
- Produce and consume messages using Java.
- Implement Kafka Streams for real-time data processing.
- Ensure fault tolerance and scalability in Kafka applications.
Administration of Kafka Message Queue
14 HoursThis instructor-led, live training in Czech Republic (online or onsite) is designed for system administrators with an intermediate skill set who want to effectively utilize Kafka's message queuing features.
Upon completion of this training, participants will be equipped to:
- Comprehend Kafka's message queuing capabilities and underlying architecture.
- Set up Kafka topics tailored for message queuing scenarios.
- Produce and consume messages via Kafka.
- Monitor and manage Kafka when used as a message queue.
Security for Apache Kafka
7 HoursThis instructor-led, live training in Czech Republic (online or onsite) is aimed at software testers who wish to implement network security measures into an Apache Kafka application.
By the end of this training, participants will be able to:
- Deploy Apache Kafka onto a cloud based server.
- Implement SSL encryption to prevent attacks.
- Add ACL authentication to track and control user access.
- Ensure credible clients have access to Kafka clusters with SSL and SASL authentication.
Apache Kafka and Spring Boot
7 HoursThis instructor-led, live training in Czech Republic (online or onsite) is aimed at intermediate-level developers who wish to learn the fundamentals of Kafka and integrate it with Spring Boot.
By the end of this training, participants will be able to:
- Understand Kafka and its architecture.
- Learn how to install, configure, and set up a basic Kafka environment.
- Integrate Kafka with Spring Boot.
Stream Processing with Kafka Streams
7 HoursKafka Streams is a client-side library designed for building applications and microservices where data flows through a Kafka messaging system. Traditionally, Apache Kafka has depended on Apache Spark or Apache Storm to handle data processing between message producers and consumers. By invoking the Kafka Streams API directly within an application, data can be processed internally within Kafka, eliminating the need to forward data to a separate cluster for processing.
In this instructor-led live training, participants will learn how to integrate Kafka Streams into a series of sample Java applications that exchange data with Apache Kafka for stream processing.
By the end of this training, participants will be able to:
- Understand Kafka Streams features and advantages over other stream processing frameworks
- Process stream data directly within a Kafka cluster
- Write a Java or Scala application or microservice that integrates with Kafka and Kafka Streams
- Write concise code that transforms input Kafka topics into output Kafka topics
- Build, package and deploy the application
Audience
- Developers
Format of the course
- Part lecture, part discussion, exercises and heavy hands-on practice
Notes
- To request a customized training for this course, please contact us to arrange
Administration of Kafka Topic
14 HoursThis guided, live training in Czech Republic (online or in-person) is designed for beginner to intermediate system administrators who wish to learn how to effectively manage Kafka topics for efficient data streaming and processing.
Upon completion of this training, participants will be able to:
- Grasp the fundamentals and architecture of Kafka topics.
- Create, configure, and administer Kafka topics.
- Monitor Kafka topics for health, performance, and availability.
- Apply security protocols to Kafka topics.
SMACK Stack for Data Science
14 HoursThis instructor-led live training in Czech Republic (online or onsite) targets data scientists who wish to leverage the SMACK stack to build data processing platforms for big data solutions.
By the end of this training, participants will be able to:
- Implement a data pipeline architecture for processing big data.
- Develop cluster infrastructure using Apache Mesos and Docker.
- Analyze data with Spark and Scala.
- Manage unstructured data with Apache Cassandra.