Course Outline
Kafka Administration Essentials
- Where Kafka fits in a modern data platform and typical production responsibilities
- Core concepts for operators: brokers, topics, partitions, offsets, consumer groups
- Replication fundamentals: leaders and followers, in-sync replicas, availability trade-offs
- Kafka operational highlights and common terminology used in runbooks
KRaft Mode and Cluster Design
- KRaft basics: controllers, metadata quorum, elections, and why it matters operationally
- Deployment planning: sizing for throughput, partitions, retention, and growth
- Node roles and layouts: combined vs dedicated controllers, fault domain considerations
- Lab: inspect KRaft metadata, validate quorum health, and interpret controller logs
Installation, Configuration, and Day-to-Day Operations
- Installation approaches (packages, tarball, containers) and what to standardize in enterprise environments
- Core broker configuration that impacts reliability: listeners, replication, log directories, retention
- Safe service operations: startup order, graceful shutdown, and validation checks
- Lab: deploy a multi-node cluster, verify broker registration, and confirm baseline produce and consume
Managing Topics, Partitions, and Data Placement
- Topic lifecycle using the Kafka CLI: create, describe, update configs, delete
- Choosing partitions and replication factors for real workloads, including common anti-patterns
- Reassignments and balancing: when to move partitions and how to verify progress safely
- Lab: create topics, trigger a partition reassignment, simulate a broker outage, and confirm recovery
Securing Kafka for Production
- TLS for client and inter-broker traffic: certificates, trust chains, and validation steps
- Authentication with SASL: selecting common mechanisms and avoiding misconfiguration
- Authorization with ACLs: least-privilege patterns for admins, producers, and consumers
- Lab: enable TLS and SASL, validate client connectivity, and apply ACLs for application roles
Observability, Reliability, and Troubleshooting
- Monitoring essentials: controller health, under-replicated partitions, request latency, disk and network saturation
- Logs and metrics: reading broker logs and exposing metrics via JMX exporter to common observability stacks
- Operational playbooks: rolling restarts, safe config changes, handling disk-full and ISR issues
- Lab: build a minimal alert set, diagnose a degraded cluster, and restore healthy replication
Upgrades and Disaster Recovery Readiness
- Upgrade planning for Kafka: compatibility checks, staging, and rollback approach
- Backups and recovery expectations: what can be backed up, what cannot, and configuration recovery basics
- Cross-cluster replication overview and when to use MirrorMaker 2 for DR and migrations
- Wrap-up: operational checklist, handover artifacts, and next steps for production rollout
Requirements
- An understanding of basic Linux administration (users, services, files, permissions)
- Experience with TCP/IP networking concepts (DNS, ports, firewalls, load balancers)
- Basic scripting experience (Bash, PowerShell, or similar) for routine operational tasks
Audience
- Kafka administrators and platform engineers responsible for operating Kafka clusters
- Site reliability engineers and DevOps engineers supporting streaming platforms
- Infrastructure and operations teams deploying new KRaft-based Kafka clusters or migrating from ZooKeeper
Delivery Options
Private Group Training
Our identity is rooted in delivering exactly what our clients need.
- Pre-course call with your trainer
- Customisation of the learning experience to achieve your goals -
- Bespoke outlines
- Practical hands-on exercises containing data / scenarios recognisable to the learners
- Training scheduled on a date of your choice
- Delivered online, onsite/classroom or hybrid by experts sharing real world experience
Private Group Prices RRP from €6840 online delivery, based on a group of 2 delegates, €2160 per additional delegate (excludes any certification / exam costs). We recommend a maximum group size of 12 for most learning events.
Contact us for an exact quote and to hear our latest promotions
Public Training
Please see our public courses
Testimonials (5)
Possibility to perform independent exercises in the training environment.
Tomasz - PKO Zycie Towarzystwo Ubezpieczen S.A.
Course - Kafka for Administrators
To the point, proper pace (bash basics required though)
Krzysztof - Agora SA
Course - Kafka for Administrators
Trainer accepts questions at any time of the session, even if the subject was taught few days past.
GOODLUCK MASHIMBA - Tanzania Revenue Authority
Course - Kafka for Administrators
Nice presentation skill
Md Maruf Hossain - ATOS PGS sp. z o.o.
Course - Kafka for Administrators
Grate skills, examples, very good exercises