Course Outline
Enterprise Architecture and Pipeline Design
- Multi-layer ETL architectures.
- Designing modular and reusable components.
- Hybrid approaches across systems.
Advanced Performance Engineering
- Step-level optimization.
- Parallelism and threading strategies.
- Monitoring high-load pipelines.
Automation, Scripting, and Custom Extensions
- Scripting inside transformations.
- Developing custom plugins.
- Extending PDI capabilities with Java and JavaScript.
Complex Data Processing and Integrations
- Real-time and streaming integrations.
- Working with big data platforms.
- Advanced file and API processing.
Data Governance, Security, and Compliance
- Securing transformations and credentials.
- Data lineage and traceability.
- Regulatory and compliance considerations.
Enterprise Orchestration and Scheduling
- Managing large job networks.
- Error recovery and failover design.
- Environment-level orchestration.
Repository, Version Control, and CI/CD
- Enterprise repository strategies.
- Integrating PDI with Git.
- Continuous deployment patterns.
Deployment, Monitoring, and Production Operations
- Promoting solutions across environments.
- Operational tooling and dashboards.
- End-to-end production readiness.
Summary and Next Steps
Requirements
- Knowledge of ETL pipelines and data modeling concepts.
- Practical experience with intermediate-level PDI transformations.
- Strong proficiency in SQL and scripting.
Audience
- Senior data engineers.
- ETL architects.
- Professionals responsible for managing complex data integration workloads.
21 Hours
Testimonials (2)
Very useful because it helps me understand what we can do with the data in our context. It will also help me
Nicolas NEMORIN - Adecco Groupe France
Course - KNIME Analytics Platform for BI
It's a hands-on session.