Applied Data Engineering Course (online)
Description
COURSE OBJECTIVES
On successful completion of this course, trainees will be able to:
-
Demonstrate a comprehensive understanding of essential data analytics topics, establishing a strong foundation for further exploration and application in the field of data engineering.
-
Apply Data Warehousing Concepts to design and construct efficient data warehouses, including the ability to articulate and implement key architectural principles.
-
Execute data sourcing and cleaning processes effectively, utilizing scripting techniques to ensure the accuracy and reliability of data integrated into the warehouse.
-
Develop practical skills in ETL (Extract, Transform, Load) processes using industry-standard tools such as SSIS and Talend, enabling trainees to design and implement robust data pipelines.
-
Implement advanced testing techniques in ETL processes, ensuring the quality and integrity of data transformations, and demonstrate proficiency in configuration management for maintaining the reliability of the data pipeline.
COURSE SYLLABUS
- Introduction to essential data analytics topics
- Data Warehousing Concepts
- Data Warehousing Architecture
- Data Warehousing Design: Building Blocks
- Data Sourcing
- Data Cleaning
- Script Requirements for Data Sourcing
- Metadata
- Data Integration
- File Validation
- The Staging Layer
- Business Validation Layer
- Data Warehouse Layer
- Developing ETL with SSIS
- Developing ETL with Talend
- Testing Techniques in ETL
- Mapping examples
- Audit, Balance and Control
- Configuration Management
- Implementing ChatGPT in Python
- Reviewing the techniques of Prompt Engineering and Generative AI
- Final Project: Applying Data Engineering techniques to real datasets and building the required ETL pipeline