Certified Data Engineering Professional
Master the art of data engineering with our expertly designed program that covers everything from SQL proficiency to advanced big data concepts. Gain unparalleled insights into modernizing data infrastructures while learning from industry leaders. Enroll now to transform your career trajectory with cutting-edge skills tailored for today's digital landscape.
- Available in:
- Malaysia

Corporate Pricing
Pax:
Training Provider Pricing
Pax:
Features
Subsidies

What you'll learn
- Understand and implement both relational and NoSQL data models.
- Learn to automate robust data pipelines using Apache Airflow.
- Work with diverse NoSQL databases such as Cassandra, Riak, Redis, Neo4j, and Elasticsearch.
- Develop proficiency in SQL using PostgreSQL for effective database management.
- Gain expertise in business intelligence tools like Pentaho for enhanced decision-making.
- Optimize performance in Spark-based environments for efficient data processing.
- Explore big data fundamentals including HDFS and MapReduce.
Why should you attend?
This course offers a comprehensive exploration of data engineering concepts, focusing on the practical application of SQL and PostgreSQL to build fluency in database management. Participants will learn to create relational data models and understand the principles of normalization, providing a solid foundation for efficient data handling. The curriculum delves into data modeling, contrasting SQL with NoSQL data models, and guides learners through implementing denormalized schemas such as STAR and Snowflake. Additionally, students will gain hands-on experience creating NoSQL databases using Apache Cassandra. Business intelligence and data warehousing are covered extensively, with modules on implementing data warehouses on AWS and building multi-dimensional cubes using Pentaho. The course also introduces SparkSQL, DataFrames, and Datasets, emphasizing their use over traditional RDDs and exploring Spark MLLib for machine learning applications. Learners will explore the power of Spark in managing data lakes, including techniques for debugging and optimization. The course highlights the importance of modernizing data lakes and warehouses to enhance business operations through successful data pipelines. Automation is another key focus area, where participants will create data pipelines with Apache Airflow while ensuring data quality and tracking lineage. The fundamentals of big data are addressed through topics like HDFS, MapReduce in Hadoop, and working with various Hadoop ecosystem components such as Hive and HBase. Finally, the course covers working with Cassandra and other common NoSQL databases like Riak, Redis, Neo4j, and Elasticsearch. It concludes with an introduction to MapReduce architecture, detailing its phases and benefits.
Course Syllabus
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 1
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 2
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 3
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 4
Minimum Qualification
Target Audience
Methodologies
Why should you attend?
This course offers a comprehensive exploration of data engineering concepts, focusing on the practical application of SQL and PostgreSQL to build fluency in database management. Participants will learn to create relational data models and understand the principles of normalization, providing a solid foundation for efficient data handling. The curriculum delves into data modeling, contrasting SQL with NoSQL data models, and guides learners through implementing denormalized schemas such as STAR and Snowflake. Additionally, students will gain hands-on experience creating NoSQL databases using Apache Cassandra. Business intelligence and data warehousing are covered extensively, with modules on implementing data warehouses on AWS and building multi-dimensional cubes using Pentaho. The course also introduces SparkSQL, DataFrames, and Datasets, emphasizing their use over traditional RDDs and exploring Spark MLLib for machine learning applications. Learners will explore the power of Spark in managing data lakes, including techniques for debugging and optimization. The course highlights the importance of modernizing data lakes and warehouses to enhance business operations through successful data pipelines. Automation is another key focus area, where participants will create data pipelines with Apache Airflow while ensuring data quality and tracking lineage. The fundamentals of big data are addressed through topics like HDFS, MapReduce in Hadoop, and working with various Hadoop ecosystem components such as Hive and HBase. Finally, the course covers working with Cassandra and other common NoSQL databases like Riak, Redis, Neo4j, and Elasticsearch. It concludes with an introduction to MapReduce architecture, detailing its phases and benefits.
What you'll learn
- Understand and implement both relational and NoSQL data models.
- Learn to automate robust data pipelines using Apache Airflow.
- Work with diverse NoSQL databases such as Cassandra, Riak, Redis, Neo4j, and Elasticsearch.
- Develop proficiency in SQL using PostgreSQL for effective database management.
- Gain expertise in business intelligence tools like Pentaho for enhanced decision-making.
- Optimize performance in Spark-based environments for efficient data processing.
- Explore big data fundamentals including HDFS and MapReduce.
Course Syllabus
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 1
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 2
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 3
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 4
Corporate Pricing
Pax:
Training Provider Pricing
Pax:
Features
Subsidies

Minimum Qualification
Target Audience
Methodologies
Our Offers

Become a Trainer
Teach what you love. Abundent Academy gives you the tools you need to run your own trainings! We provide you with the platform, the students, the materials, and the support you need to succeed!
- Higher trainer payouts
- Ready-made course materials
- Student management system
- AI digital marketing assistant

Academy for Business
Get unlimited access to all of Abundent Academy's carefully curated courses for your team, all organized according to job category and role! Perfect for companies looking to upskill their workforce and stay ahead in the tech industry.
- Carefully curated courses
- Role-based learning paths
- Team progress tracking
- Gap Identification and Analysis