Certified Data Engineering Professional

Master the art of data engineering with our expertly designed program that covers everything from SQL proficiency to advanced big data concepts. Gain unparalleled insights into modernizing data infrastructures while learning from industry leaders. Enroll now to transform your career trajectory with cutting-edge skills tailored for today's digital landscape.

Face-to-Face | Jul 14-17, 2025 | 9:00 AM - 5:00 PM | Trainer: Tarun Sukhani
Updated · Beginner
We price match

Public Pricing

MYR 7000

Corporate Pricing

Pax:

Training Fees: MYR 6500/day
Total Fees: MYR 26000 ++ (MYR 6500/day × 4 days)

Training Provider Pricing

Pax:

Training Fees: MYR 9600
Material Fees: MYR 400
Total Fees: MYR 10000

Certification

Certified Data Engineering Professional (CCSD)
Validity: 2 years
Price: $149.00

Features

4 days
28 modules
11 intakes
English

Subsidies

HRDC Claimable

What you'll learn

  • Understand and implement both relational and NoSQL data models (a minimal PostgreSQL sketch follows this list).
  • Learn to automate robust data pipelines using Apache Airflow.
  • Work with diverse NoSQL databases such as Cassandra, Riak, Redis, Neo4j, and Elasticsearch.
  • Develop proficiency in SQL using PostgreSQL for effective database management.
  • Gain expertise in business intelligence tools like Pentaho for enhanced decision-making.
  • Optimize performance in Spark-based environments for efficient data processing.
  • Explore big data fundamentals including HDFS and MapReduce.
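To give a concrete feel for the relational-modeling outcome above, here is a minimal sketch of a normalized PostgreSQL schema created from Python with psycopg2. The table names, columns, sample rows, and connection string are illustrative placeholders, not the actual course materials.

```python
# Minimal sketch: a normalized relational model in PostgreSQL via psycopg2.
# All identifiers and the connection string are illustrative placeholders.
import psycopg2

DDL = """
CREATE TABLE IF NOT EXISTS customers (
    customer_id SERIAL PRIMARY KEY,
    name        TEXT NOT NULL,
    email       TEXT UNIQUE
);
CREATE TABLE IF NOT EXISTS orders (
    order_id    SERIAL PRIMARY KEY,
    customer_id INTEGER NOT NULL REFERENCES customers (customer_id),
    ordered_at  TIMESTAMP NOT NULL DEFAULT now(),
    total_myr   NUMERIC(10, 2) NOT NULL
);
"""

conn = psycopg2.connect("dbname=training user=postgres")  # placeholder credentials
with conn, conn.cursor() as cur:
    cur.execute(DDL)
    # Parameterized inserts keep the data consistent with the foreign key above.
    cur.execute(
        "INSERT INTO customers (name, email) VALUES (%s, %s) RETURNING customer_id",
        ("Aisha", "aisha@example.com"),
    )
    customer_id = cur.fetchone()[0]
    cur.execute(
        "INSERT INTO orders (customer_id, total_myr) VALUES (%s, %s)",
        (customer_id, 199.90),
    )
conn.close()
```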

Why should you attend?

This course offers a comprehensive exploration of data engineering concepts, focusing on the practical application of SQL and PostgreSQL to build fluency in database management. Participants will learn to create relational data models and understand the principles of normalization, providing a solid foundation for efficient data handling. The curriculum then contrasts SQL with NoSQL data models, guides learners through implementing denormalized schemas such as star and snowflake, and gives hands-on experience creating NoSQL databases with Apache Cassandra.

Business intelligence and data warehousing are covered extensively, with modules on implementing data warehouses on AWS and building multi-dimensional cubes using Pentaho. The course also introduces Spark SQL, DataFrames, and Datasets, emphasizing their use over traditional RDDs, and explores Spark MLlib for machine learning applications. Learners will apply Spark to managing data lakes, including techniques for debugging and optimization, and see why modernizing data lakes and warehouses matters for running successful data pipelines.

Automation is another key focus area: participants will create data pipelines with Apache Airflow while ensuring data quality and tracking lineage. The fundamentals of big data are addressed through HDFS, MapReduce in Hadoop, and Hadoop ecosystem components such as Hive and HBase. Finally, the course covers working with Cassandra and other common NoSQL databases, including Riak, Redis, Neo4j, and Elasticsearch, and concludes with an introduction to MapReduce architecture, detailing its phases and benefits.
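As a taste of the automation topic, below is a minimal sketch of a three-step ETL pipeline in Apache Airflow, assuming Airflow 2.x. The DAG id, schedule, and task bodies are hypothetical placeholders rather than the actual course lab.

```python
# Minimal sketch of a daily extract -> transform -> load pipeline in Airflow 2.x.
# The DAG id, schedule, and task bodies are hypothetical placeholders.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


def extract():
    print("pull raw data from the source system")


def transform():
    print("clean and reshape the extracted data")


def load():
    print("write the transformed data to the warehouse")


with DAG(
    dag_id="daily_sales_pipeline",  # hypothetical name
    start_date=datetime(2025, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    transform_task = PythonOperator(task_id="transform", python_callable=transform)
    load_task = PythonOperator(task_id="load", python_callable=load)

    # Run extract, then transform, then load, once per day.
    extract_task >> transform_task >> load_task
```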

Course Syllabus

Day 1
Introduction to SQL and PostgreSQL: Build fluency in SQL using PostgreSQL
Morning: two 15-min short breaks and a 15-min Recap and Q&A
Lunch: 1 hour
Afternoon: three 15-min short breaks and a 15-min Recap and Q&A

Day 2
Morning: two 15-min short breaks and a 15-min Recap and Q&A
Lunch: 1 hour
Afternoon: three 15-min short breaks and a 15-min Recap and Q&A

Day 3
Morning: two 15-min short breaks and a 15-min Recap and Q&A
Lunch: 1 hour
Afternoon: three 15-min short breaks and a 15-min Recap and Q&A

Day 4
Morning: two 15-min short breaks and a 15-min Recap and Q&A
Lunch: 1 hour
Afternoon: three 15-min short breaks and a 15-min Recap and Q&A

Minimum Qualification

graduate

Target Audience

Entry-level engineers

Methodologies

lecture
slides
case studies
labs
group discussion
Q&A
