Data Engineering - SQL, Spark & Pipeline Automation
Elevate your career with our cutting-edge Data Engineering course. Embark on an immersive learning experience that seamlessly integrates core concepts like SQL, BI tools, Spark processing, NoSQL databases, pipeline automation, Big Data fundamentals, and MapReduce techniques. Enhance your skill set through practical implementation in a dynamic environment.
Corporate Pricing
Pax:
Training Provider Pricing
Pax:
Certification
Certified Beginner Data Engineer
Veritas
Features
Target Audience
Methodologies
Subsidies

What you'll learn
- Develop expertise in MapReduce programming model for large-scale distributed computing.
- Automate complex Data Pipelines utilizing Apache Airflow for efficient data management.
- Comprehend the principles of Big Data including handling various types of datasets.
- Gain proficiency in SQL using PostgreSQL and understand relational database design.
- Understand NoSQL databases creation with Apache Cassandra and contrast it with SQL models.
- Learn Business Intelligence concepts and implement a Data Warehouse using Pentaho on AWS.
- Acquire hands-on experience in Hive Query Language for interacting with big data stores.
- Master big data processing using SparkSQL, DataFrames, Datasets including MLlib & GraphX.
Why should you attend?
Dive into the world of data engineering with our meticulously crafted course, designed to equip you with the essential skills required in this ever-evolving field. Begin your journey by mastering SQL and PostgreSQL, where you'll gain fluency in SQL commands and understand the intricacies of creating relational data models and normalization processes. Progress to Business Intelligence (BI) and Data Warehousing using Pentaho, learning the fundamentals of data warehousing, integration, and how to implement these concepts on AWS, including building multi-dimensional cubes. Advance further with SparkSQL, DataFrames, and Datasets to handle big data processing with ease. You'll explore the capabilities of SparkSQL, learn to manipulate data using DataFrames and RDDs, and delve into Spark's MLLib for machine learning applications. Gain insights into Data Lakes and acquire skills in data wrangling for more efficient data processing. The course also covers NoSQL database creation with Apache Cassandra, providing a solid foundation in Data Modelling. You will compare SQL vs NoSQL data models and learn about denormalized schemas like STAR and Snowflake. Automating Data Pipelines is another critical area you'll master, using tools such as Apache Airflow to create robust pipelines that ensure data quality and track lineage. Finally, grasp Big Data Fundamentals by understanding the 4 V's—veracity, variability, visualization, and value—and working with different types of data. Explore Hive and HBase along with Hive Query Language before concluding with MapReduce where you'll learn about partitioning mappers and reducers for efficient big data processing.
Course Syllabus
Day 1 - Data Engineering Foundations
Short Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsRecap and Q&A
15 minsEnd of Day 1
Ratings and Reviews
Instructor
Instructor
Course Reviews
"Overall, I am now confident in my knowledge of Data Engineering. The only criticism is that there is a lot of material to take in, so taking running notes is a good idea (helps revise for assignments as well)"
"This course was well-designed and educational. I came in knowing very little about data engineering and now have a lot better idea of what it is all about."
"A good lesson for beginners to learn the fundamentals of data engineering. People should consider becoming a data engineer, scientist, or analyst."
"Excellent preparation for becoming a Data Engineer. It explains what Data Engineers do and what they should know."
Instructor Reviews
Mohammad Mehdi Lotfinejad
Chief Data Officer & Data Science Trainer"Mehdi and I worked on several projects with company such as Petronas , Shell and CIMB Regional ETC. I must say Mehdi's training was highly appreciated by our clients as he was able to exhibit in full display his vast knowledge as a Data professional. I would highly recommend him to anyone looking for a top tier training expert."
"Not only knowledgeable but also having hands dirty on what he knows Friendly and building networks quickly."
"I had the pleasure of working with Mehdi together on some high-level initiatives such as the Petronas data scientist program and Shell's project to become a data-driven organization. During these projects, Mehdi received numerous accolades for his ability to share his knowledge and mentor up-and-coming data scientists. Based on our shared experiences, I have no hesitation in recommending Mehdi for any project or position he may be considered for."
FAQ
Frequently Asked Questions About This Course
- Public pricing: applies for individuals signing up from different companies.
- Corporate pricing: applies if a company wants to have an intake for its employees only.
- Training provider pricing: applies only for other training providers looking to hire our trainers and use our content. Our content has a licensing fee.
We will keep you updated on the status of the intake after you enroll.
Why should you attend?
Dive into the world of data engineering with our meticulously crafted course, designed to equip you with the essential skills required in this ever-evolving field. Begin your journey by mastering SQL and PostgreSQL, where you'll gain fluency in SQL commands and understand the intricacies of creating relational data models and normalization processes. Progress to Business Intelligence (BI) and Data Warehousing using Pentaho, learning the fundamentals of data warehousing, integration, and how to implement these concepts on AWS, including building multi-dimensional cubes. Advance further with SparkSQL, DataFrames, and Datasets to handle big data processing with ease. You'll explore the capabilities of SparkSQL, learn to manipulate data using DataFrames and RDDs, and delve into Spark's MLLib for machine learning applications. Gain insights into Data Lakes and acquire skills in data wrangling for more efficient data processing. The course also covers NoSQL database creation with Apache Cassandra, providing a solid foundation in Data Modelling. You will compare SQL vs NoSQL data models and learn about denormalized schemas like STAR and Snowflake. Automating Data Pipelines is another critical area you'll master, using tools such as Apache Airflow to create robust pipelines that ensure data quality and track lineage. Finally, grasp Big Data Fundamentals by understanding the 4 V's—veracity, variability, visualization, and value—and working with different types of data. Explore Hive and HBase along with Hive Query Language before concluding with MapReduce where you'll learn about partitioning mappers and reducers for efficient big data processing.
What you'll learn
- Develop expertise in MapReduce programming model for large-scale distributed computing.
- Automate complex Data Pipelines utilizing Apache Airflow for efficient data management.
- Comprehend the principles of Big Data including handling various types of datasets.
- Gain proficiency in SQL using PostgreSQL and understand relational database design.
- Understand NoSQL databases creation with Apache Cassandra and contrast it with SQL models.
- Learn Business Intelligence concepts and implement a Data Warehouse using Pentaho on AWS.
- Acquire hands-on experience in Hive Query Language for interacting with big data stores.
- Master big data processing using SparkSQL, DataFrames, Datasets including MLlib & GraphX.
Course Syllabus
Day 1 - Data Engineering Foundations
Short Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsRecap and Q&A
15 minsEnd of Day 1
Course Reviews
"Overall, I am now confident in my knowledge of Data Engineering. The only criticism is that there is a lot of material to take in, so taking running notes is a good idea (helps revise for assignments as well)"
"This course was well-designed and educational. I came in knowing very little about data engineering and now have a lot better idea of what it is all about."
"A good lesson for beginners to learn the fundamentals of data engineering. People should consider becoming a data engineer, scientist, or analyst."
"Excellent preparation for becoming a Data Engineer. It explains what Data Engineers do and what they should know."
Instructor Reviews
Mohammad Mehdi Lotfinejad
Chief Data Officer & Data Science Trainer"Mehdi and I worked on several projects with company such as Petronas , Shell and CIMB Regional ETC. I must say Mehdi's training was highly appreciated by our clients as he was able to exhibit in full display his vast knowledge as a Data professional. I would highly recommend him to anyone looking for a top tier training expert."
"Not only knowledgeable but also having hands dirty on what he knows Friendly and building networks quickly."
"I had the pleasure of working with Mehdi together on some high-level initiatives such as the Petronas data scientist program and Shell's project to become a data-driven organization. During these projects, Mehdi received numerous accolades for his ability to share his knowledge and mentor up-and-coming data scientists. Based on our shared experiences, I have no hesitation in recommending Mehdi for any project or position he may be considered for."
Corporate Pricing
Pax:
Training Provider Pricing
Pax:
Certification
Certified Beginner Data Engineer
Veritas
Features
Target Audience
Methodologies
Subsidies

Ratings and Reviews
Instructors
FAQ
Frequently Asked Questions About This Course
- Public pricing: applies for individuals signing up from different companies.
- Corporate pricing: applies if a company wants to have an intake for its employees only.
- Training provider pricing: applies only for other training providers looking to hire our trainers and use our content. Our content has a licensing fee.
We will keep you updated on the status of the intake after you enroll.
Our Offers
Academy for Trainers Academy for Trainers
Teach what you love. Abundent Academy gives you the tools you need to run your own trainings! We provide you with the platform, the students, the materials, and the support you need to succeed!
- Higher trainer payouts
- Ready-made course materials
- Student management system
- AI digital marketing assistant
Academy for Corporates Academy for Corporates
Get unlimited access to all of Abundent Academy's carefully curated courses for your team, all organized according to learning paths and roles! Perfect for companies looking to upskill their workforce and stay ahead in the tech industry.
- Carefully curated courses
- Role-based learning paths
- Team progress tracking
- Gap Identification and Analysis
Academy for Partners Academy for Partners
White-label IT training delivery for training providers. We become your behind-the-scenes delivery arm so you can say yes to more clients without hiring more trainers.
- Expand your training catalog
- 40+ expert trainers ready
- White-label delivery
- You keep client relationships