AWS Data Engineering: AWS Glue, Athena, and PySpark
Master the art of data engineering with our comprehensive training program focused on AWS Glue, Athena, and PySpark. Under expert guidance, you'll build robust ETL pipelines and optimize performance for real-world applications. Enroll now to transform your skills in cloud-based data processing.
- Available in:
- Malaysia

Corporate Pricing
Pax:
Training Provider Pricing
Pax:
Features
Subsidies

What you'll learn
- Set up and configure an AWS environment including S3 Data Lake and IAM roles.
- Understand foundational concepts of data engineering within a banking context.
- Utilize AWS Glue Data Catalogs for effective data management.
- Implement PySpark operations for efficient data processing.
- Develop basic to advanced ETL jobs using AWS Glue.
- Optimize performance of Spark jobs through partitioning strategies.
- Design secure and compliant end-to-end data pipelines.
- Query large datasets using Amazon Athena effectively.
Why should you attend?
This course provides an in-depth exploration of data engineering using AWS Glue, Athena, and PySpark. It begins with foundational concepts, setting the stage for understanding the role of data engineering within a banking context. Participants will gain insights into the AWS Data Services Ecosystem and learn the differences between ETL (Extract, Transform, Load) and ELT (Extract, Load, Transform) processes. The course progresses to practical skills such as setting up and configuring an AWS environment. Learners will establish a data lake foundation using Amazon S3 and configure IAM roles for secure access management. Navigation through the AWS Console is also covered to ensure participants can efficiently manage their resources. A significant portion of the training focuses on AWS Glue Data Catalog fundamentals, where learners will explore data catalog architecture and utilize Glue Crawlers for data inspection and validation. The hands-on exercises reinforce these concepts by guiding participants through crawler creation. Further modules introduce PySpark for data processing, covering essential operations and common data preparation tasks. Participants will write basic AWS Glue ETL jobs, execute them, and troubleshoot any issues that arise. Advanced topics include PySpark transformations, performance optimization strategies, and monitoring techniques to ensure efficient job execution. The course also delves into Amazon Athena for querying large datasets directly from S3. Through hands-on labs and case studies like fraud detection pipelines, learners apply their knowledge in real-world scenarios. Finally, the course concludes with discussions on security, governance, data quality monitoring, and end-to-end pipeline design.
Course Syllabus
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 1
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 2
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 3
Ratings and Reviews
Instructor
Tarun Sukhani is a distinguished professional trainer and consultant with nearly 30 years of experience in the IT and business sectors, having worked internationally across AMER, APAC, and EMEA. His expertise spans a wide array of domains such as Agile methodologies (Scrum, SAFe, Kanban), enterprise architecture frameworks like TOGAF and COBIT, IT service management standards including ITIL and ISO27001, as well as cybersecurity certifications like CISO and CISSP. Tarun's proficiency extends to project management frameworks such as PRINCE2 and PMP, along with cutting-edge technologies in Big Data analytics using Hadoop and Spark, data science with Python or R, and data visualization tools like Tableau. Tarun has held numerous senior development and executive roles including CIO/CTO positions where he managed large-scale IT operations for multinational corporations such as Dell, AMD, and Experian. His leadership skills have been instrumental in enhancing business operations across various functions including HR, Finance, Operations, Sales, Risk Management, Engineering/Manufacturing, and Accounting. He has also contributed significantly to regional conglomerates like Indra in the Asia Pacific region. A passionate educator at heart, Tarun has facilitated training workshops throughout Asia Pacific countries such as Malaysia, Indonesia, Philippines, Thailand, and Singapore. His training sessions cover a broad spectrum of topics from project management to strategic leadership and soft skills development. Additionally, he specializes in advanced technical subjects like software architecture design patterns for reactive microservices architectures on cloud platforms. Tarun's academic credentials are equally impressive; he graduated summa cum laude with an MSc in Information Systems and an MBA in Finance and Operations Management from Loyola University Chicago. He holds multiple Bachelor's degrees in Biology, Math & Computer Science alongside Business Administration. Further enriching his knowledge base are certifications as an Agile/Scrum trainer; Java/.NET programmer; Machine Learning specialist; InfoSec expert; Business Intelligence professional; complemented by advanced studies in AI & Blockchain from prestigious institutions like MIT & Stanford. His client portfolio includes renowned organizations such as Western Digital/Sandisk for machine learning product management projects; Singtel & CIMB for agile product development initiatives; Tenaga Nasional & JPJ for agile project management engagements among others. Tarun's dynamic approach combined with his deep-rooted passion for developing people makes him a sought-after speaker at international conferences where he shares insights on digital transformation strategies.
Minimum Qualification
Target Audience
Methodologies
Instructor Reviews
He was indeed very skilled, knowledgeable and passionate in the data science realm. I was impressed with his business know-how (how the world economy works and how all things can be explain with data, with/without bias) and technical skills in converting data into insights. I will not hesitate to recommend Tarun for any data science related training as I would like to attend more classes myself to learn from the best of the best.
I attended one of Tarun's Data Science course in Jakarta (CDSS). He was a professional trainer & very knowledgeable in Data Science. In his course, Tarun gave many practical examples & valuable information regarding how to conduct Data Science & it's related components (e.g. Software & Deployment Architecture). In addition to those lessons, he also gave very useful insights on building a career as a Data Scientist.
Attended "Blockchain Training: An Overview for Business Professionals" conducted by Dr. Tarun. The reference materials are very comprehensive and an excellent means of conveying information. I was very impressed with how this technology works and adapted into business
He shared his professional insights on data science with a sense of humor that cleared up so many of my questions about the content and real-world applications. Information, tools, and resources given are very useful
His knowledge of multiple subjects exceeds far greater than that of any IT or non-IT person I have met or interacted with in a long time. The breadth and depth of the subject matter he has acquired is exemplary.
Training with Tarun has been awesome. He’s super knowledgable, funny, empathetic and a great educator in general. As someone who didn’t come from a computer science background, his teachings didn’t make me feel stupid or impossible to eventually arrive at being a competent developer. I could understand him as he communicates well & has helped me see the big picture of the computer science field beyond the scope of syntaxes. If you similarly did not come from a CS background and hope to transition into the world of programming but struggle to learn on your own, understand all the foreign & abstract concepts and connect the dots, I think the right person to guide you on your journey would make a big difference. Having someone who’s deep in the field with many years of experience narrow and communicate the relevant areas to focus would also close a big gap from having to struggle and figure out a lot of things on your own. Being able to maintain your interest during your learning journey is important too, thus finding that someone is important. All in all, I would wholeheartedly recommend Tarun and the backend course I took.
Tarun is a results-driven & inspirational technology leader with a clear vision, direction, and broad-based technology expertise. He is passionate, intuitive, engaged, pragmatic, systematic, agile. His experiences span from small start-ups to complex, global companies, from being technical lead to technical strategist to being the leader of larger group of architecture and engineering teams. Much of his experiences are in the area of Java, Scala, Machine Learning, Neural Networks, Cloud Computing, Data Science and what not. I am truly amazed to experience his breadth & depth of technological expertise and pleasure to be part of his team.
Tarun is very passionate on the domains and gave numerous insights to support critical business decisions and develop data products to transform daily encounters and processes. He was a professional trainer & very knowledgeable in Data Science. His material is presented through a sequence of brief lectures, interactive demonstrations, great hands-on exercises, and discussions.
We’ve been collaborated many times in doing courses for the accountants. He spoke to quiet number of event in our company with various topic regards to accountants need. The collaboration was very smooth and his session definitely made a huge impact on our success. Mr Tarun is a great Professional!
Mr. Tarun is a driven, hardworking, and knowledgeable entrepreneur in his field." A broad-minded trainer who embraces change and inspires people to do better every day. Mr. Tarun sets a good example by being enthusiastic and dedicated, and he inspires and motivates others. I am delighted to be working for such personnel
Courses you may like
Why should you attend?
This course provides an in-depth exploration of data engineering using AWS Glue, Athena, and PySpark. It begins with foundational concepts, setting the stage for understanding the role of data engineering within a banking context. Participants will gain insights into the AWS Data Services Ecosystem and learn the differences between ETL (Extract, Transform, Load) and ELT (Extract, Load, Transform) processes. The course progresses to practical skills such as setting up and configuring an AWS environment. Learners will establish a data lake foundation using Amazon S3 and configure IAM roles for secure access management. Navigation through the AWS Console is also covered to ensure participants can efficiently manage their resources. A significant portion of the training focuses on AWS Glue Data Catalog fundamentals, where learners will explore data catalog architecture and utilize Glue Crawlers for data inspection and validation. The hands-on exercises reinforce these concepts by guiding participants through crawler creation. Further modules introduce PySpark for data processing, covering essential operations and common data preparation tasks. Participants will write basic AWS Glue ETL jobs, execute them, and troubleshoot any issues that arise. Advanced topics include PySpark transformations, performance optimization strategies, and monitoring techniques to ensure efficient job execution. The course also delves into Amazon Athena for querying large datasets directly from S3. Through hands-on labs and case studies like fraud detection pipelines, learners apply their knowledge in real-world scenarios. Finally, the course concludes with discussions on security, governance, data quality monitoring, and end-to-end pipeline design.
What you'll learn
- Set up and configure an AWS environment including S3 Data Lake and IAM roles.
- Understand foundational concepts of data engineering within a banking context.
- Utilize AWS Glue Data Catalogs for effective data management.
- Implement PySpark operations for efficient data processing.
- Develop basic to advanced ETL jobs using AWS Glue.
- Optimize performance of Spark jobs through partitioning strategies.
- Design secure and compliant end-to-end data pipelines.
- Query large datasets using Amazon Athena effectively.
Course Syllabus
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 1
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 2
Short Break
15 minsShort Break
15 minsRecap and Q&A
15 minsLunch
1 hourShort Break
15 minsShort Break
15 minsShort Break
15 minsRecap and Q&A
15 minsEnd of Day 3
Instructor Reviews
He was indeed very skilled, knowledgeable and passionate in the data science realm. I was impressed with his business know-how (how the world economy works and how all things can be explain with data, with/without bias) and technical skills in converting data into insights. I will not hesitate to recommend Tarun for any data science related training as I would like to attend more classes myself to learn from the best of the best.
I attended one of Tarun's Data Science course in Jakarta (CDSS). He was a professional trainer & very knowledgeable in Data Science. In his course, Tarun gave many practical examples & valuable information regarding how to conduct Data Science & it's related components (e.g. Software & Deployment Architecture). In addition to those lessons, he also gave very useful insights on building a career as a Data Scientist.
Attended "Blockchain Training: An Overview for Business Professionals" conducted by Dr. Tarun. The reference materials are very comprehensive and an excellent means of conveying information. I was very impressed with how this technology works and adapted into business
He shared his professional insights on data science with a sense of humor that cleared up so many of my questions about the content and real-world applications. Information, tools, and resources given are very useful
His knowledge of multiple subjects exceeds far greater than that of any IT or non-IT person I have met or interacted with in a long time. The breadth and depth of the subject matter he has acquired is exemplary.
Training with Tarun has been awesome. He’s super knowledgable, funny, empathetic and a great educator in general. As someone who didn’t come from a computer science background, his teachings didn’t make me feel stupid or impossible to eventually arrive at being a competent developer. I could understand him as he communicates well & has helped me see the big picture of the computer science field beyond the scope of syntaxes. If you similarly did not come from a CS background and hope to transition into the world of programming but struggle to learn on your own, understand all the foreign & abstract concepts and connect the dots, I think the right person to guide you on your journey would make a big difference. Having someone who’s deep in the field with many years of experience narrow and communicate the relevant areas to focus would also close a big gap from having to struggle and figure out a lot of things on your own. Being able to maintain your interest during your learning journey is important too, thus finding that someone is important. All in all, I would wholeheartedly recommend Tarun and the backend course I took.
Tarun is a results-driven & inspirational technology leader with a clear vision, direction, and broad-based technology expertise. He is passionate, intuitive, engaged, pragmatic, systematic, agile. His experiences span from small start-ups to complex, global companies, from being technical lead to technical strategist to being the leader of larger group of architecture and engineering teams. Much of his experiences are in the area of Java, Scala, Machine Learning, Neural Networks, Cloud Computing, Data Science and what not. I am truly amazed to experience his breadth & depth of technological expertise and pleasure to be part of his team.
Tarun is very passionate on the domains and gave numerous insights to support critical business decisions and develop data products to transform daily encounters and processes. He was a professional trainer & very knowledgeable in Data Science. His material is presented through a sequence of brief lectures, interactive demonstrations, great hands-on exercises, and discussions.
We’ve been collaborated many times in doing courses for the accountants. He spoke to quiet number of event in our company with various topic regards to accountants need. The collaboration was very smooth and his session definitely made a huge impact on our success. Mr Tarun is a great Professional!
Mr. Tarun is a driven, hardworking, and knowledgeable entrepreneur in his field." A broad-minded trainer who embraces change and inspires people to do better every day. Mr. Tarun sets a good example by being enthusiastic and dedicated, and he inspires and motivates others. I am delighted to be working for such personnel
Corporate Pricing
Pax:
Training Provider Pricing
Pax:
Features
Subsidies

Ratings and Reviews
Instructor
Tarun Sukhani is a distinguished professional trainer and consultant with nearly 30 years of experience in the IT and business sectors, having worked internationally across AMER, APAC, and EMEA. His expertise spans a wide array of domains such as Agile methodologies (Scrum, SAFe, Kanban), enterprise architecture frameworks like TOGAF and COBIT, IT service management standards including ITIL and ISO27001, as well as cybersecurity certifications like CISO and CISSP. Tarun's proficiency extends to project management frameworks such as PRINCE2 and PMP, along with cutting-edge technologies in Big Data analytics using Hadoop and Spark, data science with Python or R, and data visualization tools like Tableau. Tarun has held numerous senior development and executive roles including CIO/CTO positions where he managed large-scale IT operations for multinational corporations such as Dell, AMD, and Experian. His leadership skills have been instrumental in enhancing business operations across various functions including HR, Finance, Operations, Sales, Risk Management, Engineering/Manufacturing, and Accounting. He has also contributed significantly to regional conglomerates like Indra in the Asia Pacific region. A passionate educator at heart, Tarun has facilitated training workshops throughout Asia Pacific countries such as Malaysia, Indonesia, Philippines, Thailand, and Singapore. His training sessions cover a broad spectrum of topics from project management to strategic leadership and soft skills development. Additionally, he specializes in advanced technical subjects like software architecture design patterns for reactive microservices architectures on cloud platforms. Tarun's academic credentials are equally impressive; he graduated summa cum laude with an MSc in Information Systems and an MBA in Finance and Operations Management from Loyola University Chicago. He holds multiple Bachelor's degrees in Biology, Math & Computer Science alongside Business Administration. Further enriching his knowledge base are certifications as an Agile/Scrum trainer; Java/.NET programmer; Machine Learning specialist; InfoSec expert; Business Intelligence professional; complemented by advanced studies in AI & Blockchain from prestigious institutions like MIT & Stanford. His client portfolio includes renowned organizations such as Western Digital/Sandisk for machine learning product management projects; Singtel & CIMB for agile product development initiatives; Tenaga Nasional & JPJ for agile project management engagements among others. Tarun's dynamic approach combined with his deep-rooted passion for developing people makes him a sought-after speaker at international conferences where he shares insights on digital transformation strategies.
Minimum Qualification
Target Audience
Methodologies
Courses you may like
Our Offers

Become a Trainer
Teach what you love. Abundent Academy gives you the tools you need to run your own trainings! We provide you with the platform, the students, the materials, and the support you need to succeed!
- Higher trainer payouts
- Ready-made course materials
- Student management system
- AI digital marketing assistant

Academy for Business
Get unlimited access to all of Abundent Academy's carefully curated courses for your team, all organized according to job category and role! Perfect for companies looking to upskill their workforce and stay ahead in the tech industry.
- Carefully curated courses
- Role-based learning paths
- Team progress tracking
- Gap Identification and Analysis