Lesson 1
Introduction to Automating Data Pipelines
Welcome to Automating Data Pipelines. In this lesson, you'll be introduced to the topic, prerequisites for the course, and the environment and tools you'll be using to build data pipelines.
Course
In this course, you'll build pipelines leveraging Airflow DAGs to organize your tasks along with AWS resources such as S3 and Redshift.
In this course, you'll build pipelines leveraging Airflow DAGs to organize your tasks along with AWS resources such as S3 and Redshift.
Intermediate
4 weeks
Real-world Projects
Completion Certificate
Last Updated May 20, 2024
Lesson 1
Welcome to Automating Data Pipelines. In this lesson, you'll be introduced to the topic, prerequisites for the course, and the environment and tools you'll be using to build data pipelines.
Lesson 2
In this lesson, you'll learn about the components of a data pipeline including Directed Acyclic Graphs (DAGs). You'll practice creating data pipelines with DAGs and Apache Airflow
Lesson 3
This lesson creates connections between Airflow and AWS first by creating credentials, then copying S3 data, leveraging connections and hooks, and building S3 data to the Redshift DAG.
Lesson 4
Students will learn how to track data lineage and set up data pipeline schedules, partition data to optimize pipelines, investigating Data Quality issues, and write tests to ensure data quality.
Lesson 5
In this last lesson, students will learn how to build Pipelines with maintainability and reusability in mind. They will also learn about pipeline monitoring.
Lesson 6 • Project
Students work on a music streaming company’s data infrastructure by creating and automating a set of data pipelines with Airflow, monitoring and debugging production pipelines
Professor at Brigham Young University Idaho
Sean currently teaches cybersecurity and DevOps courses at Brigham Young University Idaho. He has been a software engineer for over 16 years. Some of the most exciting projects he has worked on involved data pipelines for DNA processing and vehicle telematics.
Combine technology training for employees with industry experts, mentors, and projects, for critical thinking that pushes innovation. Our proven upskilling system goes after success—relentlessly.
Demonstrate proficiency with practical projects
Projects are based on real-world scenarios and challenges, allowing you to apply the skills you learn to practical situations, while giving you real hands-on experience.
Gain proven experience
Retain knowledge longer
Apply new skills immediately
Top-tier services to ensure learner success
Reviewers provide timely and constructive feedback on your project submissions, highlighting areas of improvement and offering practical tips to enhance your work.
Get help from subject matter experts
Learn industry best practices
Gain valuable insights and improve your skills
Unlimited access to our top-rated courses
Real-world projects
Personalized project reviews
Program certificates
Proven career outcomes
Full Catalog Access
One subscription opens up this course and our entire catalog of projects and skills.
Average time to complete a Nanodegree program
4 weeks
, Intermediate
4 weeks
, Intermediate
4 weeks
, Intermediate
3 hours
, Intermediate
4 weeks
, Intermediate
1 month
, Advanced
4 weeks
, Intermediate
3 weeks
, Advanced
4 weeks
, Advanced
4 weeks
, Advanced
7 hours
, Fluency
4 weeks
, Intermediate
(2)
4 months
, Advanced
3 weeks
, Advanced
4 weeks
, Intermediate
4 weeks
, Beginner
Automate Data Pipelines
4 weeks
, Intermediate
4 weeks
, Intermediate
4 weeks
, Intermediate
3 hours
, Intermediate
4 weeks
, Intermediate
1 month
, Advanced
4 weeks
, Intermediate
3 weeks
, Advanced
4 weeks
, Advanced
4 weeks
, Advanced
7 hours
, Fluency
4 weeks
, Intermediate
(2)
4 months
, Advanced
3 weeks
, Advanced
4 weeks
, Intermediate
4 weeks
, Beginner