The ultimate Data Engineer Roadmap 2026.
You don’t need expensive courses. This repository collects the best 100% FREE video tutorials and documentation to help you master data engineering from foundations to production systems.
Data Engineering is about building reliable, scalable data systems.
This roadmap starts with programming & SQL, moves through databases, ETL, big data, then covers tooling, cloud, orchestration, and finally production optimization & specializations.
- Phase 1: Foundations
- Phase 2: Core Data Engineering Skills
- Phase 3: Big Data Technologies
- Phase 4: Tooling & Infrastructure
- Phase 5: Production & Optimization
- Phase 6: Advanced & Specializations
The bedrock skills every data engineer must master first.
Primary language for data processing and automation.
Video Resources:
| Python Full Course | Python Beginner to Pro |
|---|---|
![]() |
![]() |
Documentation & Reading:
Core skill for querying and managing structured data.
Video Resources:
| SQL Beginner to Advanced | SQL Full Course |
|---|---|
![]() |
![]() |
Documentation & Reading:
Used mainly for distributed data processing.
Video Resources:
| Scala for Beginners | Scala Full Course |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://docs.scala-lang.org/
- https://docs.scala-lang.org/tour/tour-of-scala.html
- https://www.w3resource.com/scala-exercises/
Automate data workflows and system tasks.
Video Resources:
| Bash Full Course | Bash for Beginners |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://www.gnu.org/software/bash/manual/
- https://guide.bash.academy/
- https://www.w3schools.com/bash/
Track changes and collaborate effectively.
Video Resources:
| Git & GitHub Crash Course | Git in 1 Hour |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://git-scm.com/book/en/v2
- https://www.atlassian.com/git/tutorials
- https://learngitbranching.js.org/
✅ Want a structured Data Engineer roadmap?
Access the interactive visual roadmap with free resources here:
👉 Data Engineer Roadmap
Daily skills used in real-world data systems.
Video Resources:
| PostgreSQL Full Course | PostgreSQL Beginners |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://www.w3schools.com/postgresql/
- https://www.postgresql.org/docs/current/tutorial.html
- https://www.geeksforgeeks.org/postgresql-tutorial/
Video Resources:
| MongoDB in 1 Hour | MongoDB Beginners |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://www.w3schools.com/mongodb/
- https://learn.mongodb.com/
- https://www.freecodecamp.org/news/learn-mongodb-a4ce205e7739/
Video Resources:
| Data Modeling Basics | Data Modeling Full Course |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://learndatamodeling.com/
- https://www.geeksforgeeks.org/data-analysis/data-modeling-a-comprehensive-guide-for-analysts/
Video Resources:
| ETL Explained | ETL Portfolio Project |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://aws.amazon.com/what-is/etl/
- https://www.talend.com/resources/what-is-etl/
- https://learn.microsoft.com/en-us/azure/architecture/data-guide/relational-data/etl
Video Resources:
| Data Warehouse Tutorial | Data Warehousing Playlist |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://www.tutorialspoint.com/dwh/index.htm
- https://learn.microsoft.com/en-us/fabric/data-warehouse/tutorial-introduction
- https://www.geeksforgeeks.org/dbms/data-warehousing-tutorial/
Handling large-scale distributed data.
Video Resources:
| Hadoop Full Course | Big Data with Hadoop |
|---|---|
![]() |
![]() |
Documentation & Reading:
Video Resources:
| Spark Quick Guide | Spark Full Course |
|---|---|
![]() |
![]() |
Documentation & Reading:
Video Resources:
| Kafka Playlist | Kafka for Beginners |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://kafka.apache.org/documentation/
- https://developer.confluent.io/learn-kafka/
- https://www.baeldung.com/apache-kafka
Professional data engineering workflow.
Video Resources:
| Airflow Full Course | Airflow Tutorial |
|---|---|
![]() |
![]() |
Documentation & Reading:
Video Resources:
| Docker for Data Engineers | Docker Beginners |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://docs.docker.com/
- https://www.digitalocean.com/community/tutorials/how-to-install-and-use-docker-on-ubuntu-20-04
Video Resources:
| Kubernetes for Data Engineers | Kubernetes Basics |
|---|---|
![]() |
![]() |
Documentation & Reading:
Video Resources:
| GitHub Actions Tutorial | CI/CD with Docker |
|---|---|
![]() |
![]() |
Documentation & Reading:
Video Resources:
| AWS Full Course | AWS Cloud Practitioner |
|---|---|
![]() |
![]() |
Documentation & Reading:
Operate pipelines reliably at scale.
Video Resources:
| Pipeline Optimization | Cost Optimization |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://www.getdbt.com/resources/29-ways-to-optimize-costs-in-data-pipelines-workflows-and-analyses
- https://kanerika.com/blogs/data-pipeline-optimization/
Video Resources:
| Pipeline Monitoring | Data Quality Monitoring |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://www.montecarlodata.com/blog-data-pipeline-monitoring/
- https://www.datadoghq.com/knowledge-center/data-pipeline-monitoring/
- https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/
Choose after real production experience.
Video Resources:
| Data Security | Data Privacy |
|---|---|
![]() |
![]() |
Documentation & Reading:
Video Resources:
| Streaming Data Basics | Stream Processing |
|---|---|
![]() |
![]() |
Documentation & Reading:
- https://estuary.dev/blog/streaming-data-processing/
- https://learn.microsoft.com/en-us/azure/architecture/data-guide/technology-choices/stream-processing
Found a great free data engineering resource?
- Fork this repository
- Add the resource to the correct phase
- Submit a Pull Request
If this Data Engineer Roadmap helped you learn and save money, please give this repo a Star ⭐.
- Interactive Roadmap: https://codersnote.com/roadmap
- AI Tutor Lyra: https://codersnote.com/ai-tutor










































