Data Engineering Internship 2026
Chapter 1
Welcome to the Bootcamp
Congratulations — you made it into the Data Engineering Internship. Over the next 12 weeks you will go from SQL queries to full end-to-end data pipelines. This handbook is your day-to-day reference. Read it once now, then keep it open.
Your Instructors
| Name | Role |
|---|---|
| Jemima Villanueva | Lead Instructor |
| Raymond Laguerta | Instructor |
| Jesse Louise Calasin | Instructor |
What the 12 Weeks Look Like
| Week | Phase | Primary Focus | You Deliver |
|---|---|---|---|
| Day 0 | Setup | Environment configuration | Working local stack |
| 1 | Foundations | Database design, data warehousing, ERD, star schema | ERD + star schema design + presentation |
| 2 | Foundations | Core tools — Python, PySpark, Pandas, PostgreSQL, SQL | End-to-end pipeline: download → load → clean → query |
| 3–5 | Pipelines | ETL vs ELT, dbt transformations, Airflow orchestration | dbt + Airflow ETL pipeline |
| 6–7 | Visualisation | Power BI, DAX, data storytelling, dashboards | Interactive Power BI dashboard |
| 8–12 | Capstone | Integrate all skills on a real analytics problem | Movie analytics system |
What This Bootcamp Will and Will Not Cover
| Covered | Not Covered |
|---|---|
| Relational and non-relational databases | Cloud services (AWS, GCP, Azure) |
| Python, Pandas, PySpark data processing | Production client projects with live business users |
| dbt transformations and data modelling | Real-time streaming infrastructure |
| Airflow orchestration and scheduling | Enterprise-scale deployment |
| Power BI dashboards and DAX | ML model training and deployment |
| Git and GitHub for version control |
Pro Tip
The first week feels like drinking from a firehose. That is normal.
You will not understand everything immediately. Focus on running the commands, then understanding why.
Stuck for more than 30 minutes? Post in the help channel immediately. Do not sit on blockers.