Food Data Warehousing
Food Data Warehouse (ETL + Star Schema + Tableau Dashboards)
Saquib Hazari
π Sep 25, 2025
This project showcases an end-to-end data warehousing solution built using SQL Server and follows the Medallion Architecture (Bronze, Silver, Gold). It simulates a retail food business by consolidating synthetic ERP and CRM data sources into a robust, analytics-ready star schema.
Designed as a portfolio project, it demonstrates real-world practices in data engineering, ETL, data modeling, and analytical querying, serving as a powerful resource for data-driven decision-making.
The warehouse follows the Medallion Architecture:
.csv formatfact_salesdim_customers, dim_products


Ingest and unify data from multiple sources (ERP + CRM)
Implement ETL pipelines for each schema layer
Design a star schema for analytics
Generate valuable business insights using SQL
Visualize insights using Tableau dashboards
Dashboard:

| Table Name | Description |
|---|---|
dim_customers |
Customer demographics and profile |
dim_products |
Product attributes and categories |
fact_sales |
Transactions and order metrics |
dim_customers
|
|
fact_sales
|
|
dim_products
goldcategory, channel, engaged, payment_method).dbt for integrity testsGreat Expectations for profiling and validationπ§ This markdown serves as an EDA summary for documentation and QA. Use it as a living checklist for data engineers and analysts to validate your gold layer.
.
βββ products.csv # Product data
βββ sales.csv # Sales transaction data
βββ License # Project license (e.g., MIT)
βββ Procedures/ # ETL procedures for data loading
β βββ bronze_layer_batch.sql # ETL batch script for Bronze layer
β βββ main.sql # Main SQL script for the project
β βββ silver_layer_batch.sql # ETL batch script for Silver layer
βββ project/ # Project-related files and analysis
β βββ Business_analysis/ # Business logic & insight generation
β βββ advance_analysis.sql # Advanced analysis SQL script
β βββ complex_analysis.sql # Additional complex analysis script
βββ assets # Project images and visuals
β βββ Integration_food_dataset.png
β βββ integration_model.png
βββ Business_Analysis # SQL analysis for business insights
β βββ complex_analysis_1.sql
β βββ complex_analysis_2.sql
βββ CSV # Raw data files
β βββ dim_customer.csv
β βββ dim_products.csv
β βββ fact_sales.csv
βββ EDA # Placeholder for exploratory data analysis
βββ food_DB # Source database files (CRM & ERP data)
β βββ crm_customers.csv
β βββ crm_feedback.csv
β βββ crm_marketing.csv
β βββ erp_order_items.csv
β βββ erp_orders.csv
β βββ erp_payments.csv
β βββ food_products.csv
βββ Gold # SQL for Gold Layer tables and views
β βββ god_layer_tables.sql
β βββ gold_layer_view.sql
βββ License # Licensing information
βββ procedures # Data transformation batch processes
β βββ bronze_layer_batch.sql
β βββ silver_layer_batch.sql
βββ python # Python scripts for data processing
β βββ main.py
βββ readme.md # Project overview and setup instructions
βββ requirements.txt # Python packages needed for the project
βββ script # SQL DDL scripts for database schema
β βββ ddl_bronze.sql
β βββ ddl_gold_layer.sql
β βββ ddl_silver_layer.sql
β βββ index.sql
βββ todo.todo # Project task tracking file
This project is licensed under the MIT License. You are free to use, modify, and share this project with proper attribution.
Hi there! Iβm a data science professional with a background in aeronautical engineering, now transitioning into marketing analytics. With a strong foundation in analytical thinking and data-driven decision-making, Iβve worked on a range of data science projects, including predictive modeling, ETL pipelines, and data warehousing.
Currently, I focus on:
I enjoy exploring new technologies in machine learning, AI, and deep learning. My approach is hands-on, and I love learning by doing β whether itβs building machine learning models or fine-tuning complex data pipelines.
Feel free to connect or check out my previous projects! Data warehouse project