Source Of Summary:
DM- Sec (DW)_FCDS.pdf
1. π Data Warehouse π
1.1 Overview
- A data warehouse, also called an enterprise data warehouse (EDW):
- An enterprise data platform used for the analysis and reporting of structured and semi-structured data from multiple data sources.
- A data warehouse can store both current and historical data in one place and is designed to give a long-range view of data over time. β
- A data warehouse is a central repository of information: that can be analyzed to make more informed decisions. π

2. ποΈ Data Warehouse Schema
2.1 Star Schema π
- In data warehousing, a star schema: a dimensional model for organizing data into a structure that helps to improve analytical query performance. π
- Two types of tables: fact and dimension. π
- Fact table: sits at the center of the model, surrounded by one or more dimension tables. π―

2.2 Snowflake Schema βοΈ
- Structure: one fact table in the middle, multiple dimension tables connected to it and to one another. π
- Normalization: dimensions are in normalized form across multiple related tables. π―

2.3 Fact Constellation Schema (Galaxy Schema) π
- Definition: two or more fact tables sharing one or more dimensions. Also called Galaxy schema. β¨
- No one βbestβ schema: choice depends on resources, data types, and analytical goals. π‘

3. π Case Study Questions
3.1 Q1 βοΈ
- Scenario: A data warehouse consists of four dimensions (time, item, branch, and location) and two measures (
unit_sold
and dollars_sold
).
- Task: Draw the star schema diagram for this data warehouse. π―

3.2 Q2 βοΈ
- Scenario: A data warehouse consists of seven dimensions (Time, Location, Item, Month, State, Brand, and Customer). π§©
- Location dimension:
- Attributes:
location_id
city
state_id
- Links via
state_id
to the State table, which provides state_id
and name
. π
- Time dimension:
- Attributes:
time_id
timestamp
month_id
- Links via
month_id
to the Month table, which provides month_id
and name
. π
- Item dimension:
- Attributes:
- Links via
brand_id
to the Brand table, which provides brand_id
and name
. π·οΈ
- Task: Draw the snowflake schema diagram for this data warehouse. π―
