Data Warehousing | DWH

Complete portal for DWH and BI

Star Schema in Data Warehouse

Star schema contain a single or mutiple fact table surrounded by dimension table.The dimension table in star schema will be denormalized.Most of the datawarehouse are designed according to star schema.Accordingly star schema as following features 1. Single(Simple Star Schema) or multiple(Complex Star Schema) fact table 2. There are fewer number of dimension table hence lesser [...]

grep command in unix with example

Grep command is very useful in UNIX.Grep searches for similar pattern on file and return the result.Example of grep command is grep ‘word’ filename – Will return the line which contain word. grep -i ‘nocase’ filename – This option will make grep case insensitive. grep – r ‘sear’ /etc/ – This will search in the [...]

File Compression and archival in UNIX

There are lot of times when you need to compress and archive files in UNIX in data warehousing project.The reason for this is files in DWH are huge.In this article I will explain how to compress files and than archive them.Files compression is not useful for small files.There are few command and utilities you can [...]

Snowflake Schema in data warehouse

A Snowflake schema has a centralized fact table surrounded by dimension tables.The dimension table in snow flake schema are normalized.There will be single fact table joined to multiple dimension table which in turn may be joined to other dimension tables.So snowflake schema has following features 1. Large number of dimension tables. 2. Since dimension table [...]

ETL tools in data warehousing?

Once you have decided to build a data warehouse for the company.The challenges lies in deciding which ETL tool to use.There are plenty of ETL tools like Pentaho Data Integration,Abnitio.The most used ETL tool is Informatica.ETL stands for Extract,Transform and Load.Although most of ETL tool can perform all the three function.Generally one or more process [...]

What is Slowly changing Dimension?

A slowly changing dimension is dimension table in which attribute changes with time.Suppose we have a customer who has bank account in Chicago.Now he changes his location and moves to new york.A dimension table for this customer will typically contain the following information Now the challenge is to how to store this changing information.In a [...]

What is Dimension Table?

Dimension table contain attribute which describe fact table.In my previous article I explained fact is nothing but  measures to value business like sales.Dimension on the other hand are used to analyze fact.A example of dimension table will be Region Country State City

Fact Table in Datawarehouse

Fact table is the centre table in snow flake scheme.A fact is measure of business for example sales.Generally facts are numbers.Dimension on other help us to analyze facts.Example of Dimension will be Region.Apart from these measures the fact table contain foreign key for dimension table.Fact table is generally large in size.Fact table are highly normalized [...]

Free web directory General Business Web Directory