<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Data Warehousing &#124; DWH &#187; Basic Concepts in Data Warehousing</title>
	<atom:link href="http://www.dwhnow.com/category/basic-concepts-in-data-warehousing/feed" rel="self" type="application/rss+xml" />
	<link>http://www.dwhnow.com</link>
	<description>Complete portal for DWH and BI</description>
	<lastBuildDate>Tue, 02 Feb 2010 12:06:22 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Star Schema in Data Warehouse</title>
		<link>http://www.dwhnow.com/star-schema-in-data-warehouse/28</link>
		<comments>http://www.dwhnow.com/star-schema-in-data-warehouse/28#comments</comments>
		<pubDate>Tue, 02 Feb 2010 12:06:22 +0000</pubDate>
		<dc:creator>Admin</dc:creator>
				<category><![CDATA[Basic Concepts in Data Warehousing]]></category>

		<guid isPermaLink="false">http://www.dwhnow.com/?p=28</guid>
		<description><![CDATA[Star schema contain a single or mutiple fact table surrounded by dimension table.The dimension table in star schema will be denormalized.Most of the datawarehouse are designed according to star schema.Accordingly star schema as following features 1. Single(Simple Star Schema) or multiple(Complex Star Schema) fact table 2. There are fewer number of dimension table hence lesser [...]]]></description>
			<content:encoded><![CDATA[<p>Star schema contain a single or mutiple fact table surrounded by dimension table.The dimension table in star schema will be denormalized.Most of the datawarehouse are designed according to star schema.Accordingly star schema as following features</p>
<p>1. Single(Simple Star Schema) or multiple(Complex Star Schema) fact table<br />
2. There are fewer number of dimension table hence lesser joines and queries execute faster<br />
3. Considered a special case of snowflake schema.<br />
4. Dimensional tables are generally in 2nd normal form while fact table are in third normal form</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dwhnow.com/star-schema-in-data-warehouse/28/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>grep command in unix with example</title>
		<link>http://www.dwhnow.com/grep-command-in-unix-with-example/34</link>
		<comments>http://www.dwhnow.com/grep-command-in-unix-with-example/34#comments</comments>
		<pubDate>Fri, 15 Jan 2010 08:24:44 +0000</pubDate>
		<dc:creator>Admin</dc:creator>
				<category><![CDATA[Basic Concepts in Data Warehousing]]></category>
		<category><![CDATA[UNIX in Data Warehousing]]></category>

		<guid isPermaLink="false">http://www.dwhnow.com/?p=34</guid>
		<description><![CDATA[Grep command is very useful in UNIX.Grep searches for similar pattern on file and return the result.Example of grep command is grep &#8216;word&#8217; filename &#8211; Will return the line which contain word. grep -i &#8216;nocase&#8217; filename &#8211; This option will make grep case insensitive. grep &#8211; r &#8216;sear&#8217; /etc/ &#8211; This will search in the [...]]]></description>
			<content:encoded><![CDATA[<p>Grep command is very useful in UNIX.Grep searches for similar pattern on file and return the result.Example of grep command is<br />
grep &#8216;word&#8217; filename &#8211; Will return the line which contain word.<br />
grep -i &#8216;nocase&#8217; filename &#8211; This option will make grep case insensitive.<br />
grep &#8211; r &#8216;sear&#8217; /etc/ &#8211; This will search in the whole directory recursively<br />
grep &#8220;boo&#8221; filename &#8211; This will display line which contain exact match of the word boo.<br />
egrep -w &#8216;word1|word2&#8242; filename &#8211; This will search for both the word1 and word2 in the filename.<br />
grep &#8211; c word1 filename &#8211; Will return the number of lines where word1 is found.<br />
grep -v word1 filename &#8211; Will return the line which doesn&#8217;t contain the word1.<br />
grep word1 * &#8211; Will search for occurence of word1 in whole directory<br />
grep ^a.ad fruitlist.txt &#8211; Print line that contain the letter followed by a and in last contain letter ad.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dwhnow.com/grep-command-in-unix-with-example/34/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>File Compression and archival in UNIX</title>
		<link>http://www.dwhnow.com/file-compression-and-archival-in-unix/30</link>
		<comments>http://www.dwhnow.com/file-compression-and-archival-in-unix/30#comments</comments>
		<pubDate>Fri, 15 Jan 2010 07:23:56 +0000</pubDate>
		<dc:creator>Admin</dc:creator>
				<category><![CDATA[Basic Concepts in Data Warehousing]]></category>
		<category><![CDATA[UNIX in Data Warehousing]]></category>

		<guid isPermaLink="false">http://www.dwhnow.com/?p=30</guid>
		<description><![CDATA[There are lot of times when you need to compress and archive files in UNIX in data warehousing project.The reason for this is files in DWH are huge.In this article I will explain how to compress files and than archive them.Files compression is not useful for small files.There are few command and utilities you can [...]]]></description>
			<content:encoded><![CDATA[<p>There are lot of times when you need to compress and archive files in UNIX in data warehousing project.The reason for this is files in DWH are huge.In this article I will explain how to compress files and than archive them.Files compression is not useful for small files.There are few command and utilities you can use to compress files.They are as follows </p>
<p>1. compress<br />
2. gzip<br />
3. tar(with c option) &#8211; Arcchives files without compression<br />
Archive may contain one or more files.Generally archival is done for files which are not used much.<br />
<span id="more-30"></span><br />
The command to decompress are as follows respectively<br />
1.uncompress<br />
2. gunzip<br />
3. tar(with x option)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dwhnow.com/file-compression-and-archival-in-unix/30/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Snowflake Schema in data warehouse</title>
		<link>http://www.dwhnow.com/snowflake-schema-in-data-warehouse/25</link>
		<comments>http://www.dwhnow.com/snowflake-schema-in-data-warehouse/25#comments</comments>
		<pubDate>Mon, 11 Jan 2010 06:57:52 +0000</pubDate>
		<dc:creator>Admin</dc:creator>
				<category><![CDATA[Basic Concepts in Data Warehousing]]></category>

		<guid isPermaLink="false">http://www.dwhnow.com/?p=25</guid>
		<description><![CDATA[A Snowflake schema has a centralized fact table surrounded by dimension tables.The dimension table in snow flake schema are normalized.There will be single fact table joined to multiple dimension table which in turn may be joined to other dimension tables.So snowflake schema has following features 1. Large number of dimension tables. 2. Since dimension table [...]]]></description>
			<content:encoded><![CDATA[<p>A Snowflake schema has a centralized fact table surrounded by dimension tables.The dimension table in snow flake schema are normalized.There will be single fact table joined to multiple dimension table which in turn may be joined to other dimension tables.So snowflake schema has following features</p>
<p>1. Large number of dimension tables.<br />
2. Since dimension table are normalized the size of dimension table is small<br />
3. For query purpose we need to retrieve data from multiple dimension table resulting in more joins and slower performance.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dwhnow.com/snowflake-schema-in-data-warehouse/25/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ETL tools in data warehousing?</title>
		<link>http://www.dwhnow.com/etl-tools-in-data-warehousing/19</link>
		<comments>http://www.dwhnow.com/etl-tools-in-data-warehousing/19#comments</comments>
		<pubDate>Fri, 08 Jan 2010 07:16:25 +0000</pubDate>
		<dc:creator>Admin</dc:creator>
				<category><![CDATA[Basic Concepts in Data Warehousing]]></category>

		<guid isPermaLink="false">http://www.dwhnow.com/?p=19</guid>
		<description><![CDATA[Once you have decided to build a data warehouse for the company.The challenges lies in deciding which ETL tool to use.There are plenty of ETL tools like Pentaho Data Integration,Abnitio.The most used ETL tool is Informatica.ETL stands for Extract,Transform and Load.Although most of ETL tool can perform all the three function.Generally one or more process [...]]]></description>
			<content:encoded><![CDATA[<p>Once you have decided to build a data warehouse for the company.The challenges lies in deciding which ETL tool to use.There are plenty of ETL tools like Pentaho Data Integration,Abnitio.The most used ETL tool is Informatica.ETL stands for Extract,Transform and Load.Although most of ETL tool can perform all the three function.Generally one or more process of extraction,transformation and load may be performed by some external software.Lets us understand all the three process one by one<br />
<span id="more-19"></span></p>
<p>Extract &#8211; The process involves extraction fo data from homogenoes or heterogenoes sources to load into the data warehouse.</p>
<p>Transform &#8211; Once the data is extracted various transformation need to be performed so that the data is useful for various analysis.</p>
<p>Loading &#8211; The last step is to load the the data into datawarehouse.</p>
<p>The list of popular ETL tools in the market are<br />
1. Informatica<br />
2. Oracle Warehouse Builder<br />
3. Pentaho<br />
4. Ab Initio<br />
5. Datastage</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dwhnow.com/etl-tools-in-data-warehousing/19/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>What is Slowly changing Dimension?</title>
		<link>http://www.dwhnow.com/what-is-slowly-changing-dimension/12</link>
		<comments>http://www.dwhnow.com/what-is-slowly-changing-dimension/12#comments</comments>
		<pubDate>Sun, 03 Jan 2010 06:34:36 +0000</pubDate>
		<dc:creator>Admin</dc:creator>
				<category><![CDATA[Basic Concepts in Data Warehousing]]></category>

		<guid isPermaLink="false">http://dwhnow.com/?p=12</guid>
		<description><![CDATA[A slowly changing dimension is dimension table in which attribute changes with time.Suppose we have a customer who has bank account in Chicago.Now he changes his location and moves to new york.A dimension table for this customer will typically contain the following information Now the challenge is to how to store this changing information.In a [...]]]></description>
			<content:encoded><![CDATA[<p>A slowly changing dimension is dimension table in which attribute changes with time.Suppose we have a customer who has bank account in Chicago.Now he changes his location and moves to new york.A dimension table for this customer will typically contain the following information</p>

<table id="wp-table-reloaded-id-2-no-1" class="wp-table-reloaded wp-table-reloaded-id-2">
<tbody>
	<tr class="row-1 odd">
		<td class="column-1">Name</td><td class="column-2">Age</td><td class="column-3">City</td><td class="column-4">Gender</td>
	</tr>
	<tr class="row-2 even">
		<td class="column-1">Jack</td><td class="column-2">25</td><td class="column-3">Chicago</td><td class="column-4">Male</td>
	</tr>
	<tr class="row-3 odd">
		<td class="column-1"></td><td class="column-2"></td><td class="column-3"></td><td class="column-4"></td>
	</tr>
	<tr class="row-4 even">
		<td class="column-1"></td><td class="column-2"></td><td class="column-3"></td><td class="column-4"></td>
	</tr>
	<tr class="row-5 odd">
		<td class="column-1"></td><td class="column-2"></td><td class="column-3"></td><td class="column-4"></td>
	</tr>
</tbody>
</table>

<p>Now the challenge is to how to store this changing information.In a datawarehouse this is generally done in three ways.Accordingly such type of dimension are called type 1,type 2 and type 3</p>
<p><span id="more-12"></span></p>
<p>Type 1 Dimension<br />
Type 1 dimension will not maintain history of the data at all.In the following example if the customer moves from one city to another.The new information will overwrite the old information.</p>
<p>Type 2 Dimension<br />
As you remember in my previous article I explained we have surrogate key as primary key for dimension table.Now in type 2 dimension we maintain all the history.Suppose Jack is customer of bank he moves from Chicago to Miami and than to New york.The table will contain all this information in record form.Each time a new surrogate key is for created for each record.</p>
<p>Type 3 Dimension<br />
Type 3 dimension will generally store limited history value.In most cases it will store the current and previous value..Suppose Jack is customer of bank he moves from Chicago to Miami and than to New york.Table will store Miami and New York.The information for Chicago will be deleted.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dwhnow.com/what-is-slowly-changing-dimension/12/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What is Dimension Table?</title>
		<link>http://www.dwhnow.com/what-is-dimension-table/8</link>
		<comments>http://www.dwhnow.com/what-is-dimension-table/8#comments</comments>
		<pubDate>Sun, 03 Jan 2010 06:11:38 +0000</pubDate>
		<dc:creator>Admin</dc:creator>
				<category><![CDATA[Basic Concepts in Data Warehousing]]></category>

		<guid isPermaLink="false">http://dwhnow.com/?p=8</guid>
		<description><![CDATA[Dimension table contain attribute which describe fact table.In my previous article I explained fact is nothing but  measures to value business like sales.Dimension on the other hand are used to analyze fact.A example of dimension table will be Region Country State City Now Fact table will contain some column like sales.So dimension table can be [...]]]></description>
			<content:encoded><![CDATA[<p>Dimension table contain attribute which describe fact table.In my previous article I explained fact is nothing but  measures to value business like sales.Dimension on the other hand are used to analyze fact.A example of dimension table will be</p>
<table border="1">
<tbody>
<tr>Region</tr>
</tbody>
</table>
<table border="2">
<tbody>
<tr>Country</tr>
<tr>State</tr>
<tr>City</tr>
</tbody>
</table>
<p><span id="more-8"></span><br />
Now Fact table will contain some column like sales.So dimension table can be used to analyze sales according to Country like USA,UK.Dimension table generally have  surrogate key as primary key.The size of dimension table is small.Unlike fact table which grows in size dimension table remain mostly static.Referential integrity must be maintained between dimension table and fact table.<br />
Dimension table will generally contain hierarchies.Hierarchy help in analyzing the data.For example in the table the hierarchy is Country,State and City.This help in knowing the sales in particular city,state or country.Mainly dimensional table will contain textual information which make sense like country.year,etc.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dwhnow.com/what-is-dimension-table/8/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Fact Table in Datawarehouse</title>
		<link>http://www.dwhnow.com/fact-table-in-datawarehouse/3</link>
		<comments>http://www.dwhnow.com/fact-table-in-datawarehouse/3#comments</comments>
		<pubDate>Thu, 31 Dec 2009 11:22:49 +0000</pubDate>
		<dc:creator>Admin</dc:creator>
				<category><![CDATA[Basic Concepts in Data Warehousing]]></category>

		<guid isPermaLink="false">http://dwhnow.com/?p=3</guid>
		<description><![CDATA[Fact table is the centre table in snow flake scheme.A fact is measure of business for example sales.Generally facts are numbers.Dimension on other help us to analyze facts.Example of Dimension will be Region.Apart from these measures the fact table contain foreign key for dimension table.Fact table is generally large in size.Fact table are highly normalized [...]]]></description>
			<content:encoded><![CDATA[<p>Fact table is the centre table in snow flake scheme.A fact is measure of business for example sales.Generally facts are numbers.Dimension on other help us to analyze facts.Example of Dimension will be Region.Apart from these measures the fact table contain foreign key for dimension table.Fact table is generally large in size.Fact table are highly normalized because of that reason.<br />
<span id="more-3"></span><br />
Measures in Fact Table can be of three type</p>
<p>Additive: Additive facts are facts that can be summed up through all of the dimensions in the fact table.<br />
Semi-Additive: Semi-additive facts are facts that can be summed up for some of the dimensions in the fact table but not the others.<br />
Non-Additive: Non-additive facts are facts that cannot be summed up for any of the dimensions present in the fact table.</p>
<p>Let us see all three type of measures in fact table.Suppose you have fact table has</p>
<p>Region<br />
Year<br />
Per-Profit<br />
Sale</p>
<p>Sale and Per-Profit are the facts while region &amp; Year are dimension.Sale is additive fact since it makes sense to add it across Region and Year.Percentage profit is non-additive fact as it doesn&#8217;t make sense to add it across Region and Year.</p>
<p>Every fact table has primary key which help in uniquely identifying the record.<br />
Some fact table don&#8217;t contain facts at all.Such table are called factless Fact table.These table are used to find event which occurred or didn&#8217;t occurred.</p>
<p>The other concept of fact table is to understand what is meant by granularity.Granularity means what is the lowest level of information that will be stored in fact table.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dwhnow.com/fact-table-in-datawarehouse/3/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
