WHAT IS DATA WAREHOUSE ?
Data warehouse – a logical collection of information – gathered from many different operational databases – that supports business analysis activities and decision-making tasks
The primary purpose of a data warehouse is to combined information throughout an organization into a single repository for decision-making purposes – data warehouse support only analytical processing
Data Warehouse Model
- Extraction, transformation, and loading (ETL) – a process that extracts information from internal and external databases, transforms the information using a common set of enterprise definitions, and loads the information into a data warehouse.
- Data warehouse then send subsets of the information to data mart
- Data mart – contains a subset of data warehouse information
Multidimensional Analysis and Data Mining
- Relational Database contain information in a series of two-dimensional tables.
In a data warehouse and data mart, information is multidimensional, it contains layers of columns and rows
- Dimension – a particular attribute of information.
- Cube – common term for the representation of multidimensional information
- Once a cube of information is created, users can begin to slice and dice the cube to drill down into the information.
- Users can analyze information in a number of different ways and with number of different dimensions.
😍 To perform data mining users need data-mining tools
- Data-mining tool – uses a variety of techniques to find patterns and relationships in large volumes of information. Eg: retailers can use knowledge of these patterns to improve the placement of items in the layout of a mail-order catalog page or Web page.
Information Cleansing or Scrubbing
😍 Information cleansing or scrubbing – a process that weeds out and fixes or discards inconsistent, incorrect, or incomplete information
😍 Occur during ETL process and second on the information once if is in the data warehouse
Contact information in an operational system |
|
Information cleansing activities |
Business Intelligence
😍 Business intelligence – refers to applications and technologies that are used to gather, provide access, analyze data, and information to support decision making effort.
😍 these systems will illustrate business intelligence in the areas of customer profiling, customer support, market research, market segmentation, product profitability, statistical analysis, and inventory and distribution analysis to name a few
😍 Eg: Excel, Access
No comments:
Post a Comment