Please enable JavaScript.
Coggle requires JavaScript to display documents.
Data warehouse architectural components - Coggle Diagram
Data warehouse architectural components
Components
Data
Things needed to prepare data and store it
Ways to deliver information to user
Major areas
Data acquisition
Data staging for cleansing, transformation and integration of data
Data transformation
Map input data to data for data warehouse repository
Clean data, deduplicate, and merge/purge
Denormalize extracted data structures as required by the dimensional model of the data warehouse
Convert data types
Calculate and derive attribute values
Check for referential integrity
Aggregate data as needed
Resolve missing values
Consolidate and integrate data
Data staging
Provide backup and recovery for staging area repositories
Sort and merge files
Create files as input to make changes to dimension tables
If data staging storage is a relational database, create and populate database
Preserve audit trail to relate each data item in the data warehouse to input source
Resolve and create primary and foreign keys for load tables
If staging area storage is a relational database, extract load files
Extraction of data
Select data sources and determine the what types of filters to be applied to individual sources
Generate automatic extract files from operational systems using replication and other techniques
Create intermediary files to store selected data to be merged later
Provide automated job control services for creating extract files.
Transport extracted files from multiple platforms
Reformat input from outside sources
Reformat input from departmental data files, databases, and spreadsheets
Generate common application code for data extraction
Resolve inconsistencies for common data elements from multiple sources
Data storage
Load data for full refreshes of data warehouse tables
Perform incremental loads at regular prescribed intervals
Support loading into multiple tables at the detailed and summarized levels
Optimize the loading process
Provide automated job control services for loading the data warehouse
Provide backup and recovery for the data warehouse database
Provide security
Monitor and fine-tune the database
Information delivery
Monitor user access to improve service and for future enhancements
Allow users to browse data warehouse content
Provide security to control information access
Automatically reformat queries for optimal execution
Provide self-service report generation for users, consisting of a variety of flexible options to create, schedule, and run reports
Store result sets of queries and reports for future use
Provide multiple levels of data granularity
Make provision for the users to perform complex analysis through online analytical processing (OLAP)
Enable data feeds to downstream, specialized decisions support systems such as EIS and data mining
Management and Control
Characteristics
Different Objectives and Scope
Data content
read-only
Must be processed
support storing of data grouped by business object, not applications
Complex analysis and quick response
Flexible and dynamic
Flex enough to solve the additional requirement probllems
Ready for changing requirements
Meta data