Please enable JavaScript.
Coggle requires JavaScript to display documents.
Data Wrangling (Data Pre-processing (1.2 Data Quality Assessment…
Data Wrangling
-
-
-
-
1.4 Data Cleaning
-
- Imputation: dealing with missing values
-
-
-
- Manual Inspection of possible outliers
- Removal of duplicate data:
- duplicate records occur within a signle or combined datasets;
- Redundant attributes can be identified by correlation analysis
- Resolve Inconsistency:
Different formats, codes, and standards across different sources
-
-
-
Data Integration
-
-
Data Fusion: Given a set of two or more records that have been classified to refer to the same entity, create a single record by resolving conflicting data values
-
-
-
-
-
-
-