Please enable JavaScript.
Coggle requires JavaScript to display documents.
Summarization (Summarize (Common summary tool names (Group, Group by), 1…
Summarization
Summarize
Available in most programming tools
Common summary tool names
Group
Group by
1 or more columns selected to group data by
Creates a virtual bucket for reach unique group
Once relevant rows are in the bucket, data within to be summarized
Placed in appropriate bucket, the preferred function can be applied across
Typical functions that can be applied to data
Count, sum, min, max, first, last, average, median, mode, standard deviation
Cross tab
Uses content inside columns to create new columns
Deal with data in skinny form
Skinny form
Usually not at the right level for analysis
Transform what is listed in rows to column forms
Can create new features for machine learning to predict a target
Skinny table
Common shape for data to take
Must summarize data when
Does not focus directly on the right unit