Please enable JavaScript.
Coggle requires JavaScript to display documents.
Summarization (typical functions that can be applied to data inside each…
Summarization
-
Crosstab
where summarize provides summary and aggregate information on existing
columns, crosstab uses the content inside columns to create new columns
a way to deal with data that is currently in “skinny” form and transform
what is currently listed in rows to column-form
kind of “skinny” data is seldom
at the right level for analysis, and crosstab makes data available in an intuitive and
readable fashion, sometimes also creating new features for machine learning to
better predict a target
-
when data is available that
does not focus directly on the right unit, it must be aggregated or summarized
-
-
-
when summarizing, one or more columns by
which to group data is selected, essentially creating a virtual “bucket” for each
unique group