Instance
== object, record, sample, entity, observationAttribute
== characteristic, field, feature, dimensionNumerical
: Made of numbersCategorical
: Made of wordsRecord Data
Transaction data
Graph-based data
Data Cleaning
: Remove noise and correct inconsistencies in data
Data integration
: Merge data from multiple sources into a coherent data store such as a data warehouse
Data transformation / Discretization
: Data are scaled to fall within a smaller range like 0 ~ 1 (Normalization)
Data reduction
: Reduce data size by aggregating, eliminating redundant features, or clustering