This is a stage in Data Analysis/Data Mining

- to get initial understating of the data source
- e.g. understand the Distribution of the values in this data
- typically using visual tools: Plots and graph

However other things can also be done at this stage

- Summary Statistics
- Univariate Analysis - to analyze how variable values behave in isolation
- Bivariate Analysis - to analyze how two variables interact
- Correlation
- Chi-Square Test of Independence to see if two variables are dependent

- also simple forms of Cluster Analysis to spot patterns at earlier stages

Main plots used at this stage