Exploratory Data Analysis
This is a stage in Data Analysis/Data Mining
- to get an initial understanding of the data source
- e.g. understand the Distribution of the values in this data
- typically using visual tools: plots and graphs
However, other things can also be done at this stage:
- Summary Statistics
- Univariate Analysis - to analyze how the values of a single variable behave in isolation
- Bivariate Analysis - to analyze how two variables interact
- Correlation
- Chi-Square Test of Independence to see if two variables are dependent
- simple forms of Cluster Analysis, to spot patterns at an early stage
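
The non-visual checks listed above can be sketched in a few lines of Python. This is only a minimal illustration using the standard library: the toy data and the helper names (summarize, pearson, chi_square_statistic) are assumptions made up for this example, not part of any particular tool.

```python
import statistics
from collections import Counter

# Hypothetical toy data, invented purely for illustration.
heights = [150.0, 160.0, 165.0, 170.0, 180.0]
weights = [50.0, 58.0, 63.0, 70.0, 82.0]

# Hypothetical categorical pairs (group, answer), also invented.
survey = (
    [("m", "yes")] * 30 + [("m", "no")] * 10
    + [("f", "yes")] * 10 + [("f", "no")] * 30
)


def summarize(values):
    """Summary statistics for one numeric variable (univariate analysis)."""
    return {
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
    }


def pearson(xs, ys):
    """Pearson correlation coefficient between two numeric variables."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def chi_square_statistic(pairs):
    """Chi-square statistic and degrees of freedom for (a, b) category pairs."""
    n = len(pairs)
    observed = Counter(pairs)
    row_totals = Counter(a for a, _ in pairs)
    col_totals = Counter(b for _, b in pairs)
    stat = 0.0
    for a, row_total in row_totals.items():
        for b, col_total in col_totals.items():
            # Count expected in this cell if the two variables were independent.
            expected = row_total * col_total / n
            stat += (observed[(a, b)] - expected) ** 2 / expected
    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, dof


if __name__ == "__main__":
    print(summarize(heights))
    print(pearson(heights, weights))
    print(chi_square_statistic(survey))
```

The chi-square statistic is then compared against a critical value of the chi-square distribution with the returned degrees of freedom (3.841 for 1 degree of freedom at the 5% level); a larger statistic suggests the two variables are dependent.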
Plots
Main plots used at this stage