Data Reduction

In Machine Learning and Data Mining

  • How to speed up computation for our model?


There are two approaches:

  • reducing the number of rows
  • reducing the number of columns


Rows

The main approach is to randomly select a subset of the dataset


Columns

Main approach: remove dependent variables


Other techniques:


See Also

Sources