If n is the number of samples required and there are k
processors, then each processor takes n/k samples from its local
partition.
The sampling can be done from
* B+ Trees
* Hash tree sampling
* Dense Index sampling
These methods do not give unbiased sampling as the probability of choosing
samples from a node containing a lesser number of tuple pointers is greater
than choosing from other nodes.