Analyzing selected instances
nFraction of duplicates in selected instances: 44% starting with only 0.5%
nIs the gain due to increased fraction of duplicates?
nReplaced non-duplicates in selected set with random non-dups
nResult à only 40% accuracy!!! 
n