The de-duplication problem
nGiven a list of semi-structured records,
n   find all records that refer to a same entity
nExample applications:
§Data warehousing:  merging name/address lists
nEntity:
a)Person
b)Household
nAutomatic citation databases (Citeseer): references
nEntity: paper
n