Record linkage is the process of finding data that refer to the same entities across several databases.

This will be necessary for finding duplicates in bibliography data (some data may come from CiteSeer, Google Scholar, publisher's page).

Wikipedia: https://en.wikipedia.org/wiki/Record_linkage

See also Data deduplication.