Dynamic de-duplication of bibliographic data for user services
Alamos National Laboratory, Research Library
Forum, October 26 2004, Baltimore, MD
–Netrics technology is a set of scalable linear-time algorithms that model the human notion
of similarity in order to match related information. Netrics algorithms compute optimal weighted bipartite matching of
letters and polygraphs.
This bipartite matching approach captures a more flexible, more "human" notion of
similarity than that provided
by traditional approaches to inexact matching, such as string edit-distance, dictionary/speller
methods, fuzzy search and probabilistic algorithms.