Dynamic de-duplication of bibliographic data for user services
Los
Alamos National Laboratory, Research Library
DLF
Forum, October 26 2004, Baltimore, MD
•Netrics
elevator pitch:
–
– Netrics technology is a set of scalable linear-time algorithms that model the human notion
of similarity in order to match related information. Netrics algorithms compute optimal weighted bipartite matching of
letters and polygraphs.
This bipartite matching approach captures a more flexible, more "human" notion of
similarity than that provided
by traditional approaches to inexact matching, such as string edit-distance, dictionary/speller
correction, automaton-based
methods, fuzzy search and probabilistic algorithms.
•
•
•