Dynamic de-duplication of bibliographic data for user services
Los
Alamos National Laboratory, Research Library
DLF
Forum, October 26 2004, Baltimore, MD
•Netrics
properties:
–Forgiving
with respect to errors in dataset
–Forgiving
with respect to errors in query
–Compares
strings like humans do
–Response can be optimized for specific datasets:
machine-learning module
–Performance
scales well with growing dataset
–RAM-based
index
•
–
•
•
•