Dynamic de-duplication of bibliographic data for user services
Los
Alamos National Laboratory, Research Library
DLF
Forum, October 26 2004, Baltimore, MD
Results look much better than
those of batch de-duplication approach ~ Netrics matching + training by
librarians
Can de-dup external
data against local data
No batch processing,
but on-the-fly de-duplication
Possibility
to retrain the system to optimize responses without data reprocessing:
machine learning module
Modularity of solution
accommodates growth of dataset
Netrics module can be
used by multiple applications, not just one
Positive collaboration
with Netrics (machine learning, cache)
Will be plugged into
new search environment that is being created
Will be applied to
full-content collection