OXFORD UNIVERSITY COMPUTING LABORATORY

Building a recommendation Engine from Noisy, Duplicated Data

Jason Hoyt (Mendeley Corporation)

info

date

10th November 2009 (week 5, Michaelmas Term 2009)

time

11:30

place

478

abstract

Building a recommendation engine from plain text data is a difficult task. Beyond plain text, noisy, inaccurate, and duplicated metadata from text extraction of PDF documents presents an enormous challenge. Mendeley is a reference manager for researchers that that is doing just that. The infrastructure and data mining requirements to build a recommendation engine from text-based PDFs will be discussed.

further info

related series

Random Image
Random Image
Random Image