Posts tagged "HathiTrust"
from Library Tech Talk

HathiTrust Accessible Interface

For the past two years, the University of Michigan Library has been making many of our digitized texts (including in-copyright items) available to persons with print disabilities through the HathiTrust Digital Library. Our Dean, Paul Courant, recently posted about this project on his blog, so I thought it might be nice to offer more background and some technical information about it.

HathiTrust Announces Temporary Beta Catalog

A major milestone for the HathiTrust partners was reached on April 27, 2009, when we released a temporary beta catalog for the digital library, offering bibliographic search (title, author, subject, ISBN, publisher, and year of publication) and faceted browse of all items in HathiTrust. The catalog is built using the open source VuFind software and is temporary, pending the release of a permanent catalog to be built by OCLC in conjunction with a HathiTrust partner team.

Large-scale Full-text Indexing with Solr

A recent blog post pointed out that search is hard when there are many indexes to search because results must be combined. Search is hard for us in DLPS for a different reason: the size of the data. The Library has been receiving page images and OCR from Google for a while now, and the number of OCR'd volumes has passed the 2 million mark. This raises the question of whether it is possible to provide a useful full-text search of the OCR for 2 million volumes. Or more. We are trying...

