A team of students will collaborate with Duke librarians to use AI-powered Handwriting Text Recognition (HTR) tools to transform thousands of pages of handwritten text into machine readable data. Using a large dataset of digitized 19th and early 20th-century women’s travel diaries held in the Rubenstein Library students will test and evaluate various HTR technologies, document methods and constraints for extracting text from historical manuscripts, and build an HTR toolset and proof-of-concept interface that the library can build on for future projects. This work will further the library’s initiative to make its historical collections more readily available for computational research (and future Data+ projects!).
Project Leads: Molly Bragg and Noah Huffman
Project Manager: Anna Holleman
Watch the team’s final presentation below: