Rubenstein Library’s Card Catalog File


Heidi Smith (Computer Science, English) and Biniam Garomsa (Data Science, Math) spent ten weeks building tools to support the David M. Rubenstein Rare Book and Manuscript Library’s mission of finding and describing historically marginalized voices within its collections. The team performed extensive data wrangling on the card catalog, including modern optical character recognition techniques, and then ran a demographic analysis and a topic modeling analysis on the results. Final deliverables to library professionals included a structured dataset, an interactive web app, and a search tool.


View the team’s project poster here

Watch the team’s final presentation on Zoom:

Project Leads: Meghan Lyon

Project Manager: Anna Holleman


