Heidi Smith (CS, English) and Biniam Garomsa (DataScience, Math) spent ten weeks building tools to assist the David M. Rubenstein Rare Book and Manuscript Library’s mission of finding and describing historically marginalized voices within their collections. The team performed extensive data wrangling, including modern optical character recognition techniques, with the card catalog, and then did a demographic analysis and a topic modeling analysis with the results. Final deliverables to library professionals included a structured dataset, an interactive web app, and a search tool.
Watch the team’s final presentation on Zoom:
Project Leads: Meghan Lyon
Project Manager: Anna Holleman