About My Data+ Experience
"It made me realize data science is much more subjective than I thought. It also taught me about the proper workflow and documentation."
"It made me realize data science is much more subjective than I thought. It also taught me about the proper workflow and documentation."
Ashley Murray (Chemistry/Math), Brian Glucksman (Global Cultural Studies), and Michelle Gao (Statistics/Economics) spent 10 weeks analyzing how meaning and use of the work “poverty” changed in presidential documents from the 1930s to the present. The students found that American presidential rhetoric about poverty has shifted in measurable ways over time. Presidential rhetoric, however, doesn’t necessarily affect policy change. As Michelle Gao explained, “The statistical methods we used provided another more quantitative way of analyzing the text. The database had around 130,000 documents, which is pretty impossible to read one by one and get all the poverty related documents by brute force. As a result, web-scraping and word filtering provided a more efficient and systematic way of extracting all the valuable information while minimizing human errors.” Through techniques such as linear regression, machine learning, and image analysis, the team effectively analyzed large swaths of textual and visual data. This approach allowed them to zero in on significant documents for closer and more in-depth analysis, paying particular attention to documents by presidents such as Franklin Delano Roosevelt or Lyndon B. Johnson, both leaders in what LBJ famously called “The War on Poverty.”
Led by Dr. Eva Wheeler, this project considers how racial language in African American literature and film is rendered for international audiences and traces the spread of these translations. To address the study’s primary questions, the team analyzed a preliminary dataset and explored the relationship between translation strategy and different categories of racial language. The team also conducted a macro-level analysis of the linguistic, temporal, and geographic spread of African American stories using the IMDB and WorldCat databases. We have found a large amount of variation in how African American stories are rendered, which can in part be explained through a social scientific lens.