Queens of Antiquity

Project Summary

Understanding how to generate, analyze, and work with datasets in the humanities is often a difficult task without learning how to code or program. In humanities centered courses, we often privilege close reading or qualitative analysis over other methods of knowing, but by learning some new quantitative techniques we better prepare the students to tackle new forms of reading. This class will work with the data from the HathiTrust to develop ideas for thinking about how large groups and different discourse communities thought of queens of antiquity like Cleopatra and Dido.

Please refer to https://sites.duke.edu/queensofantiquity/ for more information.

Themes and Categories
Year
2018

Graduate Student: Grant Glass

Faculty: Dr. Charlotte Sussman

Course: “Queens of Antiquity” (English 390S-7; Spring 2018)

Grant Glass taught this Data Expedition activity to students in ENGL 290, a spring 2019 course aimed at undergraduates. This experience exemplified that by introducing simple “distant reading” or qualitative concepts in a humanities undergraduate classroom, students would be able to use these tools to drive new types of research questions and think about how reading can include qualitative analysis.

The goals were to give students an introduction to “distant reading,” show how data and collections are created, what algorithms we can apply to those collections, and what types of analysis we can do from the results.

Over the course of two, 1.5-hour class sessions, 10 undergraduates were given the opportunity to create their own datasets and explore the results. For the end product, students created posts to discuss how the visualizations created from their collections helped them better understand.

Guiding Questions

  • What visualization is the most useful? Why?
  • What does the visualization help you understand about the corpus? What does it obscure?
  • What research questions can you generate from the visualization?

The Dataset

Dido

Elizabeth 1

Anne

Cleopatra

In-Class Exercises

Creating Collections with Hathitrust

Understanding the Visualizations

Related People

Related Projects

A large and growing trove of patient, clinical, and organizational data is collected as a part of the “Help Desk” program at Durham’s Lincoln Community Health Center. Help Desk is a group of student volunteers who connect with patients over the phone and help them navigate to community resources (like food assistance programs, legal aid, or employment centers). Data-driven approaches to identifying service gaps, understanding the patient population, and uncovering unseen trends are important for improving patient health and advocating for the necessity of these resources. Disparities in food security, economic stability, education, neighborhood and physical environment, community and social context, and access to the healthcare system are crucial social determinants of health, which studies indicate account for nearly 70% of all health outcomes.

We led a 75-minute class session for the Marine Mammals course at the Duke University Marine Lab that introduced students to strengths and challenges of using aerial imagery to survey wildlife populations, and the growing use of machine learning to address these "big data" tasks.

Most phenomena that data scientists seek to analyze are either spatially or temporally correlated. Examples of spatial and temporal correlation include political elections, contaminant transfer, disease spread, housing market, and the weather. A question of interest is how to incorporate the spatial correlation information into modeling such phenomena.

 

In this project, we focus on the impact of environmental attributes (such as greenness, tree cover, temperature, etc.) along with other socio-demographics and home characteristics on housing prices by developing a model that takes into account the spatial autocorrelation of the response variable. To this aim, we introduce a test to diagnose spatial autocorrelation and explain how to integrate spatial autocorrelation into a regression model

 

 

In this data exploration, students are provided with data collected from remote sensing, census, and Zillow sources. Students are tasked with conducting a regression analysis of real-estate estimates against environmental amenities and other control variables which may or may not include the spatial autocorrelation information.