RTI-Duke Data+ Practicum

Project Summary

Lindsay Hirschhorn (Mechanical Engineering) and Kelsey Sumner (Global Health and Evolutionary Anthropology) spent ten weeks determining optimal vaccination clinic locations in Durham County for a simulated Zika virus outbreak. They worked closely with researchers at RTI International to construct models of disease spread and health impact, and developed an interactive visualization tool.

Themes and Categories
Year
2016

Project Results

Using a simulated population of Durham county based on the 2010 UC Decennial Census, the team developed a model for disease spread based on transmission characteristics of Zika virus. They then determined optimal placement for vaccination clinics to minimize public health impact with several clinic placement algorithms including Cover Tree assignment, recursive removal of vaccinated population, and geospatial proximity to clusters of residents. Kelsey and Lindsay also developed an R/Shiny application for interactive visualization of their results.

Download the Executive Summary (PDF)

Project Manager

"I was very impressed by the Data+ program and would encourage both undergraduate students and employers to consider it as a strong opportunity for valuable experience and potential recruitment. The students involved in Data+ demonstrated excellence in scholarship, creativity, and output. The administrators and directors of Data+ have done a tremendous job in creating a learning environment that promotes cross collaboration, drives innovation and creative problem solving, and provides enough structure and mentoring as to guide students to successful outcomes. We will continue to stay in contact with Data+, and we look forward to another potential sponsorship in the future." Thom Miano, RTI International

Participants

Disciplines Involved

  • Public Health
  • Applied Mathematics
  • All quantitative STEM

"Working with Data+ for the past two summers has taught me not only new data analysis techniques but also the importance of collaboration and sharing ideas. This summer, my team’s project was focused on Zika virus vaccination clinic location optimization. To determine the best locations for clinics, we created a disease spread simulation and calculated the number of days to vaccinate the population to herd immunity. The simulation was a great way to choose the best clinic distribution model; however, it was difficult to effectively present the disease transmission simulation to an audience. After hearing Matt Ross present about his project last year creating interactive Shiny Apps, we started an application under the mentorship of Matt and Data+ 2015 intern, Molly Rosenstein. With their help, my team was able to create an interactive Shiny App within a week. This fast-paced, in-depth learning process is something only found in a multidisciplinary environment like Data+." Kelsey Sumner, Evolutionary Anthropology & Global Health major

Related People

Related Projects

Social and environmental contexts are increasingly recognized as factors that impact health outcomes of patients. This team will have the opportunity to collaborate directly with clinicians and medical data in a real-world setting. They will examine the association between social determinants with risk prediction for hospital admissions, and to assess whether social determinants bias that risk in a systematic way. Applied methods will include machine learning, risk prediction, and assessment of bias. This Data+ project is sponsored by the Forge, Duke's center for actionable data science.

Project Leads: Shelly Rusincovitch, Ricardo Henao, Azalea Kim

Project Manager: TBD

Producing oil and gas in the North Sea, off the coast of the United Kingdom, requires a lease to extract resources from beneath the ocean floor and companies bid for those rights. This team will consult with professionals at ExxonMobil to understand why these leases are acquired and who benefits. This requires historical data on bid history to investigate what leads to an increase in the number of (a) leases acquired and (b) companies participating in auctions. The goal of this team is to create a well-structured dataset based on company bid history from the U.K. Oil and Gas Authority; data which will come from many different file structures and formats (tabular, pdf, etc.). The team will curate these data to create a single, tabular database of U.K. bid history and work programs.

Producing oil and gas in the Gulf of Mexico requires rights to extract these resources from beneath the ocean floor and companies bid into the market for those rights. The top bids are sometimes significantly larger than the next highest bids, but it’s not always clear why this differential exists and some companies seemingly overbid by large margins. This team will consult with professionals at ExxonMobil to curate and analyze historical bid data from the Bureau of Ocean Energy Management that contains information on company bid history, infrastructure, wells, and seismic survey data as well as data from the companies themselves and geopolitical events. The stretch goal of the team will be to see if they can uncover the rationale behind historic bidding patterns. What do the highest bidders know that other bidders to not (if anything)? What characteristics might incentivize overbidding to minimize the risk of losing the right to produce (i.e. ambiguity aversion)?