Smoking and Activity Space

Project Summary

Computer Science major Yumin Zhang and IIT student Akhil Kumar Pabbathi spent ten weeks working closely with Dr. Joe McClernon from Psychiatry and Behavioral Sciences to understand smoking and tobacco purchase behavior through activity space analysis.

Themes and Categories
Year
2016

Project Results

The team developed a robust algorithm to extract meaningful features from GPS tracking and subject-indicated smoking and tobacco purchase data, and verified purchases with locations of tobacco retail outlets in Durham County. Yumin and Akhil used clustering techniques and network analysis to compare activity space and length-of-stay behaviors between smokers and non-smokers. Their computational framework has brought valuable insights to the clinical study team and will play an integral role in future studies carried out by Dr. McClernon and colleagues.

Download the Executive Summary (PDF)

Faculty Sponsor

  • Joe McClernon, Associate Professor, Psychiatry and Behavioral Sciences

"Our DATA+ team did wonderful work. I’d wanted a software tool for quantifying human mobility data using network analysis for 5 years—the team got it done in 8 weeks! Paul and Ashlee run a fantastic program that requires minimal oversight on the part of faculty but with maximum returns. Sign me up for next year!" Joe McClernon, Associate Professor, Psychiaty and Behavioral Sciences

Project Manager

  • Tianyuan Liu, UNC-Chapel Hill Transportation Planning and Health Behavior

Participants

  • Yumin Zhang, Duke University Computer Science
  • Pabbathi Akhil Kumar, Duke University Electrical and Computer Engineering

Disciplines Involved

  • Economics
  • Sociology
  • Public Health
  • Psychology
  • All quantitative STEM

Related People

Related Projects

Social and environmental contexts are increasingly recognized as factors that impact health outcomes of patients. This team will have the opportunity to collaborate directly with clinicians and medical data in a real-world setting. They will examine the association between social determinants with risk prediction for hospital admissions, and to assess whether social determinants bias that risk in a systematic way. Applied methods will include machine learning, risk prediction, and assessment of bias. This Data+ project is sponsored by the Forge, Duke's center for actionable data science.

Project Leads: Shelly Rusincovitch, Ricardo Henao, Azalea Kim

Project Manager: Austin Talbot

Producing oil and gas in the North Sea, off the coast of the United Kingdom, requires a lease to extract resources from beneath the ocean floor and companies bid for those rights. This team will consult with professionals at ExxonMobil to understand why these leases are acquired and who benefits. This requires historical data on bid history to investigate what leads to an increase in the number of (a) leases acquired and (b) companies participating in auctions. The goal of this team is to create a well-structured dataset based on company bid history from the U.K. Oil and Gas Authority; data which will come from many different file structures and formats (tabular, pdf, etc.). The team will curate these data to create a single, tabular database of U.K. bid history and work programs.

Producing oil and gas in the Gulf of Mexico requires rights to extract these resources from beneath the ocean floor and companies bid into the market for those rights. The top bids are sometimes significantly larger than the next highest bids, but it’s not always clear why this differential exists and some companies seemingly overbid by large margins. This team will consult with professionals at ExxonMobil to curate and analyze historical bid data from the Bureau of Ocean Energy Management that contains information on company bid history, infrastructure, wells, and seismic survey data as well as data from the companies themselves and geopolitical events. The stretch goal of the team will be to see if they can uncover the rationale behind historic bidding patterns. What do the highest bidders know that other bidders do not (if anything)? What characteristics might incentivize overbidding to minimize the risk of losing the right to produce (i.e. ambiguity aversion)?