Quantifying Rare Diseases in Duke Health System

Project Summary

Gary Koplik (Master's in Economics and Computation) and Matt Tribby (CompSci, Statistics) spent ten weeks investigating the burden of rare diseases on the Duke University Health System (DUHS). They worked with a massive set of ICD diagnosis codes and visit data provided by DUHS.

Year: 2017
Contact: Paul Bendich
Center for Applied Genomics and Precision Medicine
bendich@math.duke.edu

Project Results: The team created cohorts of patients with and without rare disease diagnosis codes and performed exploratory comparisons between them. They identified key roadblocks that the current ICD hierarchy poses to the analysis of rare diseases and created a compelling plan for future work.

Faculty Lead: Rachel Richesson

Project Manager: Isaac Lavine

 

 

"I've gained an appreciation for the all-important data 'pre-processing' that takes up the vast majority of the effort when working with health data." — Isaac Lavine, Project Manager and PhD Student in Statistical Science at Duke University

Related Projects

Alexa Goble (Finance) joined Econ majors Chavez Cheong and Eli Levine in a ten-week exploration of mortgage enforcement actions related to the global financial crisis of 2007-2008. Using NLP techniques on mortgage data from Ohio and Massachusetts, the team validated a new experimental approach to understanding the dynamics between state regulatory agencies, mortgage lenders, brokers, and loan originators. This project was a continuation of two previous Data+ projects:

https://bigdata.duke.edu/projects/american-predatory-lending-global-financial-crisis

https://bigdata.duke.edu/projects/american-predatory-lending-and-global-financial-crisis-year-2

 


Project Lead: Lee Reiners

Project Manager: Malcolm Smith Fraser

Stats/Sociology major Mitchelle Mojekwu joined Neuroscience majors Kassie Hamilton and Zineb Jaidi in a ten-week exploration of data relevant to an upcoming public school zone redistricting in Durham County. Using information acquired from the General Social Survey and the US Census, the team applied modern mathematical and statistical methods for generating proposed redistricting plans, with the aim of providing decision-makers with information they can use to produce school districts that are equitable and reflective of the Durham County student population.


Faculty Lead: Greg Herschlag

Project Manager: Bernard Coles

 

Pryia Juarez (BME/ECE), Jonathan Pilland (ECE/BME), and Matthew Traum (CS/Econ) spent ten weeks analyzing sensor data synthesized by an agile waveform generator. The team used deep reinforcement learning techniques to evaluate the performance of different synthetic agents representing potential attackers of the sensor system.

 


Faculty leads: Robert Calderbank, Vahid Tarokh, Ali Pezeshki

Client leads: Dr. Lauren Huie, Dr. Elizabeth Bentley, Dr. Zola Donovan, Dr. Ashley Prater-Bennette, Dr. Erin Trip

Project Manager: Suya Wu