Heidi Smith (CS, English) and Biniam Garomsa (DataScience, Math) spent ten weeks building tools to assist the David M. Rubenstein Rare Book and Manuscript Library’s mission of finding and describing historically marginalized voices within their collections. The team performed extensive data wrangling, including modern optical character recognition techniques, with the...
Using digitized card catalogs from the David M. Rubenstein Rare Book and Manuscript Library, a team of students explored extracting structured data from over 115,000 subject cards to develop searchable and sortable descriptions of manuscript and archival collections. They prepared the digitized subject cards for online access in the Internet...
Marine mammals exhibit extreme physiological and behavioral adaptations that allow them to dive hundreds to thousands of meters underwater despite their need to breathe air at the surface. Through the development of new remote monitoring technologies, we are just beginning to understand the mechanisms by which they are able to...
A team of students led by researchers in the Nicholas School of the Environment will use satellite imagery and spatial data in Google Earth Engine (GEE) to determine how a quarter-million smallholder farmers across East Africa and India have successfully scaled tree planting as a natural climate solution over the...
Interested in understanding the types of attacks targeting Duke and other universities? Led by OIT and the IT Security Office, students will learn to analyze threat intelligence data to identify trends and patterns of attacks. Duke blocks an average of 1.5 billion malicious connection attempts/day and is working with other universities...
Volumetric segmentation of sub-cortical structures such as the basal ganglia and thalamus is necessary for non-invasive diagnosis and neurosurgery planning. This is a challenging problem due in part to limited boundary information between structures, similar intensity profiles across the different structures, and low contrast data. This work presents a semi-automatic...
Undergraduate students Ellie Burton (BioPhysics/Math, Johns Hopkins University), Kevin Kuo (Electrical and Computer Engineering), and GiSeok Choi (Electrical and Computer Engineering/Math) joined a research group led by Douglas Boyer and Professor Ingrid Daubechies, testing and developing mathematical and statistical methodology for measuring similarities between bones and teeth. Faculty Lead: Ingrid...
Yanchen Ou (Computer Science) and Jiwoo Song (Chemistry, Mechanical Engineering) spent ten weeks building tools to assist in the analysis of smart meter data. Working with a large dataset of transformer and household data from the Kyrgyz Republic, the team built a data preprocessing pipeline and then used unsupervised machine-learning techniques to assess...
Runliang Li (Math), Qiyuan Pan (Computer Science), and Lei Qian (Masters in Statistics and Economic Modelling) spent ten weeks investigating discrepancies between posted wait times and actual wait times for rides at Disney World. They worked with data provided by TouringPlans. Project Results The team built a linear regression model to predict future wait times on given rides...
David Liu (Electrical and Computer Engineering) and Connie Wu (Computer Science/Statistics) spent ten weeks analyzing data about walking speed from the 6th Vital Sign Study. Integrating study data with public data from the American Community Survey, they built interactive visualization tools that will help researchers understand the study results and the representativeness of study participants. Click here...
Computer Science major Yumin Zhang and IIT student Akhil Kumar Pabbathi spent ten weeks working closely with Dr. Joe McClernon from Psychiatry and Behavioral Sciences to understand smoking and tobacco purchase behavior through activity space analysis. Project Results The team developed a robust algorithm to extract meaningful features from GPS tracking and subject-indicated smoking and tobacco purchase...
The goal of this Data Expedition was to introduce students to the exploration of social networks data using R. Students learned to load and plot a social network in R and then perform some basic analyses on two different networks: Hockey Fights in the National Hockey League in 2018-2019 and...
Sharrin Manor, Arjun Devarajan, Wuming Zhang, and Jeffrey Perkins explored a large collection of imagery data provided by the U.S. Geological Survey, with the goal of identifying solar panels using image recognition. They worked closely with the Energy Data Analytics Lab, part of the Energy Initiative at Duke. Project Results The students coded their own proof-of-principle algorithm which identified...
This data expeditions module used three full course sessions to introduce undergraduate hydrology students with minimal programming background to: public water data (water quantity and chemistry); spatial analysis of water data; 2 core spatial datasets produced by the USGS that enable spatial analysis; the programming language R; R-based tools for water...
Andre Wang (Math, Statistics), Michael Xue (Computer Science, ECE), and Ryan Culhane (Computer Science) spent ten weeks exploring the role played by emotion in speech-focused machine-learning. The team used a variety of techniques to build emotion recognition pipelines, and incorporated emotion into generated speech during text-to-speech synthesis. Click here to read the Executive Summary Faculty...
The aim of this data expedition was to give students an introduction to stable isotopes and how the data can be used to understand trophic dynamics. Within a 3-hour lab students were introduced to methane seeps and the difference between photosynthetic and chemosynthetic carbon, before working through an analysis of...
Vivek Sahukar (Masters, Data Science), Yuval Medina (Computer Science), and Jin Cho (Computer Science/Electrical & Computer Engineering) spent ten weeks creating tools to help augment the experience of users in the StreamPULSE community. The team created an interactive guide and used data sonification methods to help users navigate and understand the data, and they used a mixture...