Prolific Pigs? Mating Reproductive Capacity with Market Price in Early Twentieth Century Pig Breeding

Project Summary

Yanmin (Mike) Ma, mathematics/economics major, and Manchen (Mercy) Fang, electrical and computer engineering/computer science major, spent ten weeks studying historical archives and building a model to predict the transaction price of Berkshire pigs from factors such as the pig's age, its number of offspring, and the prevailing pork price.

Themes and Categories

Year: 2015

Contact: Paul Bendich, Mathematics, bendich@math.duke.edu

Project Results

The transaction price of a Berkshire pig is influenced by factors including the pig's age, its number of offspring, and the prevailing pork price. We conclude that between 1910 and 1920, breeders and buyers priced Berkshire pigs rationally to a large extent.

Download the executive summary (PDF).

Disciplines Involved

  • Economics
  • History

Project Team

Undergraduates: Yanmin (Mike) Ma, mathematics/economics major, and Manchen (Mercy) Fang, electrical and computer engineering/computer science major

Client: Gabriel Rosenberg, Assistant Professor, Women's Studies Program

Graduate student mentor: Chris Glynn, Department of Statistical Science

From left to right: Manchen (Mercy) Fang; Chris Glynn; Yanmin (Mike) Ma

"Working with Mercy Fang and Mike Ma has been a pleasure. Throughout the Data+ experience, Mike and Mercy demonstrated creativity, diligence, and technical ability in their research on Berkshire pigs. Our primary objective was to build a data set that would cross-reference genealogical, market, and geographic data on individual Berkshire pigs. To collect this data, Mike and Mercy worked with original scans of archived primary source publications. Together, they developed efficient algorithms and error-correcting mechanisms to extract data from PDF documents. Mike and Mercy taught me a lot about working with text-data, the perils of optical character recognition, and the factors that drove hog markets in the early 1900s.

The pride that they took in their research is evident in their Data+ seminar talks and their final poster. Most impressively, Mike and Mercy were extremely independent and self-reliant. They formulated their own questions, developed strategies for answering them, and implemented solutions on their own. They are very promising young researchers." — Chris Glynn, graduate student mentor

Related Projects

Social and environmental contexts are increasingly recognized as factors that affect patients' health outcomes. This team will have the opportunity to collaborate directly with clinicians and work with medical data in a real-world setting. They will examine the association between social determinants and risk prediction for hospital admissions, and assess whether social determinants bias that risk in a systematic way. Applied methods will include machine learning, risk prediction, and assessment of bias. This Data+ project is sponsored by the Forge, Duke's center for actionable data science.

Project Leads: Shelly Rusincovitch, Ricardo Henao, Azalea Kim

Project Manager: Austin Talbot

Aaron Chai (Computer Science, Math) and Victoria Worsham (Economics, Math) spent ten weeks building tools to understand the characteristics of successful oil and gas licenses in the North Sea. The team used data-scraping, merging, and OCR methods to create a dataset containing license information and work obligations, and they also produced ArcGIS visualizations of license and well locations. They had the chance to consult frequently with analytics professionals at ExxonMobil.

Click here to read the Executive Summary

Project Lead: Kyle Bradbury

Project Manager: Artem Streltsov

Yueru Li (Math) and Jiacheng Fan (Economics, Finance) spent ten weeks investigating abnormal behavior by companies bidding for oil and gas rights in the Gulf of Mexico. Working with data provided by the Bureau of Ocean Energy Management and ExxonMobil, the team used outlier detection methods to automate the flagging of abnormal behavior, and then used statistical methods to examine various factors that might predict such behavior. They had the chance to consult frequently with analytics professionals at ExxonMobil.

Click here to read the Executive Summary

Project Lead: Kyle Bradbury

Project Manager: Hyeongyul Roh