Exploring lemur olfactory communication via statistical analyses in R

Project Summary

Questions asked: Do males and females scent mark equally? Do lemurs scent mark equally in breeding and non-breeding seasons?

Themes and Categories
Year
Contact
Paul Bendich
bendich@math.duke.edu

Graduate students: Lydia Greene and Kendra Smyth

Faculty instructor: Julie Teichroeb

Course: EVANTH 246: Sociobiology

Data set: The frequency of scent-marking behavior in the Coquerel’s sifaka

Dependent variable: scent-marking frequency

Potential explanatory variables: sex, season, age, group size, free ranging, amount of time observed, individual identity

  • Step 1: Visualizing data and testing for normalcy (histograms, dotcharts, box plots, Shapiro test)
  • Step 2: Choosing an appropriate distribution and test
  • Step 3: Applying the test in R (Wilcoxon tests and GLMMs)
  • Step 4: Interpreting results

Model <- glmmadmb(Scentmark ~ Sex + Season + Group.size + Age + FR + (1|Individual) + offset(log(Obs..Time)), data=data, family=”nbinom”, zeroInflation=TRUE)

Related Projects

This data expedition explores the local (ego) patent citation networks of three hybrid vehicle-related patents. The concept of patent citations and technological development is a core theme in innovation and entrepreneurship, and the purpose of these network explorations is to both quantitatively and visually assess how innovations are connected and what these connections mean for the focal innovations and the technologies that draw on those patents in the future. The expedition was incorporated as part of the Sociology of Entrepreneurship class, where students are thinking about the emergence and diffusion of innovations.

Large publicly available environmental databases are a tremendous resource for both scientists and the general public interested in climate trends and properties. However, without the programming skills to parse and interpret these massive datasets, significant trends may remain hidden from both scientists and the public. In this data exploration, students, over the course of three hours, accessed two large, publicly available datasets, each with greater than 4 million observations. They learned how to use R and RStudio to effectively organize, visualize and statistically explore trends in deep sea physical oceanography.  

Our aim was to introduce students to the wealth of possibilities that human genotyping and sequencing hold by illustrating firsthand the power of these datasets to identify genetic relatives, using the story of the Golden State Killer’s capture with public genetic databases.