Baboon Reproduction: Linking Sexual Swellings, Estrogen, and Mating Behavior

Project Summary

Female baboons occasionally exhibit large swellings on their behinds. Although these ‘sexual swellings’ may evoke disgust from human on-lookers, they provide important information to group members about a female’s reproductive state. To figure out what these sexual swellings mean and whether male baboons notice, we need to look at the data.  

Themes and Categories

This data expedition explores the relationship between female baboon sexual swellings, female estrogen concentrations, and male mating success. The expedition uses long-term data collected on wild baboons by the Amboseli Baboon Research Project. After learning background information about baboon social lives and reproduction, students generate testable predictions for two hypotheses about baboon reproduction. Students then learn how to use the popular R packages dplyr and ggplot2 to calculate descriptive statistics about the dataset. Finally, students perform data visualization to understand and explore patterns in animal mating behavior and sexual signals.

Learning objectives

  • Learn basics of exploratory data analysis (descriptive statistics, generating plots) in R
  • Learn basics of popular R packages dplyr and ggplot2
  • Increase understanding of association between hormones and mating behavior
  • Increase science literary skills (e.g. generating predictions, interpreting results)

Course Materials


The lesson started with a brief Powerpoint presentation to introduce the class to basic information on baboon sociality and reproduction. At the end of the Powerpoint, students were introduced to 2 key hypotheses about baboon reproduction that they then explored using R. The class was divided into small groups where students worked together to propose possible predictions to test these hypotheses (and filled out the first section of the provided worksheet) before seeing the provided predictions.

Students then worked through the provided R script and accompanying dataset to test these predictions. Students with prior experience with R were able to skip ahead by following instructions on the R script, while most of the class worked through the script step-by-step with guidance from instructors. The course instructors walked the students through most of the script, then let students work independently to complete Data Visualization Part 2. Students filled out the worksheet as they went along.

At the end of the R script, students ultimately replicated Figure 1 from Gesquiere et al. 2007. This figure is included as the final slide of the Powerpoint presentation. The end of the class session was used to interpret the figure and discuss how it relates to the 2 project hypotheses.

Student level

This lesson is designed for undergraduate students who have little to no exposure to R or other programming software. It could be easily adjusted for students who are familiar with R or other programming software. This lesson takes about 75 minutes to complete.

The dataset and the Amboseli Baboon Research Project

The dataset for this expedition is a subset of the long-term database of the Amboseli Baboon Research Project, a project co-directed by Drs. Jeanne Altmann, Susan Alberts, Beth Archie, and Jenny Tung. The Amboseli Baboon Research Project has collected demographic, behavioral, genetic, and endocrinological data on a population of wild baboons since 1971 in order to study questions related to animal behavior, life history, behavioral ecology, genetics, and physiology. The project’s database is managed by Jake Gordon at Duke University and Niki Learn at Princeton University.

The unit of analysis for this dataset is a fecal estrogen sample from a cycling1 female. For each fecal sample (n = 843), 6 variables are recorded:

  1. female - identity of the female baboon
  2. cyle_day - day of her reproductive cycle
  3. estrogen - fecal estrogen concentration
  4. swelling_size - sexual swelling size2
  5. alpha_consort - whether or not the female consorted3 with an alpha male4 on that day
  6. nonalpha_consort - whether or not the female consorted with a non-alpha male on that day

This dataset includes data from 93 female baboons, with approximately 10 fecal estrogen samples per female. Minor differences between this dataset and the the dataset used in Gesquiere et al. 2007 are due to small, incremental changes in the database over time.


1 cycling: sexually mature but not pregnant or lactating
2 female yellow baboons exhibit exaggerated sexual swellings (an enlargement/engorgement of the genital and perineal skin) around ovulation
3 consortship: a period in which a male mate-guards a female. Virtually all matings and conceptions occur during consorts

Graduate students: Ali Galezo and Emily Levy
Course: Mechanisms of Animal Behavior (BIOLOGY 268D)


Related Projects

In this two-day, virtual data expedition project, students were introduced to the APIM in the context of stress proliferation, linked lives, the spousal relationship, and mental and physical health outcomes.

Stress proliferation is a concept within the stress process paradigm that explains how one person’s stressors can influence others (Thoits 2010). Combining this with the life course principle of linked lives explains that because people are embedded in social networks, stress not only can impact the individual but can also proliferate to people close to them (Elder Jr, Shanahan and Jennings 2015). For example, one spouse’s chronic health condition may lead to stress-provoking strain in the marital relationship, eventually spilling over to affect the other spouse’s mental health. Additionally, because partners share an environment, experiences, and resources (e.g., money and information), as well as exert social control over each other, they can monitor and influence each other’s health and health behaviors. This often leads to health concordance within couples; in other words, because individuals within the couple influence each other’s health and well-being, their health tends to become more similar or more alike (Kiecolt-Glaser and Wilson 2017, Polenick, Renn and Birditt 2018). Thus, a spouse’s current health condition may influence their partner’s future health and spouses may contemporaneously exhibit similar health conditions or behaviors.

However, how spouses influence each other may be patterned by the gender of the spouse with the health condition or exhibiting the health behaviors. Recent evidence suggests that a wife’s health condition may have little influence on her husband’s future health conditions, but that a husband’s health condition will most likely influence his wife’s future health (Kiecolt-Glaser and Wilson 2017).

Nationally there is a disproportionate number of children of color (African American & Latino) in the child welfare system. Durham County is no different. However, reviewing this problem through the lens of data has not been done to formulate or implement possible solutions. Durham County Department of Social Services Child & Family Services would like to evaluate systems to identify where and how disproportionality and disparity are occurring. It is occurring at the entry point of Reporting child abuse and neglect? Is it occurring at the case decision? Is our reunification time different for African American children? Or Does it take longer for a child of color to achieve permanence through adoption? Organizing the data to show us our “hot spots” would facilitate further discussion and focus on solutions to an age-old systemic problem.

Faculty Lead: Greg Herschlag

Project Lead: Jovetta L Whitfield

A team of students led by researchers at Duke University and UC Davis will visualize data on child and family health from Yolo County, California. Data varies from single words or numbers per variable (e.g. gender, age) to more complex (e.g. crime and violence, social integration, housing/homeless impact). The visualization dashboard will be used by academic researchers and community service providers in addition to Yolo County community members. The overall goal of the research is to reduce health disparities through strengthening academic-community partnerships.

Project Lead: Leigh Ann Simmons (UC Davis)