| Title: | Cherry Blossom Run Race Results | 
| Version: | 0.1.0 | 
| Description: | Race results of the Cherry Blossom Run, which is an annual road race that takes place in Washington, DC. | 
| License: | GPL-3 | 
| Suggests: | ggplot2, testthat | 
| Encoding: | UTF-8 | 
| LazyData: | true | 
| RoxygenNote: | 7.1.0 | 
| URL: | https://github.com/OpenIntroStat/cherryblossom | 
| BugReports: | https://github.com/OpenIntroStat/cherryblossom/issues | 
| Depends: | R (≥ 2.10) | 
| NeedsCompilation: | no | 
| Packaged: | 2020-06-20 11:35:35 UTC; mine | 
| Author: | Mine Çetinkaya-Rundel | 
| Maintainer: | Mine Çetinkaya-Rundel <cetinkaya.mine@gmail.com> | 
| Repository: | CRAN | 
| Date/Publication: | 2020-06-25 10:10:02 UTC | 
cherryblossom: Cherry Blossom Run Race Results
Description
Race results of the Cherry Blossom Run, which is an annual road race that takes place in Washington, DC.
Author(s)
Maintainer: Mine Çetinkaya-Rundel cetinkaya.mine@gmail.com (ORCID)
See Also
Useful links:
- Report bugs at https://github.com/OpenIntroStat/cherryblossom/issues 
Cherry Blossom Run data, 2009
Description
Details for all 14,974 runners in the 2009 Cherry Blossom Run, which is an annual road race that takes place in Washington, DC.
Usage
run09
Format
A data frame with 14,974 observations on the following 14 variables.
- place
- Finishing position. Separate positions are provided for each gender. 
- time
- The total run time. 
- net_time
- The run time from the start line to the finish line. 
- pace
- Average time per mile, in minutes. 
- age
- Age. 
- gender
- Gender. 
- first
- First name. 
- last
- Last name. 
- city
- Hometown city. 
- state
- Hometown state. 
- country
- Hometown country. 
- div
- Running division (age group). 
- div_place
- Division place, also broken up by gender. 
- div_tot
- Total number of people in the division (again, also split by gender). 
Source
Examples
library(ggplot2)
# Finishing times by gender
ggplot(run09, aes(x = time, y = gender)) +
  geom_boxplot() +
  labs(
    title = "Finishing times for 2009 Cherry Blossom Run, by gender",
    x = "Time to complete the race, in minutes",
    y = "Gender"
    )
# Pacing times by gender
ggplot(run09, aes(x = pace, y = gender)) +
  geom_boxplot() +
  labs(
    title = "Pacing for 2009 Cherry Blossom Run, by gender",
    x = "Average time per mile, in minutes",
    y = "Gender"
    )
Cherry Blossom Run data, 2012
Description
Details for all 16,924 runners in the 2012 Cherry Blossom Run, which is an annual road race that takes place in Washington, DC.
Usage
run12
Format
A data frame with 16,924 observations on the following 9 variables.
- place
- Finishing position. Separate positions are provided for each gender. 
- time
- The total run time,, in minutes. 
- pace
- Average time per mile, in minutes. 
- age
- Age. 
- gender
- Gender. 
- location
- Hometown city. 
- state
- Hometown state (if from the US) or country. 
- div_place
- Division place, also broken up by gender. 
- div_tot
- Total number of people in the division (again, also split by gender). 
Source
Examples
library(ggplot2)
# Finishing times
ggplot(run12, aes(x = time)) +
  geom_histogram(binwidth = 5) +
  labs(
    title = "Finishing times for 2012 Cherry Blossom Run,",
    x = "Time to complete the race, in minutes",
    y = "Frequency"
    )
# Pacing
ggplot(run12, aes(x = pace)) +
  geom_histogram(binwidth = 0.5) +
  labs(
    title = "Pacing for 2012 Cherry Blossom Run",
    x = "Average time per mile, in minutes",
    y = "Frequency"
    )
Cherry Blossom Run data, 2017
Description
Details for all 19,961 runners in the 2017 Cherry Blossom Run, which is an annual road race that takes place in Washington, DC. Most runners participate in a 10-mile run while a smaller fraction take part in a 5k run or walk.
Usage
run17
Format
A data frame with 19,961 observations on the following 9 variables.
- bib
- Number on the runner's bib. 
- name
- Name of the runner, with only the initial of their last name. 
- sex
- Gender of the runner. 
- age
- Age of the runner. 
- city
- Home city of the runner. 
- net_sec
- Time to complete the race, after accounting for the staggered starting time, in seconds. 
- clock_sec
- Time to complete the race, ignoring the staggered starting time, in seconds. 
- pace_sec
- Average time per mile, in seconds. 
- event
- The event the racer participated in, either the - "10 Mile"race or the- "5K".
Details
There was a time limit where all 10 Mile racers had to finish by. Can you figure out what that time is?
Source
Examples
library(ggplot2)
# Finishing times
ggplot(run17, aes(x = net_sec)) +
  geom_histogram(binwidth = 300) +
  facet_wrap(~event, nrow = 2) +
  labs(
    title = "Finishing times for 2017 Cherry Blossom Run, by event",
    subititle = "After accounting for the staggered starting time",
    x = "Time to complete the race, in seconds",
    y = "Frequency"
    )
# Pacing
ggplot(run17, aes(x = pace_sec)) +
  geom_histogram(binwidth = 100) +
  facet_wrap(~event, nrow = 2, scales = "free_y") +
  labs(
    title = "Pacing for 2017 Cherry Blossom Run, by event",
    x = "Average time per mile, in seconds",
    y = "Frequency"
    )