Type: | Package |
Title: | Data Sets for "Mathematical Statistics with Resampling and R" (3rd Ed) |
Version: | 1.0 |
Author: | Laura Chihara [aut], Tim Hesterberg [aut, cre] |
Date: | 2022-09-01 |
Maintainer: | Tim Hesterberg <timhesterberg@gmail.com> |
Description: | Data sets for Chihara and Hesterberg (2022, ISBN: 978-1-119-87404-1) "Mathematical Statistics with Resampling and R" (3rd Ed). |
Depends: | R (≥ 4.2.0) |
License: | CC0 |
URL: | https://github.com/lchihara/MathStatsResamplingR |
LazyData: | true |
NeedsCompilation: | no |
Packaged: | 2022-09-01 15:58:40 UTC; timhesterberg |
Repository: | CRAN |
Date/Publication: | 2022-09-02 08:10:02 UTC |
Data Sets for "Mathematical Statistics with Resampling and R" (3rd Ed)
Description
Data sets for Chihara and Hesterberg (2022, ISBN: 978-1-119-87404-1) "Mathematical Statistics with Resampling and R" (3rd Ed). https://github.com/lchihara/MathStatsResamplingR
Examples
# For a list of datasets do:
library(help = resampledata3)
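Since the package declares LazyData: true, a single data set can also be inspected directly. The following is a minimal sketch, assuming resampledata3 is installed; the guard and the names have_pkg and beerwings are illustrative choices, not part of the package.

```r
# Sketch: inspect one data set if the package is available.
# Assumes resampledata3 is installed; otherwise the block is skipped.
have_pkg <- requireNamespace("resampledata3", quietly = TRUE)
if (have_pkg) {
  beerwings <- resampledata3::Beerwings  # lazy-loaded data frame
  str(beerwings)                         # variable names and types
  print(head(beerwings))                 # first few rows
}
```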
Alcohol content and calories of beers
Description
Alcohol content and calories for a sample of ale and lager beers.
Usage
Alelager
Format
A data frame with 31 observations on the following 4 variables.
ID
Subject ID
Type
Beer: ale or lager
Alcohol
Percentage alcohol content
Calories
Number of calories
Arsenic levels of wells in Bangladesh
Description
Levels of arsenic, chlorine and cobalt in a sample of 271 wells in Bangladesh.
Usage
Bangladesh
Format
A data frame with 271 observations on the following 3 variables.
Arsenic
Arsenic level, ppb
Chlorine
Chlorine level, ppb
Cobalt
Cobalt level, ppb
Source
https://www2.bgs.ac.uk/groundwater/health/arsenic/Bangladesh/data.html
References
Reproduced with the permission of the British Geological Survey, copyright UKRI. All Rights Reserved.
Beer and hotwings consumption
Description
Beer and hotwings consumption by a sample of patrons at a Minneapolis bar.
Usage
Beerwings
Format
A data frame with 30 observations on the following 4 variables.
ID
Subject ID
Hotwings
Number of hotwings consumed
Beer
Ounces of beer consumed
Gender
Gender of patron (M/F)
Source
Data collected by Nicole Catchpole in 2004 (private communication).
Price of textbooks at a college bookstore
Description
Price of textbooks at a college bookstore.
Usage
BookPrices
Format
A data frame with 44 observations on the following 3 variables.
Subject
Subject: Biology, Chemistry, Computer Science, Economics, Educational Studies, Geology, Mathematics, Physics, Political Science, Psychology, SOAN
Area
Classification of subject as either Math & Science or Social Sciences
Price
Price in U.S. dollars
Source
Data collected by R. Hien and S. Becker in 2010 (private communication).
Fish supply and demand for bushmeat in Ghana
Description
Fish supply (kg) and demand for bushmeat in Ghana.
Usage
Bushmeat
Format
A data frame with 30 observations on the following 4 variables.
Fish
Fish supply (in kg) per capita
Biomass
Biomass
Year
Year
Change
Percent change in biomass
Details
Biomass of large mammals was calculated for each year by multiplying the number of animals observed in 700 walking counts of 10 to 15 km each by species-specific body weights. The products of these calculations were then summed across all species.
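The calculation described above can be sketched for a single year as follows. The species names, counts, and body weights are invented for illustration and are not taken from the Bushmeat data or from the cited study.

```r
# Hypothetical walking-count totals for one year (invented values).
counts <- c(duiker = 120, buffalo = 8, warthog = 35)            # animals observed
body_weight_kg <- c(duiker = 15, buffalo = 600, warthog = 70)   # assumed species weights

# Multiply each species' count by its body weight, then sum across species.
biomass_by_species <- counts * body_weight_kg[names(counts)]
total_biomass <- sum(biomass_by_species)
total_biomass
```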
Source
Brashares, Arcese, Sam, Coppolillo, Sinclair, Balmford (2004). Bushmeat hunting, wildlife declines, and fish supply in West Africa. Science, 12 November 2004.
Cafeteria
Description
Nutritional data on meals served in a college cafeteria.
Usage
Cafeteria
Format
A data frame with 41 observations on the following 9 variables.
ID
a numeric vector
Type
type of meal, Meat or Vegetarian
Calories
number of calories
Carbohydrates
number of carbohydrates
Fiber
fiber content
Fat
fat content
Cholesterol
cholesterol
Protein
protein
Sodium
sodium
Source
Stephenson (private communication).
Cereals
Description
Nutritional data on a sample of cereals.
Usage
Cereals
Format
A data frame with 43 observations on the following 5 variables.
ID
a numeric vector
Age
target consumer, adult or children
Shelf
location of cereal, bottom, middle, or top shelf
Sodiumgram
sodium content in grams
Proteingram
protein content in grams
protein content in grams
Challenger
Description
Data on O-rings in 23 space shuttle flights prior to the Challenger shuttle disaster of January 1986.
Usage
Challenger
Format
A data frame with 23 observations on the following 3 variables.
Date
Date of launch
Temperature
Air temperature at launch (F)
Incident
Binary variable, 1 if one of the O-rings on one of the booster rockets was damaged, 0 otherwise
Source
https://archive.ics.uci.edu/ml/datasets/Challenger+USA+Space+Shuttle+O-Ring
References
Dalal, S. R., Fowlkes, E. B., Hoadley, B. (1989). Risk analysis of the space shuttle: pre-Challenger prediction of failure. Journal of the American Statistical Association, 84, 945-957.
ChiMarathonMen
Description
Times from a sample of men who completed the Chicago marathon in 2015.
Usage
data("ChiMarathonMen")
Format
A data frame with 80 observations on the following 4 variables.
name
Name of competitor
Division
Age group
Finish
Finish time
FinishMin
Time in minutes
Source
https://chicago-history.r.mikatiming.com/
Cuckoos
Description
Female cuckoos lay their eggs on the ground and then move them to the nests of other birds. Latter gathered data on the lengths of the cuckoo eggs found in these foster nests.
Usage
data("Cuckoos")
Format
A data frame with 120 observations on the following 2 variables.
Eggs
Lengths of eggs (mm) of cuckoos
Bird
Species of birds: HedgeSparrow, MeadowPipit, PiedWagtail, Robin, TreePipit, Wren
Source
Tippett, L. H. C. (1952). The Methods of Statistics, 4th Edition. Wiley.
References
Latter, O. (1902). An enquiry into the dimensions of the Cuckoo's egg and the relation of the variations to the size of eggs of the foster-parent, with notes on coloration. Biometrika 1 (2): 164-176.
Diving 2017
Description
Scores of 12 female divers (10 m platform) in the 2017 FINA World Championships.
Usage
data("Diving2017")
Format
A data frame with 12 observations on the following 4 variables.
Name
Name of competitor
Country
Country
Semifinal
Score in the semi-finals
Final
Score in the finals.
Details
Competitors perform 5 dives in each round and the sum of these 5 dives determines who moves on to the next round.
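As a small illustration of this scoring rule, the sketch below sums five dive scores per competitor. The divers and scores are hypothetical and do not come from Diving2017.

```r
# Hypothetical scores for five dives by two divers (invented values).
dives <- rbind(
  diver_a = c(72.0, 68.4, 81.6, 76.8, 79.2),
  diver_b = c(64.5, 70.2, 75.0, 66.0, 83.2)
)

# The round score is the sum of the five dives.
round_score <- rowSums(dives)
round_score

# Competitor with the highest total in this round.
names(which.max(round_score))
```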
Source
https://www.fina.org/competitions/213/17th-fina-world-championships-2017/results?disciplines=DV
Eyes
Description
Measurements of eyes of 40 people.
Usage
data("Eyes")
Format
A data frame with 40 observations on the following 6 variables.
ID
Subject ID
age
Age of subject
hand
Dominant hand of subject, left or right
eye
Dominant eye of subject, left or right
leftPD
Left pupillary distance (mm)
rightPD
Right pupillary distance (mm)
Source
Westfield (private communication).
Fatalities
Description
A random sample of driver fatalities in 2009 in Pennsylvania.
Usage
Fatalities
Format
A data frame with 100 observations on the following 3 variables.
ID
Subject ID
Alcohol
Alcohol involved? 1 = yes, 0 = no
Age
Age
Details
The drivers were driving a car, SUV, or light pickup truck (vehicles such as motor homes, convertibles, or commercial vehicles are excluded).
Source
http://www.nhtsa.gov/FARS
Mercury content in a sample of fish in Minnesota
Description
Mercury levels (ppm) in a sample of fish caught in Minnesota
Usage
FishMercury
Format
A data frame with 30 observations on the following variable.
Mercury
Mercury level in ppm
Source
Minnesota Pollution Control Agency.
Length of delays of airline flights
Description
Length of delays for flights on American Airlines and United Airlines in 2009
Usage
data("FlightDelays")
Format
A data frame with 4029 observations on the following 10 variables.
ID
Subject ID
Carrier
Airline: American Airlines (AA) or United Airlines (UA)
FlightNo
Flight number
Destination
Destination: BNA, DEN, DFW, IAD, MIA, ORD, STL
DepartTime
Departure time: 4-8am, 4-8pm, 8-Mid, 8-Noon, Noon-4pm
Day
Day of week
Month
Month: May or June
FlightLength
Length of flight
Delay
Delay time (in minutes)
Delayed30
Delayed more than 30 minutes? No or Yes
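The relationship between Delay and Delayed30 can be sketched as below. This assumes Delayed30 is a strict "greater than 30" threshold on Delay, which the manual implies but does not state as a formula; the delay values are invented.

```r
# Hypothetical delay times in minutes (negative means an early departure).
delay <- c(-5, 12, 31, 30, 95)

# Assumed rule: "Yes" only when the delay exceeds 30 minutes.
delayed30 <- ifelse(delay > 30, "Yes", "No")
table(delayed30)
```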
Details
All departures of AA or UA flights from LaGuardia Airport in May or June of 2009.
Source
https://www.bts.gov/topics/airlines-and-airports/quick-links-popular-air-carrier-statistics
General Social Survey 2018
Description
General Social Survey data from 2018
Usage
GSS2018
Format
A data frame with 2348 observations on the following 17 variables.
ID
Subject ID
Region
Region: Midwest, Northeast, South, West
GenderNow
Gender of subject: A gender not listed here, Man, Not applicable, Transgender, Woman
Age
Age
Marital
Marital status: Divorced, Married, Never married, Separated, Widowed
Degree
Education: Bachelor, Graduate, High school, Junior college, Less than high school
Employed
Employed? No or Yes
Income
Income level
Polviews
Political views: Conservative, Extremely liberal, Extremely conservative, Liberal, Moderate, Slightly conservative, Slightly liberal
Pres16
Voted for whom in presidential election of 2016? Clinton, Other, Trump
DeathPenalty
Opinion on death penalty: Favor, Oppose
Courts
How courts deal with criminals: About right, Dont know, Not harsh enough, Too harsh
Attend
Attendance at religious services: Monthly, Never, Occasionally, Weekly
Postlife
Believe in life after death? Dont know, No, Yes
Happy
General happiness level: Not too happy, Pretty happy, Very happy
Satfin
Satisfaction with financial situation: More or less, Not at all, Satisfied
Energy
Government spending on developing alternative energy sources: About right, Dont know, Too little, Too much
Source
https://gss.norc.org
Births of girls in Alaska or Wyoming
Description
Data on births of a random sample of girls in Alaska or Wyoming in 2004.
Usage
data("Girls2004")
Format
A data frame with 80 observations on the following 6 variables.
ID
Subject ID
State
State: AK or WY
MothersAge
Age of mother: 15-19, 20-24, 25-29, 30-34, 35-39, 40-44
Smoker
Mother a smoker? No or Yes
Weight
Weight of baby (grams)
Gestation
Gestation time (weeks)
Source
http://wonder.cdc.gov/natality-current.html
Groceries
Description
Prices of a sample of grocery items at Target or Walmart.
Usage
Groceries
Format
A data frame with 30 observations on the following 4 variables.
Product
Grocery item
Size
Package size
Target
Price at Target
Walmart
Price at Walmart
Birth weight of boys born in Illinois
Description
Birth weight of boys born in Illinois.
Usage
ILBoys
Format
A data frame with 241 observations on the following 2 variables.
MothersAge
Age range of mother: 15-19, 20-24, 25-29
Weight
Weight of baby (gm)
Details
Random sample of boys born to mothers in Illinois in 2004. Births are restricted to single births only and gestation lengths of at least 37 weeks.
Ice Cream
Description
Nutritional information on a sample of ice cream.
Usage
data("IceCream")
Format
A data frame with 39 observations on the following 7 variables.
Brand
Brand of ice cream
VanillaCalories
Calories in vanilla
VanillaFat
Fat (gm) in vanilla ice cream
VanillaSugar
Sugar (gm) in vanilla ice cream
ChocolateCalories
Calories in chocolate ice cream
ChocolateFat
Fat (gm) in chocolate ice cream
ChocolateSugar
Sugar (gm) in chocolate ice cream
Illiteracy
Description
Data on female illiteracy in a sample of countries where illiteracy is more than 5%.
Usage
Illiteracy
Format
A data frame with 94 observations on the following 4 variables.
ID
Country ID
Country
Name of country
Illit
Percentage of women over 15 years old who are illiterate (2003)
Births
Number of births per woman in that country (2005)
Source
www.unesco.org, www.data.worldbank.org
Lottery
Description
Winning lottery numbers for Fantasy 5 in California.
Usage
Lottery
Format
A data frame with 500 observations on the following variable.
Win
Number
Details
In Fantasy 5, a lottery game in California, a player tries to match 5 numbers chosen from 1 through 39. These data are the winning numbers for the daily games from 5 May 2010 through 15 August 2010.
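To make the game concrete, the sketch below simulates one draw and counts the possible outcomes. It assumes the five numbers are distinct and that order does not matter, which the description above does not spell out; the seed is arbitrary.

```r
set.seed(1)  # arbitrary seed, for reproducibility only

# One simulated draw: five distinct numbers from 1 through 39 (assumed rule).
draw <- sort(sample(1:39, size = 5))
draw

# Number of possible unordered draws under that assumption.
n_combinations <- choose(39, 5)
n_combinations
```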
Source
http://www.calottery.com/play/draw-games/fantasy-5
Math Anxiety
Description
Data from a study on math anxiety in a sample of primary and secondary school students in Italy
Usage
MathAnxiety
Format
A data frame with 599 observations on the following 6 variables.
Age
Age
Gender
Gender: Boy, Girl
Grade
Grade: Secondary, Primary
AMAS
Score on Abbreviated Math Anxiety Scale
RCMAS
Score on Revised Children's Manifest Anxiety Scale
Arith
Score on arithmetic test
Source
Hill, Mammarella, Devine, et al (2016). Maths anxiety in primary and secondary school students: gender differences, developmental changes and anxiety specificity. Learning and Individual Differences 48: 45-53
Carbon dioxide levels collected by Mauna Loa Observatory
Description
Average CO2 levels (ppm) for the month of May from 1990 to 2010.
Usage
Maunaloa
Format
A data frame with 21 observations on the following 3 variables.
ID
Subject ID
Year
Year
Level
Carbon dioxide level (ppm)
Source
https://www.esrl.noaa.gov/gmd/ccgg/trends
Minnesota groundwater
Description
Measurements on water quality in wells in Minnesota.
Usage
MnGroundwater
Format
A data frame with 895 observations on the following 10 variables.
County
Minnesota county
Aquifer.Group
Type of aquifer: buried Quaternary, Cambrian, Cretaceous, Devonian, Ordovician, Precambrian, surficial Quaternary
Water.Level
Water level
Alkalinity
Alkalinity
Aluminum
Aluminum
Arsenic
Arsenic
Chloride
Chloride
Lead
Lead
pH
pH level
Basin.Name
Basin name
Source
Minnesota Pollution Control Agency
Mobile Ads
Description
Google experiment on effectiveness of certain recommendations for bidding on ads.
Usage
MobileAds
Format
A data frame with 655 observations on the following 40 variables.
Campaign
a numeric vector
m.impr_post
a numeric vector
m.impr_pre
a numeric vector
m.click_post
a numeric vector
m.click_pre
a numeric vector
m.cost_post
a numeric vector
m.cost_pre
a numeric vector
m.conv_post
a numeric vector
m.conv_pre
a numeric vector
m.value_post
a numeric vector
m.value_pre
a numeric vector
m.cpm_pre
a numeric vector
m.cpm_post
a numeric vector
m.cpc_pre
a numeric vector
m.cpc_post
a numeric vector
m.cpa_pre
a numeric vector
m.cpa_post
a numeric vector
m.cpr_pre
a numeric vector
m.cpr_post
a numeric vector
mult.change
a numeric vector
d.impr_post
a numeric vector
d.impr_pre
a numeric vector
d.click_post
a numeric vector
d.click_pre
a numeric vector
d.cost_post
a numeric vector
d.cost_pre
a numeric vector
d.conv_post
a numeric vector
d.conv_pre
a numeric vector
d.value_post
a numeric vector
d.value_pre
a numeric vector
d.cpm_pre
a numeric vector
d.cpm_post
a numeric vector
d.cpc_pre
a numeric vector
d.cpc_post
a numeric vector
d.cpa_pre
a numeric vector
d.cpa_post
a numeric vector
d.cpr_pre
a numeric vector
d.cpr_post
a numeric vector
error.cpr_pre
a numeric vector
error.cpr_post
a numeric vector
Details
Subset of experimental data for one advertiser. See Chihara and Hesterberg textbook for more information.
Source
Ed Lee (Google)
References
Chihara and Hesterberg, Mathematical Statistics with Resampling and R (2022). Wiley.
NBA 2016-2017 season
Description
Basketball statistics for a sample of NBA players from 4 teams for the 2016-2017 season.
Usage
data("NBA1617")
Format
A data frame with 68 observations on the following 13 variables.
Name
Player name
Position
Position: C (center), PF (power forward), PG (point guard), SF (small forward), SG (shooting guard)
Team
Team: Brooklyn, Charlotte, Cleveland, San Antonio
Games
Number of games played
Minutes
Number of minutes played
PercFG
Field goal percentage
Perc3P
3-point field goal percentage
Perc2P
2-point field goal percentage
PercFT
Free throw percentage
OffReb
Offensive rebounds
DefReb
Defensive rebounds
Assists
Assists
Blocks
Blocks
Details
Players in this data set played a minimum of 100 minutes during the 2016-2017 season.
Source
https://www.basketball-reference.com/
Birth weights of babies born in North Carolina
Description
Birth weights of babies born in North Carolina in 2004
Usage
NCBirths2004
Format
A data frame with 1009 observations on the following 7 variables.
ID
Subject ID
MothersAge
Mother's age level
Smoker
Mother a smoker? No, Yes
Alcohol
Mother consumed alcohol during pregnancy? No, Yes
Gender
Baby's gender
Weight
Baby's weight (gm)
Gestation
Gestation length (weeks)
Details
Babies in this random sample had a gestation period of at least 37 weeks and were single births (that is, not one of a twin or triplet).
Source
http://wonder.cdc.gov/natality-current.html
References
Chihara and Hesterberg, Mathematical Statistics with Resampling and R, 2022 (Wiley).
Nasdaq stock data
Description
Opening and closing stock prices for a random sample of 50 stock funds on NASDAQ on 1 December 2017.
Usage
Nasdaq
Format
A data frame with 50 observations on the following 4 variables.
Symbol
Stock symbol
Open
Opening price
Close
Closing price
Volume
Number of shares traded
Source
https://finance.yahoo.com
Olympics 2012
Description
Data on a sample of athletes competing in the 2012 London Olympics.
Usage
Olympics2012
Format
A data frame with 42 observations on the following 7 variables.
Name
Name of athlete
Country
Country
Age
Age
Sex
Sex: F, M
Height
Height (inches)
Weight
Weight (lb)
Sport
Sport
Oscars
Description
Age and gender of Academy Award winners
Usage
Oscars
Format
A data frame with 188 observations on the following 6 variables.
Year
Year of award
Actor
Name of actor
Movie
Movie
Gender
Gender: Man, Woman
Birthyear
Birth year of actor
Age
Age at time of award
Source
https://www.oscars.org/
Philadelphia Phillies data 2009
Description
Baseball data for Philadelphia Phillies during the 2009 season.
Usage
Phillies2009
Format
A data frame with 162 observations on the following 8 variables.
Date
Date of game
Location
Game played where: Away, Home
Outcome
Outcome of game: Lose, Win
Outcome2
Outcome recoded: 1 = win, 0 = lose
Hits
Number of hits
Doubles
Number of doubles
Homeruns
Number of homeruns
StrikeOuts
Number of strikeouts
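The recoding of Outcome into Outcome2 can be sketched as follows. The vector below is a hypothetical stand-in for the Outcome column, not actual game results.

```r
# Hypothetical outcomes standing in for Phillies2009$Outcome.
outcome <- factor(c("Win", "Lose", "Win", "Win", "Lose"))

# Recode to a numeric indicator: 1 = win, 0 = lose.
outcome2 <- ifelse(outcome == "Win", 1, 0)

# With a 0/1 indicator, the mean is the proportion of games won.
win_proportion <- mean(outcome2)
win_proportion
```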
Source
https://www.baseball-reference.com/
Quakes
Description
Time between earthquakes for all earthquakes of magnitude 6 or greater (1970-2009).
Usage
data("Quakes")
Format
A data frame with 805 observations on the following 2 variables.
ID
Subject ID
TimeDiff
Time (days)
Source
http://earthquakes.usgs.gov/earthquakes/eqarchives
Quetzal
Description
Heights of nests and snags for the quetzal (bird).
Usage
Quetzal
Format
A data frame with 21 observations on the following 3 variables.
Country
Country: Costa Rica, Guatemala
Nest
Height of nest (meters)
Snag
Height of snag (meters)
Details
The quetzal typically nests in abandoned woodpecker nests in dead tree trunks (snags).
Source
Siegfried, D., Linville, D., Hille, D. (2010). Analysis of nest sites and the resplendent quetzal (Pharomachrus mocinno): relationship between nest and snag heights. Wilson Journal of Ornithology 122: 608-11.
Rangers and Twins baseball players (2016 season)
Description
Data on baseball players (excluding pitchers) who played for the Texas Rangers or Minnesota Twins.
Usage
data("RangersTwins2016")
Format
A data frame with 27 observations on the following 17 variables.
Name
Name of player
Team
Team: Rangers, Twins
Pos
Player's position
Age
Age in years
Games
Number of games played
AtBats
Number of at bats
Runs
Runs
Hits
Hits
Doubles
Doubles
Triples
Triples
HR
Homeruns
RBI
Runs batted in
SB
Stolen bases
CS
Caught stealing
BB
Base on balls
SO
Strike outs
BA
Batting average
Details
Data on baseball players (excluding pitchers) who played for the Texas Rangers or Minnesota Twins. These players played at least 50 games. During the 2016 season, the Rangers had the best winning percentage (0.586) in the American League while the Twins had the worst (0.364).
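For readers unfamiliar with the statistic, winning percentage is wins divided by games played. The sketch below uses win and loss counts (95-67 and 59-103) that are not given in this manual; they are assumed 2016 regular-season records, used here because they reproduce the quoted values.

```r
# Assumed 2016 regular-season records (not part of the RangersTwins2016 data).
wins   <- c(Rangers = 95, Twins = 59)
losses <- c(Rangers = 67, Twins = 103)

# Winning percentage = wins / games played, rounded to three decimals.
win_pct <- round(wins / (wins + losses), 3)
win_pct
```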
Source
www.baseball-reference.com
Recidivism
Description
Recidivism data from Iowa.
Usage
Recidivism
Format
A data frame with 17022 observations on the following 7 variables.
Gender
Gender: F, M
Age
Age group: 25-34, 35-44, 45-54, 55 and Older, Under 25
Age25
Over or Under 25 years of age? Over 25, Under 25
Offense
Type of offense: Felony, Misdemeanor
Recid
Recidivated? No, Yes
Type
Reason: New (new crime), No Recidivism (did not recidivate), Tech (technical violation, such as a parole violation)
Days
Number of days to recidivism; NA if no recidivism
Details
All offenders convicted of either a misdemeanor or felony who were released from an Iowa prison during the 2010 fiscal year ending in June.
Source
https://data.iowa.gov/Public-Safety/3-Year-Recidivism-for-Offenders-Released-from-Pris/mw8r-vqy4
Salaries of baseball players
Description
Salaries of a random sample of baseball players from 1985 and 2015.
Usage
Salaries
Format
A data frame with 70 observations on the following 3 variables.
League
League: American, National
Salary
Salary (in millions) in 2015 dollars
Year
Year: 1985 or 2015
Service times at a college snack bar.
Description
Time to be served at a college snack bar.
Usage
Service
Format
A data frame with 174 observations on the following 2 variables.
ID
Subject ID
Times
Time in minutes
Source
Haynor, Lojovich, Syed (private communication, 2010).
Skateboard experiment
Description
Measurement of testosterone levels in males in a skateboard experiment.
Usage
Skateboard
Format
A data frame with 71 observations on the following 3 variables.
Age
Age in years
Experimenter
Treatment (gender of experimenter): Female, Male
Testosterone
Testosterone level
Details
Results from an experiment where male skateboarders performed tricks in front of either a female or a male experimenter.
Source
Ronay and von Hippel (2010). The presence of an attractive woman elevates testosterone and physical risk taking in young men. Social Psychological and Personality Science 1:57-64.
Figure skating scores for men from the 2010 Winter Olympics.
Description
Short and free skate scores for male figure skaters in the 2010 Winter Olympics (Vancouver).
Usage
Skating2010
Format
A data frame with 24 observations on the following 5 variables.
Country
Country of skater
Name
Name
Short
Short program score
Free
Free skate score
Total
Total
Source
https://skatingscores.com/0910/oly/
Spruce data
Description
Measurements from an experiment on the growth of black spruce seedlings.
Usage
Spruce
Format
A data frame with 72 observations on the following 9 variables.
Tree
Subject ID
Competition
Treatment: C (competition), NC (no competition)
Fertilizer
Treatment: F (fertilizer), NF (no fertilizer)
Height0
Height of seedling at start
Height5
Height of seedling after 5 years
Diameter0
Diameter of seedling at start
Diameter5
Diameter of seedling after 5 years
Ht.change
Change in height
Di.change
Change in diameter
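The two change variables can be sketched as simple differences. This assumes Ht.change and Di.change are the year-5 value minus the starting value, which the manual does not state explicitly; the measurements in spruce_demo are invented.

```r
# Hypothetical seedlings with the same column names as Spruce (invented values).
spruce_demo <- data.frame(
  Height0   = c(15.0, 9.0, 12.0),
  Height5   = c(60.0, 20.5, 21.0),
  Diameter0 = c(1.5, 1.0, 2.0),
  Diameter5 = c(6.0, 2.5, 3.25)
)

# Assumed definition: change = value after 5 years minus value at start.
spruce_demo$Ht.change <- spruce_demo$Height5 - spruce_demo$Height0
spruce_demo$Di.change <- spruce_demo$Diameter5 - spruce_demo$Diameter0
spruce_demo
```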
Details
Experiment on growth of black spruce seedlings under two treatments: fertilizer or no fertilizer, and competition or no competition (weeding).
Source
Camill, Chihara, Adams, et al (2010). Early life history transitions and recruitment of Picea mariana in thawed boreal permafrost peatlands. Ecology 2:448-459.
Starcraft
Description
Number of wins by a sample of Korean players in Starcraft, a strategy video game.
Usage
Starcraft
Format
A data frame with 45 observations on the following 4 variables.
ID
Subject ID
Race
Chosen race of player: Protoss, Terran, Zerg
Age
Age of player
Wins
Number of wins
Source
Evans, private communication. http://www.teamliquid.net/tipd/players
TV commercial lengths
Description
Lengths of television commercials on basic and extended cable TV channels.
Usage
data("TV")
Format
A data frame with 20 observations on the following 3 variables.
ID
Subject ID
Times
Time (min)
Cable
Cable: Basic, Extended
Details
Lengths of TV commercials during any given half-hour time period.
Source
Rodgers, Robinson (private communication).
Texas birth weights
Description
Weights of babies born in Texas in 2004.
Usage
TXBirths2004
Format
A data frame with 1587 observations on the following 8 variables.
ID
Subject ID
MothersAge
Mother's age: 15-19, 20-24, 25-29, 30-34, 35-39, 40-44, under 15
Smoker
Mother smokes? No, Yes
Gender
Gender of baby: Female, Male
Weight
Weight of baby (g)
Gestation
Gestation length (weeks)
Number
Baby a single birth (1), twin (2), etc.
Multiple
Part of multiple birth (e.g., twin, triplet)? No, Yes
Details
Random sample of babies born in Texas in 2004.
Source
http://wonder.cdc.gov/natality-current.html
Titanic
Description
Subset of Titanic data
Usage
Titanic
Format
A data frame with 658 observations on the following 3 variables.
ID
Subject ID
Survived
Survival status: 1 = survived, 0 = died
Age
Age of passenger
Details
Subset of passenger data on the Titanic.
Source
https://data.world/nrippner/titanic-disaster-dataset
Turbine
Description
Average daily wind speeds (2010) from Carleton College turbine.
Usage
data("Turbine")
Format
A data frame with 168 observations on the following 4 variables.
Date2010
Date
AveKW
Average kilowatts
AveSpeed
Average speed (m/s)
Production
Energy output (kilowatt hours)
Source
Carleton College, Northfield MN.
References
Chihara and Hesterberg (2022). Mathematical Statistics with Resampling and R. (Wiley)
Verizon repair times
Description
Repair times by Verizon for its customers or customers of other telephone companies.
Usage
Verizon
Format
A data frame with 1687 observations on the following 2 variables.
Time
Repair time (h)
Group
Customer: CLEC (competing local exchange carrier), ILEC (incumbent local exchange carrier)
Details
Verizon is responsible for providing repair service to both its own customers (ILEC) and the customers of its competitors (CLEC).
References
Chihara and Hesterberg (2022). Mathematical Statistics with Resampling and R (Wiley).
Volleyball data
Description
Data on a sample of Division I women's volleyball teams.
Usage
Volleyball2009
Format
A data frame with 30 observations on the following 4 variables.
Team
Team
HitPercent
Hitting percentage
Assts
Assists
Kills
Kills
Source
http://www.ncaa.org/championships/statistics/womens-volleyball-statistics
Walleye
Description
Lengths and weights of a sample of walleye caught in Minnesota lakes (1990s).
Usage
Walleye
Format
A data frame with 60 observations on the following 2 variables.
Length
Length (inches)
Weight
Weight (pounds)
Source
Monson, Minnesota Pollution Control Agency (private communication)
Watertable
Description
Relationship between the depth of the watertable and survival status of black spruce seedlings.
Usage
Watertable
Format
A data frame with 360 observations on the following 2 variables.
Depth
Depth of watertable (cm)
Alive
Status of seedling: 1 = alive, 0 = dead
Details
Part of the data from an experiment to study factors associated with the growth of black spruce seedlings under various treatments. Status of seedling at the end of the second year of the experiment is noted here.
Source
Camill, Chihara, Adams, et al (2010). Early life history transitions and recruitment of Picea mariana in thawed boreal permafrost peatlands. Ecology 2:448-459.
References
Chihara and Hesterberg (2022). Mathematical Statistics with Resampling and R (Wiley).