The study of the collection, analysis, interpretation, presentation, and organization of data (Dodge 2006)
The study of the collection, analysis, interpretation, presentation, and organization of data (Dodge 2006)
The science of learning from data in the face of uncertainty (various)
The study of the collection, analysis, interpretation, presentation, and organization of data (Dodge 2006)
The science of learning from data in the face of uncertainty (various)
Statistics=Information+Uncertainty
Inductive reasoning
Often attributed to Francis Bacon (and others)
Consistent observations -> general principle
Problem: "confirmatory" observations can't disprove theory
Example: I've only seen birds that fly :: all birds can fly
Inductive reasoning
Often attributed to Francis Bacon (and others)
Consistent observations -> general principle
Problem: "confirmatory" observations can't disprove theory
Example: I've only seen birds that fly :: all birds can fly
Deductive reasoning
Formalized by Karl Popper
Theory -> predictions -> observations
Based on falsification
Example: All birds can fly :: penguins are birds :: penguins can fly
1) Pattern identification (i.e., exploratory studies)
1) Pattern identification (i.e., exploratory studies)
2) Hypothesis formation
1) Pattern identification (i.e., exploratory studies)
2) Hypothesis formation
3) Predictions
1) Pattern identification (i.e., exploratory studies)
2) Hypothesis formation
3) Predictions
4) Data collection
1) Pattern identification (i.e., exploratory studies)
2) Hypothesis formation
3) Predictions
4) Data collection
5) Models and testing
1) Pattern: Trees at higher elevations are shorter than at low elevations
1) Pattern: Trees at higher elevations are shorter than at low elevations
2) Hypotheses
1) Pattern: Trees at higher elevations are shorter than at low elevations
2) Hypotheses
3) Predictions
1) Pattern: Trees at higher elevations are shorter than at low elevations
2) Hypotheses
3) Predictions
4) Data collection1
5) Models1
[1] We'll get to these!
In other words, if we change x, will y change also (and by how much)?
Harder than it seems! Why?
In other words, if we change x, will y change also (and by how much)?
Harder than it seems! Why?
Generally restricted to manipulative experiments
In other words, if we change x, will y change also (and by how much)?
Harder than it seems! Why?
Generally restricted to manipulative experiments
In other words, if we change x, will y change also (and by how much)?
Harder than it seems! Why?
Generally restricted to manipulative experiments
Increasing interest in causal inference from observational studies
A collection of subjects of interest
Often, a biologically meaningful unit
Sometimes a process of interest
A collection of subjects of interest
Often, a biologically meaningful unit
Sometimes a process of interest
1) This is the population
2) There is variation within and among groups, but:
3) The hypothesis is correct (mean prevalence is lower in new vs. current variety)
A collection of subjects of interest
Often, a biologically meaningful unit
Sometimes a process of interest
A collection of subjects of interest
Often, a biologically meaningful unit
Sometimes a process of interest
A finite subset of the population of interest, i.e. the data we collect
Samples allow us to draw inferences about the population
Good samples are:
1) This is a sample
2) The sample means are our best estimates of the population means
3) But the sample means will never equal the population means (uncertainty!)
ˉy=∑ni=1yin
ˉy=∑ni=1yin
ˉy=∑ni=1yin
s2=∑ni=1(yi−ˉy)2n−1
s2=∑ni=1(yi−ˉy)2n−1
s=√s2
s2=∑ni=1(yi−ˉy)2n−1
s=√s2
Every sample has a different mean (and standard deviation) - more uncertainty!
sampling produces uncertainty
unavoidable (but that's ok!)
Doubt is not a pleasant condition, but certainty is absurd -- Voltaire
sampling produces uncertainty
unavoidable (but that's ok!)
Doubt is not a pleasant condition, but certainty is absurd -- Voltaire
the primary goal of this class is for you to understand how to make robust inferences that account for uncertainty (and the limitations of those inferences)
we will return to this basic concept (sampling error) many times this semester
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |