These slides are viewed best by Chrome or Firefox and occasionally need to be refreshed if elements did not load properly. See here for the PDF .
Press the right arrow to progress to the next slide!
Lecturer: Emi Tanaka
Department of Econometrics and Business Statistics
ETC5512.Clayton-x@monash.edu
Week 1
Starts with a question
Now that we have a question ...
Population
Population
Population
Parameters:
Green: 6/53=0.113, Yellow: 22/53=0.416,
Red: 25/53=0.472
The statistics:
Green: 1/21=0.048, Yellow: 10/21=0.476,
Red: 10/21=0.476, are estimates of parameters
Collecting data on the entire population is normally too expensive or infeasible!
Collecting data on the entire population is normally too expensive or infeasible!
Collecting data on the entire population is normally too expensive or infeasible!
Collecting data on the entire population is normally too expensive or infeasible!
Collecting data on the entire population is normally too expensive or infeasible!
We therefore collect data only on a subset of the population.
How should we sample the population? There are many sampling schemes.
Simple random sampling
Every unit in the population has the same sample probability to be drawn.
Collecting data on the entire population is normally too expensive or infeasible!
We therefore collect data only on a subset of the population.
How should we sample the population? There are many sampling schemes.
Simple random sampling
Every unit in the population has the same sample probability to be drawn.
Stratified random sampling
Units are drawn from non-overlapping sub-populations.
The goal of a sampling scheme is to get accurate information from the sample in order to answer your question.
The goal of a sampling scheme is to get accurate information from the sample in order to answer your question.
Sampling strategies combine knowledge about the population with statistical methods.
For example,
Sampling strategies combine knowledge about the population with statistical methods.
For example,
What might go wrong with a simple random sampling of 10 creatures from this population?
If I survey every 10th household in a street, is that a random selection?
If I survey every 10th household in a street, is that a random selection?
What do you think can go wrong if we don't sample randomly?
Units ideally are sampled randomly, but more than often selections are non-random.
If I survey every 10th household in a street, is that a random selection?
What do you think can go wrong if we don't sample randomly?
What's wrong with these examples?
Units ideally are sampled randomly, but more than often selections are non-random.
If I survey every 10th household in a street, is that a random selection?
What do you think can go wrong if we don't sample randomly?
What's wrong with these examples?
Making an appropriate sampling design is hard.
You may introduce intentional data structures, e.g.
You may have unintended or unknown structures in the data, e.g. confounded variables.
Making an appropriate sampling design is hard.
You may introduce intentional data structures, e.g.
You may have unintended or unknown structures in the data, e.g. confounded variables.
It's further complicated by:
Examples:
Examples:
Experimental units are recipients of the allocated treatment such that no sub-division of it can receive another treatment independently.
Experimental units are recipients of the allocated treatment such that no sub-division of it can receive another treatment independently.
What are the experimental units?
Experimental units are recipients of the allocated treatment such that no sub-division of it can receive another treatment independently.
What are the experimental units? It's the classes.
Observational units are units that you measure the response on.
Carrying on from the previous example...
What are the observational units?
Observational units are units that you measure the response on.
Carrying on from the previous example...
What are the observational units? It's the students.
Observational units are units that you measure the response on.
Carrying on from the previous example...
What are the observational units? It's the students.
Source: Gilmour et al. (1997) Accounting for natural and extraneous variation in the analysis of field experiments. Journal of Agric Biol Env Statistics, 2, 269-293.
Source: Gilmour et al. (1997) Accounting for natural and extraneous variation in the analysis of field experiments. Journal of Agric Biol Env Statistics, 2, 269-293.
VF655
, TINCURRIN
and WW1477
have a replication of 6, the remaining 104 varieties each have a replication of 3. VF655
, TINCURRIN
and WW1477
have a replication of 6, the remaining 104 varieties each have a replication of 3. VF655
, TINCURRIN
and WW1477
have a replication of 6, the remaining 104 varieties each have a replication of 3. Carrying on from the teaching example...
What are the replications of each treatment?
Carrying on from the teaching example...
What are the replications of each treatment? It's 5.
Carrying on from the teaching example...
What are the replications of each treatment? It's 5.
The treament of repetition as replication in the analysis is referred to as pseudo-replication.
Blocks are used to group the experimental units into alike units.
Blocks are used to group the experimental units into alike units.
Blocks are used to group the experimental units into alike units.
You can form blocks from:
Source: Freedman, Pisani & Purves (2010) Statistics. 4th edition
Vaccinate all grade 2 children whose parents would consent, leaving children in grades 1 and 3 as controls.
Vaccinate all grade 2 children whose parents would consent, leaving children in grades 1 and 3 as controls.
Group | Participants | Rate |
---|---|---|
Vaccinated (Grade 2) | 221,998 | 25 |
Control (Grade 1 & 3) | 725,173 | 54 |
Not Vaccination (Grade 2, no consent) |
123,605 | 44 |
Incomplete Vaccination (Grade 2, incomplete) |
9,904 | 40 |
Group | Participants | Rate |
---|---|---|
Vaccinated | 200,745 | 28 |
Placebo | 201,229 | 71 |
Not Vaccination (no consent) |
338,778 | 46 |
Incomplete Vaccination | 8,484 | 24 |
Group | Participants | Rate |
---|---|---|
Vaccinated (Grade 2) | 221,998 | 25 |
Control (Grade 1 & 3) | 725,173 | 54 |
Not Vaccination (Grade 2, no consent) |
123,605 | 44 |
Incomplete Vaccination (Grade 2, incomplete) |
9,904 | 40 |
Group | Participants | Rate |
---|---|---|
Vaccinated | 200,745 | 28 |
Placebo | 201,229 | 71 |
Not Vaccination (no consent) |
338,778 | 46 |
Incomplete Vaccination | 8,484 | 24 |
Both the not vaccinated (no consent) and placebo/control group did not receive the treatment but why is the rate of polio cases less in the not vaccinated (no consent) group?
Basically, designing and running experiments are hard.
The Academic Performance Index is computed for all California schools based on standardised testing of students. The data sets contain information and characteristics for 100 schools.
The Academic Performance Index is computed for all California schools based on standardised testing of students. The data sets contain information and characteristics for 100 schools.
Observational
The response is the length of odontoblasts in 60 guinea pigs. Each animal received one of three dose levels of vitamin C by one of two delivery methods by the technician.
The response is the length of odontoblasts in 60 guinea pigs. Each animal received one of three dose levels of vitamin C by one of two delivery methods by the technician.
Experimental
Can people really tell the difference between different flavours associated with the color of the skittles? You blind your friends so they can't see the color and collect data on their guess after giving them one skittle at a time.
Can people really tell the difference between different flavours associated with the color of the skittles? You blind your friends so they can't see the color and collect data on their guess after giving them one skittle at a time.
Experimental
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Lecturer: Emi Tanaka
Department of Econometrics and Business Statistics
ETC5512.Clayton-x@monash.edu
Week 1
Lecturer: Emi Tanaka
Department of Econometrics and Business Statistics
ETC5512.Clayton-x@monash.edu
Week 1
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |