I.
Introduction
A.
We must identify the population we want to study, then figure
out how to select a representative sample.
B.
Usually we use a random sample, but sometimes that is not
desirable.
1.
We may be interested in different segments of the population
and want to guarantee that we have a sufficient sample size from each segment.
2.
We may want to study certain segments more intensively than
others, requiring a larger sample from those segments.
C.
The major issue is not the overall sample size. It is the sample size within each cell.
1.
Example: my wine study (n=809)
2.
Hand out and discuss
D.
Ideally we would like to survey the entire population, thus
eliminating many of the statistical problems.
1.
This tends to be costly: survey administration, data
processing, and so on.
2.
Occasionally you may be able to actually use an entire
population in which case your work will be much easier.
3.
It's tempting to define the population narrowly to avoid this
problem, but usually this costs you external validity. As you narrow the definition of your
population, you are less able to generalize your results to the rest of the
world.
E.
As Black notes (p. 111) "A relatively small sample
carefully selected may provide a more valid result than a large sample poorly
chosen.
II.
Identifying the population(s)
A.
We call it a population because we're usually studying people.
B.
As noted earlier, the definition of the population is an
important factor in determining external validity.
C.
Many research questions ask about the effects of various
aspects of life on people's behavior and attitudes. Life is the experimenter, the life experience is the treatment
and the researcher is only an observer.
D.
The population definition and research question are closely
intertwined.
1.
If your research will require a lengthy questionnaire, you may
find you can't persuade very many people to answer the survey, in which case
your population will be limited.
2.
Narrowing the research question to produce a shorter
questionnaire will allow you to expand the population.
E.
Always remember that theory defines causation while data can
only tell us about correlation.
1.
While it's rare in economics, you should at least think about
using a control group for comparison.
2.
Usually, however, we can't use that sort of experiment.
III.
Sampling
A.
Two important consequences of using samples from a population.
1.
Since we want external validity to our study, we must do our
best to make sure the sample represents the population.
2.
Do your best to obtain a sample that won't introduce new
potentially confounding variables.
B.
What does the "random" in the phrase "random
sampling" mean?
1.
Might use a random number table starting at a randomly chosen
location.
2.
Might use a RAND() function in a computer, preferably seeded
with the date and time (discuss the idea of the Julian date and how it might be
used).
3.
But once a series of numbers has been generated, can it truly
be said to be random?
C.
Random sampling pulls a portion of the population such that
all possible samples of size n have the same probability of being selected.
IV.
Common Sampling Techniques
A.
Simple random sample
1.
Highly representative if all subjects participate; ideal.
2.
Not possible without complete list of population members.
B.
Stratified random sample
1.
Random sample from identifiable groups (strata), subgroups,
etc.
2.
Can ensure that specific groups are represented, even
proportionally, in the sample.
3.
Strata must be carefully defined.
C.
Cluster sample
1.
Random samples of successive clusters of subjects (e.g., by
institution) until small groups are chosen as units.
2.
May be useful when no population list exists, but local lists
do exist.
3.
Clusters must be equivalent (somehow) and some natural
clusters are not grouped by essential characteristics. For example, a geographic cluster may yield
equal sample sizes, but unemployment rates in different areas will most likely
not be the same.
D.
Stage sample
1.
Combines cluster and random/stratified sampling.
2.
Can make up a probability sample by randomizing in stages and
within groups. Possible to select
nearly random sample when population lists are very localized.
3.
Very complex, combines limitations of cluster and stratified
sampling (if random not used).
E.
Purposive sample (focus groups)
1.
Hand selected subjects on the basis of some specified
characteristic.
2.
Ensures balance of group sizes.
3.
Samples not easily defended; external validity likely to be
compromised.
F.
Quota sample
1.
Select individuals as they arrive to fill a quota by
characteristics proportional to populations.
2.
Ensures adequate sample sizes within each group.
3.
Not possible to prove the sample is representative of the
population.
G.
Snowball sample
1.
Subjects with desired characteristics give names of other
appropriate subjects.
2.
Possible to include members of groups where no lists even
exist (e.g., drug abusers, criminals).
3.
No way of knowing whether the sample is representative of the
population.
H.
Volunteer, accidental, convenience sample
1.
Major advantage is low cost.
2.
Can be highly unrepresentative.
3.
Example: wine survey.
V.
Sampling Errors
A.
Sampling errors occur simply because data are being collected
on a sample instead of the entire population.
B.
First four sampling techniques were devised to increase the
likelihood that any sample will be representative of the entire population.
C.
Statistical accuracy when the population mean and standard
deviation are known.
1.
The natural variability in a set of sample means (from
successive samples) is measured as the standard error of the mean. It is estimated from the population standard
deviation and the sample size as
.
2.
From this we can derive our first equation that gives us an
inkling about what size sample we need:
.
3.
Clearly as n gets very large the standard error of the mean
will approach the actual mean so
will approach zero.
Looking at this the other way, the closer we want
to be to the actual
mean, the larger n must be.
4.
We often calculate the z-score 
5.
See fig. 5.3. Discuss
meaning of 95% confidence interval and one vs. two tailed tests.
D.
Statistical accuracy when population mean and/or standard
deviation are not known.
1.
We can estimate the standard error of the mean using sample
statistics
where sA
is the standard deviation of the sample group A and nA is the sample
size of sample group A.
2.
Strictly speaking this only applies to purely random sampling.
3.
Using this equation we can establish a confidence
interval. For example, a 95% confidence
interval would be
. Explain why it's
1.96 with reference to figs 5.3 and 5.4.
4.
See example, p. 129
E.
The chi-square test
1.
Used to test how likely it is that a sample represents the
population. Only useful when you know
the population frequencies for each category.
2.

3.
See example p. 131 and note that the degrees of freedom are
usually m-1 not m.
F.
Sampling errors are unavoidable, but can be reduced by sound
sampling procedures. Non-sampling
errors can be caused by incorrect sampling frame (population is voters, sample
drawn from phone book), poor measuring instruments, incorrect data processing
and non-response by subjects in the sample.
VI.
Avoiding Subject Loss
A.
Is your sample a true random sample or simply volunteers?
1.
Who responded and who failed to respond? How do you measure statistics from the group
of non-respondents? (Example: wine)
2.
Even the time you query subjects may cause a bias, e.g.,
calling during dinner.
3.
Must convince subjects that it is worthwhile to
participate. (wine: offered them copy
of the finished paper) Must assure them
of confidentiality, then keep your word.
Must be careful in your analysis.
B.
Contending with non-responses
1.
Must discuss who responded and who didn't.
2.
Try to provide evidence that any non-response has nothing to
do with the research instrument.
3.
Think about ways in which your study may discourage
participation before you start sending out surveys.
4.
Question: how do you follow up to non-responders and still
maintain confidentiality? How do you
know who answered and who didn't? What
does "confidentiality" mean?
5.
Can send follow-up questionnaire.
6.
Structure questionnaires so that optionally answered questions
don't mean data loss. Figure out which
questions are more important and which are less, then (perhaps) label the less
important questions optional.
VII.
Sampling Techniques and Sample Size
A.
Fraction of population sampled is less important than n.
B.
We want to test whether characteristics of our sample are
close to those of the population. We
can use parametric tests (t-tests, analysis of variance), chi-square tests and
so on.
C.
Remember, the prime concern is to have a reasonable n in each
cell, not just a large overall n. See
p. 136.
D.
Nan: n=50. Black:
n=30.
E.
Non-responses become very important when cell sizes are
small. You may need to sample with
replacement.
F.
Random selection and assignment are both important.
VIII.
Ethics
A.
Be cautious. When in
doubt ask. Talk to other researchers.
B.
DO NOT VIOLATE PROMISES OF CONFIDENTIALITY.
IX.
Technical notes on sample distributions and statistical tests
(Black, ch. 13)
A.
At what point do ordinal data become interval discrete data?
1.
Need between 10 and 20 intervals, implying a sample size of 30
if the shape of the probability distribution is to be decipherable.
2.
Do not assume the data are normally distributed. Test for skew and kurtosis (at the minimum).
3.
Is IQ normally distributed or are IQ test results only
normally distributed? Don't confuse the
instrument with the "thing" being measured.
B.
Collect this data: height of each person in class. Plot on a histogram. Are they normally distributed?
C.
Recall the z-score:
where xi
is observation I,
is the mean of the
distribution of observations and S is the standard deviation
and we use n-1
because we lost one df calculating the mean.
Alternatively, you can use the STDEV( ) function in Excel.
D.
Why the t-test is so popular among economists
1.
We rarely know the population standard deviation. Therefore, we must calculate the sample
standard error of the mean using sample statistics. See V.D.1 above for details.
2.
However, we can no longer use the z-score but instead use
. The usual rule of
thumb is that t scores greater than 2.0 are good at the 95% level. However, cowards can look it up in the
table.
E.
Testing for normality.
1.
. where M is the
median. A normal distribution has a skewness of 0.0.
2.
Alternatively, we can use the Glass-Hopkins measure 
X.
Technical note on sample size (Black, ch. 14)
A.
Significance test denoted bya. Table 14.1 shows significance levels and the
corresponding z-score. This works
pretty well for t-scores too.
B.
Note that the level of significance should be chosen in
advance.
C.
One criterion is that the statistical test should have a high
statistical power -- a high probability of correctly rejecting the null
hypothesis.
D.
We can estimate the sample size necessary to achieve a desired
level of power:
where z(b) is
the z-score for a Type II error (incorrectly classifying the group as belonging
to the original population when it did not) and z(a) is the z-score for probability
level a from
table 14.1.
XI.
What is "power?"
A.
a is
the probability of a Type I error (rejecting Ho when it's true).
B.
b is
the probability of a Type II error (accepting Ho when it's false).
C.
The power of a test is the probability of rejecting Ho when it
is false = 1-b.
1.
Suppose the sample actually belongs to a different population
altogether. That implies the sample
mean
really belongs to a
different distribution of sampling means for a population whose mean we can
only estimate by assuming that it is equal to the sample mean
. To determine a value for b necessitates finding the overlap of these two sampling
distributions.
2.
The sampling distribution of means is normally distributed
around
. Therefore, it's
often true that b=0.5
and the power is also 0.5.
3.
Do the example in Black, pp. 381-382.
4.
We can calculate
where m0
is the mean for the original known population and sx0 is the standard
error of the mean for the original population.
5.
With x(a)
the process is simply reversed for the non-central distribution
where m1 is
the mean of the alternative population (equals the sample mean
and sx1
is the standard error of the mean for the alternative population (the standard
error of the mean for the sample).
6.
Solving for 
7.
If we accept that the two standard errors of the mean are
equal and that the best estimate of the mean for the non-central distribution
is the sample mean (
) then we can simplify the above equation to 
XII.
Estimation (Gujarati, ch. 7 plus OLS without inverting a
matrix)