Statistical Analysis
Mathematics
Statistical Analysis
You may have heard the saying "You can prove anything with statistics," which implies that statistical analysis cannot to be trusted, that the conclusions that can be drawn from it are so vague and ambiguous that they are meaningless. Yet the opposite is also true. Statistical analysis can be reliable and the results of statistical analysis can be trusted if the proper conditions are established.
What Is Statistical Analysis?
Statistical analysis uses inductive reasoning and the mathematical principles of probability to assess the reliability of a particular experimental test. Mathematical techniques have been devised to allow measurement of the reliability (or fallibility) of the estimate to be determined from the data (the sample, or "N") without reference to the original population. This is important because researchers typically do not have access to information about the whole population, and a sample—a subset of the population— is used.
Statistical analysis uses a sample drawn from a larger population to make inferences about the larger population. A population is a well-defined group of individuals or observations of any size having a unique quality or characteristic. Examples of populations include first-grade teachers in Texas, jewelers in New York, nurses at a hospital, high school principals, Democrats, and people who go to dentists. Corn plants in a particular field and automobiles produced by a plant on Monday are also populations. A sample is the group of individuals or items selected from a particular population. A random sample is taken in such a way that every individual in the population has an equal opportunity to be chosen. A random sample is also known as an unbiased sample.
Most mail surveys, mall surveys, political telephone polls, and other similar data gathering techniques generally do not meet the proper conditions for a random, unbiased sample, so their results cannot to be trusted. These are "self-selected" samples because the subjects choose whether to participate in the survey and the subjects may be picked based on the ease of their availability (for example, whoever answers the phone and agrees to the interview).
Selecting a Random Sampling
The most important criterion for trustworthy statistical analysis is correctly choosing a random sample. For example, suppose you have a bucket full of 10,000 marbles and you want to know how many of the marbles are red. You could count all of the marbles, but that would take a long time. So, you stir the marbles thoroughly and, without looking, pull out 100 marbles. Now you count the red marbles in your random sample. There are 10. Thus you could conclude that approximately 10 percent of the original marbles are red. This is a trustworthy conclusion, but it is not likely to be exactly right. You could improve your accuracy by counting a larger sample; say 1,000 marbles. Of course if you counted all the marbles, you would know the exact percentage, but the point is to pick a sample that is large enough (for example, 100 or 1,000) that gives you an answer accurate enough for your purposes.
Suppose the 100 marbles you pulled out of the bucket were all red. Would this be proof that all 10,000 marbles in the bucket were red? In science, statistical analysis is used to test a hypothesis . In the example we are testing, the hypothesis would be "all the marbles in the bucket are red."
Statistical inference makes it possible for us to state, given a sample size (100) and a population size (10,000), how often false hypotheses will be accepted and how often true hypotheses are rejected. Statistical analysis cannot conclusively tell us whether a hypothesis is true; only the examination of the entire population can do that. So "statistical proof" is a statement of just how often we will get "the right answer."
Using Basic Statistical Concepts
Statistics is applicable to all fields of human endeavor, from economics to education, politics to psychology. Procedures worked out for one field are generally applicable to the other fields. Some statistical procedures are used more often in some fields than in others.
Example 1. Suppose the Wearemout Pants Company wants to know the average height of adult American men, an important piece of information for a clothing manufacturer producing pants. The population is all men over the age of 25 who live in the United States. It is logistically impossible to measure the height of every man who lives in the United States, so a random sample of around 1,000 men is chosen. If the sample is correctly chosen, all ethnic groups, geographic regions, and socioeconomic classes will be adequately represented. The individual heights of these 1,000 men are then measured. An average height is calculated by dividing the sum of these individual heights by the total number of subjects (N = 1,000). By doing so, imagine that we calculate an average height is 1.95 meters (m) for this sample of adult males in the United States. If a representative sample was selected, then this figure can be generalized to the larger population.
The random sample of 1,000 men probably included some very short men and some very tall men. The difference between the shortest and the tallest is known as the "range" of the data. Range is one measure of the "dispersion" of a group of observations. A better measure of dispersion is the "standard deviation." The standard deviation is the square root of the sum of the squares of the differences divided by one less than the number of observations.
In this equation, xi is an observed value and is the arithmetic mean.
In our example, if a smaller height interval is used (1.10 m, 1.11 m, 1.12 m, 1.13 m, and so on) and the number of men in each height interval plotted as a function of height a smooth curve can be drawn which would have a characteristic shape, known as a "bell" curve or "normal frequency distribution." A normal frequency distribution can be stated mathematically as
The value of sigma (σ) is a measure of how "wide" the distribution is. Not all samples will have a normal distribution, but many do, and these distributions are of special interest.
The following figure shows three normal probability distributions. Because there is no skew, the mean, median, and mode are the same. The mean of curve (a) is less than the mean of curve (b), which in turn is less than the mean of (c). Yet the standard deviation, or spread, of (c) is least, whereas that of (a) is greatest. This is just one illustration of how the parameters of distributions can vary.
Example 2. One of the most common uses of statistical analysis is in determining whether a certain treatment is efficacious. For example, medical researchers may want to know if a particular medicine is effective at treating the type of pain resulting from extraction of third molars (known as "wisdom" teeth). Two random samples of approximately equal size would be selected. One group would receive the pain medication while the other group received a "placebo," a pill that looked identical but contained only inactive ingredients. The study would need to be a "double-blind" experiment, which is designed so that neither the recipients nor the persons dispensing the pills knew which was which. Researchers would know who had received the active medicines only after all the results were collected.
Example 3. Suppose a student, as part of a science fair project, wishes to determine if a particular chemical compound (Chemical X) can accelerate the growth of tomato plants. In this sort of experiment design, the hypothesis is usually stated as a null hypothesis : "Chemical X has no effect on the growth rate of tomato plants." In this case, the student would reject the null hypothesis if she found a significant difference. It may seem odd, but that is the way most of the statistical tests are set up. In this case, the independent variable is the presence of the chemical and the dependent variable is the height of the plant.
The next step is experiment design. The student decides to use height as the single measure of plant growth. She purchases 100 individual tomato plants of the same variety and randomly assigns them to 2 groups of 50 each. Thus the population is all tomato plants of this particular type and the sample is the 100 plants she has purchased. They are planted in identical containers, using the same kind of potting soil and placed so they will receive the same amount of light and air at the same temperature. In other words, the experimenter tries to "control" all of the variables, except the single variable of interest. One group will be watered with water containing a small amount of the chemical while the other will receive plain water. To make the experiment double-blind, she has another student prepare the watering cans each day, so that she will not know until after the experiment is complete which group was receiving the treatment. After 6 weeks, she plans to measure the height of the plants.
The next step is data collection. The student measures the height of each plant and records the results in data tables. She determines that the control group (which received plain water) had an average (arithmetic mean) height of 1.3 m (meters), while the treatment group had an average height of 1.4 m.
Now the student must somehow determine if this small difference was significant or if the amount of variation measured would be expected under no treatment conditions. In other words, what is the probability that 2 groups of 50 tomato plants each, grown under identical conditions would show a height difference of 0.1 m after 6 weeks of growth? If this probability is less than or equal to a certain predetermined value, then the null hypothesis is rejected. Two commonly used values of probability are 0.05 or 0.01. However, these are completely arbitrary choices determined mostly by the widespread use of previously calculated tables for each value. Modern computer analysis techniques allow the selection of any value of probability.
The simplest test of significance is to determine how "wide" the distribution of heights is for each group. If there is a wide variance (σ) in heights (say, σ = 25), then small differences in mean are not likely to be significant. On the other hand, if the dispersion is narrow (for example, if all the plants in each group were close to the same height, so that σ = 0.1) then the difference would probably be significant.
There are several different tests the student could use. Selecting the right test is often a tricky problem. In this case, the student can reject several tests outright. For example, the chi-square test is suitable for nominal scales (yes or no answers are one example of nominal scales), so it does not work here. The F -test measures variability or dispersion within a single sample. It too is not suitable for comparing two samples. Other statistical tests can also be rejected as inappropriate for various reasons.
In this case, since the student is interested in comparing means, the best choice is a t test. The t -test compares two means using this formula:
In this case, the null hypothesis assumes that μ1 − μ2 = 0 (no difference in the sample groups), so that we can say:
The quantity is known as the standard error of the mean difference. When the sample sizes are the same, . The standard error of the mean difference is the square root of the sums of the squares of the standard errors of the means for each group. The standard error of the mean for each group is easily calculated from . N is the sample size, 50, and the student can calculate the standard deviation by the formula for standard deviation given above.
The final experimental step is to determine sensitivity. Generally speaking, the larger the sample, the more sensitive the experiment is. The choice of 50 tomato plants for each group implies a high degree of sensitivity.
Students, teachers, psychologists, economists, politicians, educational researchers, medical researchers, biologists, coaches, doctors and many others use statistics and statistical analysis every day to help them make decisions. To make trustworthy and valid decisions based on statistical information, it is necessary to: be sure the sample is representative of the population; understand the assumptions of the procedure and use the correct procedure; use the best measurements available; keep clear what is being looked for; and to avoid statements of causal relations if they are not justified.
see also Central Tendency, Measures of; Data Collection and Interpretation; Graphs; Mass Media, Mathematics and the.
Elliot Richmond
Bibliography:
Huff, Darrell. How to Lie With Statistics. New York: W. W. Norton & Company, 1954.
Kirk, Roger E. Experimental Design, Procedures for the Behavioral Sciences. Monterrey, CA: Brooks/Cole Publishing Company, 1982.
Paulos, John Allen. Innumeracy: Mathematical Illiteracy and Its Consequences. New York: Hill & Wang, 1988.
Tufte, Edward R. The Visual Display of Quantitative Information. Cheshire, CT: Graphics Press, 1983.
MEAN, MEDIAN, AND MODE
The average in the clothing example is known as the "arithmetic mean," which is one of the measures of central tendency. The other two measures of central tendency are the"median" and the "mode." The median is the number that falls in the mid-point of an ordered data set, while the mode is the most frequently occurring value.
Find more facts and information related to the .
Copyright 2002, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.
Related newspaper, magazine, and trade journal articles from HighBeam Research
(Including press releases, facts, information, and biographies)
|
Statistical Analysis with Excel(TM) for Dummies
; Statistical Analysis with Excel(TM) for Dummies Joseph...broad coverage of statistics and statistical analysis with a lot of detail on the...and get some experience doing statistical analysis and they have Excel available...
Read more
|
|
Distributing microarray data: statistical analysis software solutions look to manage the wealth of information in microarray data.(Expanded Article Online)
; ...and the consolidation of rigorous statistical analysis methods for microarray data. Indeed...statistical community and many appropriate statistical analysis methods have been developed through...throughout experimental practice and statistical analysis workflows. This basic workflow ...
Read more
|
|
Planning, Construction and Statistical Analysis
; Planning, Construction and Statistical Analysis Francis G. Giesbrecht and Marcia...book). Planning, Construction and Statistical Analysis of Comparative Experiments emphasizes...tool as well as a good reference on statistical analysis. This book is recommended to those...
Read more
|
|
Science & Nature: Statistical analysis established that smoking led to lung cancer. Statistics save lives
; ...outrageously unfair and misleading. Statistical analysis is fundamental to careful interpretation...primary importance. But after careful statistical analysis, they established that smoking was...sociology. He has written not only about statistical analysis but also about the closely ...
Read more
|
|
Statistical Analysis of Designed Experiments.(Book Reviews)(Book Review)
; Statistical Analysis of Designed Experiments (2nd...Model 9. Cross-Over Design 10. Statistical Analysis of Incomplete Data Three appendices...most cases arising in practice. Statistical Analysis of Designed Experiments might...
Read more
|
|
An Introduction to Multivariate Statistical Analysis, 3rd ed.
; ...Introduction to Multivariate Statistical Analysis, 3rd ed. T.W. Anderson...the author of The Statistical Analysis of Time Series. The...Introduction to Multivariate Statistical Analysis was derived from lecture...
Read more
|
|
Key factors for economic success in industrial nations; a statistical analysis.(Brief article)(Book review)
; ...factors for economic success in industrial nations; a statistical analysis. Margolis, Lawrence. Edwin Mellen Pr. 2007 232 pages...Margolis (political science, U. of Michigan) employs statistical analysis, particularly Spearman's Rank-Order coefficients, in...
Read more
|
|
Applied Multivariate Statistical Analysis.(Review)
; Applied Multivariate Statistical Analysis (4th ed.), by Richard...of Applied Multivariate Statistical Analysis (2nd ed.), by R. A. Johnson...Applied Multivariate Statistical Analysis (2nd ed.), Englewood Cliffs...
Read more
|
|
Needed: Real-world statistical analysis tools
; Consider these two statements: Statistical analysis is an essential part of modern manufacturing.; Most statistical analysis today ignores realworld...order. In the early days of statistical analysis, researchers discovered...
Read more
|
|
Survey course on research methods: Integrating statistical analysis and study design
; ...acquisition and interpretation of data, statistical analysis, and validity of conclusions. The...connection between research design and statistical analysis is emphasized throughout the course...share the methods used to convey statistical analysis and research design concepts as...
Read more
|
For more facts and information,
see all related premium articles
Related entries from encyclopedias, dictionaries, and thesauruses
|
statistical demand analysis
statistical demand analysis A set of statistical procedures used to discover the most important real factors affecting sales and their relative influence; the most commonly analysed factors are prices, income, population, and promotion.
Read more
|
|
statistical analysis
statistical analysis See statistical methods .
Read more
|
|
Statistical Analysis System
Statistical Analysis System See SAS .
Read more
|
|
Statistical Geneticist
...these disciplines. Typical statistical geneticists spend much...computers, since much of the statistical analysis of data is now conducted...pencil and paper. Many statistical geneticists are actively...simulated. Once a new statistical method is validated using...then be applied to the ...
Read more
|
|
statistical methods
...compensating factors, etc. Statistical analysis relates observed statistical data to theoretical models...models used in regression analysis . By estimating parameters...to similar situations. Statistical prediction is the application...including exploratory data analysis .
Read more
|