|
Definition Most statistical experiments attempt to understand a measurable population by observing a number of entities from that population. On the other hand, simulation experiments attempt to understand a non-measurable population (e.g., a jury that has not yet been selected) by observing a number of entities from a measurable population that is thought to be similar (e.g., a focus group. Experiment design is a plan to achieve an objective efficiently. Accuracy is an important part of achieving any objective. Accuracy is increased when bias is decreased and precision is increased. In order to design an experiment, one must
A number of conditions, each having one or more levels or values, exist during an experiment. If only one condition affects the variable of interest (usually called the response variable), the design may be very simple. However, if a number of conditions affect the variable, the design can be quite complex. Letting the levels of a number of conditions vary simultaneously is desirable because doing so may reveal interactions between conditions. |
1. IntroductionThe design of an experiment should always be the first step in conducting the experiment. Without a proper design, the objective cannot be achieved. Before discussing experiment design, consider the preliminary notions of
1.1 Inductive InferenceDrawing conclusions about the general (i.e., the population) from knowledge of the specific (i.e., a sample from the population) is called inductive inference. Even though inductive inference results in uncertain success, the methods of statistics allow us to measure the uncertainty and reduce it to a known, tolerable level. In contrast, deductive inference always yields a correct conclusion because the conclusion results from a chain of proved conclusions (e.g., mathematical theorems are proved by deductive inference.) 1.2 PopulationsA population can be defined by the number of entities comprising it and by the characteristics or conditions of those entities. 1.2.1 Number of EntitiesAn entity is a single fact, trial, individual, etc. The population is the totality of entities of a specified type. There are two types of populations, depending upon the number of entities:
1.2.2 Characteristics or Conditions of EntitiesAn entity is defined by one or more characteristics or conditions. Each characteristic or condition can have one or more values (if it is quantitative) or it can have one or more levels (if it is qualitative). For example, an individual might be 35 years old (a value because it is quantitative) and Caucasian (a level because it is qualitative). (Henceforth, for simplicity, the word "conditions" will refer to either characteristics or conditions, and the word "levels" will refer to either values or levels.)
1.3 SamplesA sample is a subset of a population. The number of entities in the sample is called the sample size. An outcome is the observed level of a trial (not a court trial). A test provides outcomes from a number of trials. Outcomes from a test can provide an estimate of the (true) population level. If one is to infer something about the population from a sample, care must be taken that the sample is from the target population, and it is a random sample:
1.4 Principals of Experiment DesignBias is the difference between an estimate (from a sample) and the true value (of a population). Precision is the measure of the closeness of the values in an estimate: Accuracy of an estimate is the bias plus the precision. The following four principals of experiment design minimize the inaccuracy of an estimate. The first principal tends to reduce bias, and the other three principals tend to reduce imprecision:
2. Determine the Objective of the Experiment and Select Its AnalysisThere are a number of possible objectives and analyses. The proper analysis depends upon the objective of the experiment. Following is a list of common experiment objectives, a list of recommended analyses, and a table showing the recommended analysis for each objective. 2.1 Common Experiment ObjectivesThe variable of interest, called a response variable, is analyzed to achieve the objective. There are five rather common objectives of an experiment.
2.2 Recommended AnalysesAlthough a great variety of analyses are possible, one of five analyses is recommended to achieve each objective:
2.3 Select the Analysis to Meet the ObjectiveThe analyses and objectives discussed in this section are not exhaustive: The objectives are common objectives, and the analyses are plausible analyses. The following table matches these objectives and analyses. The "x" indicates the analysis appropriate for each objective. After the analysis is selected, one can judiciously choose the number of levels of the variable conditions.
|
|||||||||||||||||||||||||||||||||||||||||||
3. Select the Response VariableThe response variable is the variable chosen to determine if the objective has been achieved. Examples of response variables might be
4. Define the PopulationThe population is the set of all entities. The population is defined by the conditions and levels existing during the experiment, both fixed conditions and variable conditions. A fixed condition will either have no options or the experimenter will choose a single option (e.g., a single level, such as gender) for the entire experiment. Variable conditions have more than one level - and, hence, present an opportunity to design an experiment; the experiment is conducted over selected combinations of levels of the variable conditions. 5. Select the DesignThe design must be selected to meet the specified precision, time, and budget. Following are some common designs:
6. Determine the Sample SizeThe sample size can be determined from time, budget, or specified precision. For most objectives, FactLogic can estimate both the absolute precision (i.e., measured by the difference between the true value and the estimate) and the relative precision (i.e., measured by the percentage difference between the true value and the estimate). FactLogic estimates the precision from a preliminary test using a focus group using, probably, the Internet. However, for the acceptance objective, the sample size is determined from an operating characteristic curve. Because a sample has a finite number of trials, an interval of uncertainty exists about an acceptable (threshold) level. The probability of committing two types of errors in acceptance tests can be measured and reduced by using a larger sample:
|
|
Summary An experiment must be well designed in order to achieve its objective. Drawing conclusions about the general (i.e., the population) from knowledge of the specific (i.e., a sample from the population) is called inductive inference. Even though inductive inference results in uncertain success, the methods of statistics allow us to measure the uncertainty and reduce it to a known, tolerable level. The four principals of experiment design help reduce uncertainty. They are
A number of entities might affect the variable of interest (called the response variable). Each entity has a number of characteristics or conditions and each characteristic or condition has one or more values or levels. When these values or levels are varied and the response variable is observed for each variation, we can determine which, if any values or levels affect the response variable. In order to design an experiment, one must
|
| © Convex Corporation 1999 - 2005 |