1、Designation: E 122 09An American National StandardStandard Practice forCalculating Sample Size to Estimate, With SpecifiedPrecision, the Average for a Characteristic of a Lot orProcess1This standard is issued under the fixed designation E 122; the number immediately following the designation indicat
2、es the year oforiginal adoption or, in the case of revision, the year of last revision. A number in parentheses indicates the year of last reapproval. Asuperscript epsilon () indicates an editorial change since the last revision or reapproval.1. Scope1.1 This practice covers simple methods for calcu
3、lating howmany units to include in a random sample in order to estimatewith a specified precision, a measure of quality for all the unitsof a lot of material, or produced by a process. This practice willclearly indicate the sample size required to estimate theaverage value of some property or the fr
4、action of nonconform-ing items produced by a production process during the timeinterval covered by the random sample. If the process is not ina state of statistical control, the result will not have predictivevalue for immediate (future) production. The practice treats thecommon situation where the
5、sampling units can be consideredto exhibit a single (overall) source of variability; it does nottreat multi-level sources of variability.2. Referenced Documents2.1 ASTM Standards:2E 456 Terminology Relating to Quality and Statistics3. Terminology3.1 Definitions: Unless otherwise noted, all statistic
6、al termsare defined in Terminology E 456.3.2 Symbols: Symbols used in all equations are defined asfollows:E = the maximum acceptable difference between the trueaverage and the sample average.e = E/, maximum acceptable difference expressed as afraction of .f = degrees of freedom for a standard deviat
7、ion estimate(7.5).k = the total number of samples available from the sameor similar lots. = lot or process mean or expected value of X, the resultof measuring all the units in the lot or process.0= an advance estimate of .N = size of the lot.n = size of the sample taken from a lot or process.nj= siz
8、e of sample j.nL= size of the sample from a finite lot (7.4).p8 = fraction of a lot or process whose units have thenonconforming characteristic under investigation.p0= an advance estimate of p8.p = fraction nonconforming in the sample.R = range of a set of sampling values. The largest minusthe small
9、est observation.Rj= range of sample j.R=(j 5 1kRj/k, average of the range of k samples, all of thesame size (8.2.2).s = lot or process standard deviation of X, the result ofmeasuring all of the units of a finite lot or process.s0= an advance estimate of s.s =(i 5 1n(Xi X )2/(n1)1/2, an estimate of t
10、hestandard deviation s from n observation, Xi, i = 1 to n.s =(j 5 1kSj/k, average s from k samples all of the same size(8.2.1).sp= pooled (weighted average) s from k samples, not all ofthe same size (8.2).sj= standard deviation of sample j.Vo= an advance estimate of V, equal to do/o.v = s/X, the coe
11、fficient of variation estimated from thesample.vp= pooled (weighted average) of v from k samples (8.3).vj= coefficient of variation from sample j.X = numerical value of the characteristic of an individualunit being measured.1This practice is under the jurisdiction ofASTM Committee E11 on Quality and
12、Statistics and is the direct responsibility of Subcommittee E11.10 on Sampling/Statistics.Current edition approved Aug. 1, 2009. Published September 2009. Originallyapproved in 1958. Last previous edition approved in 2007 as E 12207.2For referenced ASTM standards, visit the ASTM website, www.astm.or
13、g, orcontact ASTM Customer Service at serviceastm.org. For Annual Book of ASTMStandards volume information, refer to the standards Document Summary page onthe ASTM website.1Copyright ASTM International, 100 Barr Harbor Drive, PO Box C700, West Conshohocken, PA 19428-2959, United States.X=(i 5 1nXi/n
14、iaverage of n observations, Xi,i= 1 to n.4. Significance and Use4.1 This practice is intended for use in determining thesample size required to estimate, with specified precision, ameasure of quality of a lot or process. The practice applieswhen quality is expressed as either the lot average for a g
15、ivenproperty, or as the lot fraction not conforming to prescribedstandards.The level of a characteristic may often be taken as anindication of the quality of a material. If so, an estimate of theaverage value of that characteristic or of the fraction of theobserved values that do not conform to a sp
16、ecification for thatcharacteristic becomes a measure of quality with respect to thatcharacteristic. This practice is intended for use in determiningthe sample size required to estimate, with specified precision,such a measure of the quality of a lot or process either as anaverage value or as a fract
17、ion not conforming to a specifiedvalue.5. Empirical Knowledge Needed5.1 Some empirical knowledge of the problem is desirablein advance.5.1.1 We may have some idea about the standard deviationof the characteristic.5.1.2 If we have not had enough experience to give a preciseestimate for the standard d
18、eviation, we may be able to state ourbelief about the range or spread of the characteristic from itslowest to its highest value and possibly about the shape of thedistribution of the characteristic; for instance, we might be ableto say whether most of the values lie at one end of the range,or are mo
19、stly in the middle, or run rather uniformly from oneend to the other (Section 9).5.2 If the aim is to estimate the fraction nonconforming,then each unit can be assigned a value of 0 or 1 (conforming ornonconforming), and the standard deviation as well as theshape of the distribution depends only on
20、p8, the fractionnonconforming in the lot or process. Some rough idea con-cerning the size of p8 is therefore needed, which may bederived from preliminary sampling or from previous experi-ence.5.3 More knowledge permits a smaller sample size. Seldomwill there be difficulty in acquiring enough informa
21、tion tocompute the required size of sample. A sample that is largerthan the equations indicate is used in actual practice when theempirical knowledge is sketchy to start with and when thedesired precision is critical.5.4 The precision of the estimate made from a randomsample may itself be estimated
22、from the sample. This estima-tion of the precision from one sample makes it possible to fixmore economically the sample size for the next sample of asimilar material. In other words, information concerning theprocess, and the material produced thereby, accumulates andshould be used.6. Precision Desi
23、red6.1 The approximate precision desired for the estimate mustbe prescribed. That is, it must be decided what maximumdeviation, E, can be tolerated between the estimate to be madefrom the sample and the result that would be obtained bymeasuring every unit in the lot or process.6.2 In some cases, the
24、 maximum allowable sampling error isexpressed as a proportion, e, or a percentage, 100 e. Forexample, one may wish to make an estimate of the sulfurcontent of coal within 1 %, or e = 0.01.7. Equations for Calculating Sample Size7.1 Based on a normal distribution for the characteristic, theequation f
25、or the size, n, of the sample is as follows:n 5 3so/E!2(1)The multiplier 3 is a factor corresponding to a low probabil-ity that the difference between the sample estimate and theresult of measuring (by the same methods) all the units in thelot or process is greater than E. The value 3 is recommended
26、for general use. With the multiplier 3, and with a lot or processstandard deviation equal to the advance estimate, it is practi-cally certain that the sampling error will not exceed E. Wherea lesser degree of certainty is desired a smaller multiplier maybe used (Note 1).NOTE 1For example, multiplyin
27、g by 2 in place of 3 gives a probabil-ity of about 45 parts in 1000 that the sampling error will exceed E.Although distributions met in practice may not be normal, the followingtext table (based on the normal distribution) indicates approximateprobabilities:Factor Approximate Probability of Exceedin
28、g E3 0.003 or 3 in 1000 (practical certainty)2.56 0.010 or 10 in 10002 0.045 or 45 in 10001.96 0.050 or 50 in 1000 (1 in 20)1.64 0.100 or 100 in 1000 (1 in 10)7.1.1 If a lot of material has a highly asymmetric distribu-tion in the characteristic measured, the sample size as calcu-lated in Eq 1 may n
29、ot be adequate. There are two things to dowhen asymmetry is suspected.7.1.1.1 Probe the material with a view to discovering, forexample, extra-high values, or possibly spotty runs of abnor-mal character, in order to approximate roughly the amount ofthe asymmetry for use with statistical theory and a
30、djustment ofthe sample size if necessary.7.1.1.2 Search the lot for abnormal material and segregate itfor separate treatment.7.2 There are some materials for which s varies approxi-mately with , in which case V (=s/) remains approximatelyconstant from large to small values of .7.2.1 For the situatio
31、n of 7.2, the equation for the samplesize, n, is as follows:n 5 3 Vo/e!2(2)If the relative error, e, is to be the same for all values of ,then everything on the right-hand side of Eq 2 is a constant;hence n is also a constant, which means that the same samplesize n would be required for all values o
32、f .7.3 If the problem is to estimate the lot fraction noncon-forming, then so2is replaced by po(1po) so that Eq 1becomes:n 5 3/E!2po1 2 po! (3)E1220927.4 When the average for the production process is notneeded, but rather the average of a particular lot is needed, thenthe required sample size is le
33、ss than Eq 1, Eq 2, and Eq 3indicate. The sample size for estimating the average of thefinite lot will be:nL5 n/1 1 n/N!# (4)where n is the value computed from Eq 1, Eq 2, or Eq 3. Thisreduction in sample size is usually of little importance unless nis 10 % or more of N7.5 When the information on th
34、e standard deviation islimited, a sample size larger than indicated in Eq 1, Eq 2, andEq 3 may be appropriate. When the advance estimate s0isbased on f degrees of freedom, the sample size in Eq 1 may bereplaced byn 5 3s0/E!21 1 =2/f! (6)NOTE 2The standard error of a sample variance with f degrees of
35、freedom, based on the normal distribution, is =2s4/f . The factor1 1 =2/f! has the effect of increasing the preliminary estimate s02byone times its standard error.8. Reduction of Empirical Knowledge to a NumericalValue of so(Data for Previous Samples Available)8.1 This section illustrates the use of
36、 the equations inSection 7 when there are data for previous samples.8.2 For Eq 1An estimate of socan be obtained fromprevious sets of data. The standard deviation, s, from any givensample is computed as:s 5 (i 5 1nXi2 X!2/n 2 1!#1/2(6)The s value is a sample estimate of so. A better, more stablevalu
37、e for somay be computed by pooling the s values obtainedfrom several samples from similar lots. The pooled s value spfor k samples is obtained by a weighted averaging of the kresults from use of Eq 6.sp5 (j 5 1knj2 1!sj2/(j 5 1knj2 1!#1/2(7)8.2.1 If each of the previous data sets contains the samenu
38、mber of measurements, nj, then a simpler, but slightly lessefficient estimate for somay be made by using an average ( s)of the s values obtained from the several previous samples. Thecalculated s value will in general be a slightly biased estimateof so. An unbiased estimate of sois computed as follo
39、ws:so5sc4(8)where the value of the correction factor, c4, depends on thesize of the individual data sets (nj)(Table 13).8.2.2 An even simpler, and slightly less efficient estimate forsomay be computed by using the average range ( R) takenfrom the several previous data sets that have the same groupsi
40、ze.so5Rd2(9)The factor, d2, from Table 1 is needed to convert the averagerange into an unbiased estimate of so.8.2.3 Example 1 Use of s.8.2.3.1 ProblemTo compute the sample size needed toestimate the average transverse strength of a lot of bricks whenthe value of E is 50 psi, and practical certainty
41、 is desired.8.2.3.2 SolutionFrom the data of three previous lots, thevalues of the estimated standard deviation were found to be215, 192, and 202 psi based on samples of 100 bricks. Theaverage of these three standard deviations is 203 psi. The c4value is essentially unity when Eq 1 gives the followi
42、ngequation for the required size of sample to give a maximumsampling error of 50 psi:n 5 3 3 203!/5025 12.225 149 bricks (10)8.3 For Eq 2If s varies approximately proportionatelywith for the characteristic of the material to be measured,compute the average, X, the standard deviation, s, and thecoeff
43、icient of variation v for each sample. The pooled V valuevpfor k samples, not necessarily of the same size, is obtainedby a weighted average of the k results. Then use Eq 2.vp5 (j 5 1knj2 1!vj2/(j 5 1knj2 1!#1/2(11)8.3.1 Example 2 Use of V, the estimated coefficient ofvariation:8.3.1.1 ProblemTo com
44、pute the sample size needed toestimate the average abrasion resistance (i.e., average numberof cycles) of a material when the value of e is 0.10 or 10 %, andpractical certainty is desired.8.3.1.2 SolutionThere are no data from previous samplesof this same material, but data for six samples of simila
45、rmaterials show a wide range of resistance. However, the valuesof estimated standard deviation are approximately proportionalto the observed averages, as shown in the following text table:Lot No.SampleSizeAvgCyclesStandardDeviationCoefficientof Varia-tion, %110 9013142 10 190 32 173 10 350 45 134 10
46、 450 71 165 10 1000 120 126 10 3550 680 19Pooled 15.4The use of the pooled coefficient of variation for Voin Eq 2gives the following for the required size of sample to give amaximum sampling error not more than 10 % of the expectedvalue:n 5 3 3 15.4!/1025 21.322 test specimens (12)3ASTM Manual on Pr
47、esentation of Data and Control Chart Analysis, ASTMMNL 7A, 2002, Part 3TABLE 1 Values of the Correction Factor C4and d2for SelectedSample Sizes njASample Size3,(nj) C4d22 .798 1.134 .921 2.065 .940 2.338 .965 2.8510 .973 3.08AAs njbecomes large, C4approaches 1.000.E1220938.3.1.3 If a maximum allowab
48、le error of 5 % were needed,the required sample size would be 86 specimens. The datasupplied by the prescribed sample will be useful for the studyin hand and also for the next investigation of similar material.8.4 For Eq 3Compute the estimated fraction nonconform-ing, p, for each sample. Then for th
49、e weighted average use thefollowing equation:p 5total number nonconforming in all samplestotal number of units in all samples(13)8.4.1 Example 3Use of p:8.4.1.1 ProblemTo compute the size of sample needed toestimate the fraction nonconforming in a lot of alloy steel trackbolts and nuts when the value of E is 0.04, and practicalcertainty is desired.8.4.1.2 SolutionThe data in the following table from fourprevious lots were used for an advance estimate of p:Lot No. Sample SizeNumberNonconformingFractionNonconforming1