ASTM E1847-1996(2003) Standard Practice for Statistical Analysis of Toxicity Tests Conducted Under ASTM Guidelines《根据ASTM指南进行毒素试验统计分析的标准操作规程》.pdf

资源描述

1、Designation: E 1847 96 (Reapproved 2003)Standard Practice forStatistical Analysis of Toxicity Tests Conducted UnderASTM Guidelines1This standard is issued under the fixed designation E 1847; the number immediately following the designation indicates the year oforiginal adoption or, in the case of re

2、vision, the year of last revision. A number in parentheses indicates the year of last reapproval. Asuperscript epsilon (e) indicates an editorial change since the last revision or reapproval.1. Scope1.1 This practice covers guidance for the statistical analysisof laboratory data on the toxicity of c

3、hemicals or mixtures ofchemicals to aquatic or terrestrial plants and animals. Thispractice applies only to the analysis of the data, after the testhas been completed.All design concerns, such as the statementof the null hypothesis and its alternative, the choice of alphaand beta risks, the identifi

4、cation of experimental units, possiblepseudo replication, randomization techniques, and the execu-tion of the test are beyond the scope of this practice. Thispractice is not a textbook, nor does it replace consultation witha statistician. It assumes that the investigator recognizes thestructure of h

5、is experimental design, has identified the experi-mental units that were used, and understands how the test wasconducted. Given this information, the proper statistical analy-ses can be determined for the data.1.1.1 Recognizing that statistics is a profession in whichresearch continues in order to i

6、mprove methods for performingthe analysis of scientific data, the use of statistical methodsother than those described in this practice is acceptable as longas they are properly documented and scientifically defensible.Additional annexes may be developed in the future to reflectcomments and needs id

7、entified by users, such as more detaileddiscussion of probit and logistic regression models, or statisti-cal methods for dose response and risk assessment.1.2 The sections of this guide appear as follows:Title SectionReferenced Documents 2Terminology 3Significance and Use 4Statistical Methods 5Flow

8、Chart 6Flow Chart Comments 7Keywords 8References1.3 This standard does not purport to address all of thesafety concerns, if any, associated with its use. It is theresponsibility of the user of this standard to establish appro-priate safety and health practices and determine the applica-bility of reg

9、ulatory limitations prior to use.2. Referenced Documents2.1 ASTM Standards:2E 178 Practice for Dealing with Outlying ObservationsE 380 Practice for Use of the International System of Units(SI) (the Modernized Metric System)E 456 Terminology Relating to StatisticsE 1241 Guide for Conducting Early Lif

10、e Stage ToxicityTests with FishesE 1325 Terminology Relating to Design of Experiments3. Terminology3.1 Definitions of Terms Specific to This StandardThefollowing terms are defined according to the references noted:3.1.1 analysis of variance (ANOVA)a technique that sub-divides the total variation of

11、a set of data into meaningfulcomponent parts associated with specific sources of variationfor the purpose of testing some hypothesis on the parameters ofthe model or estimating variance components (1).33.1.2 categorical datavariates that take on a limitednumber of distinct values (2).3.1.3 censored

12、datasome subjects have not experiencedthe event of interest at the end of the study or time of analysis.The exact survival times of these subjects are unknown (3).3.1.4 central limit theoremwhatever the shape of thefrequency distribution of the original populations of Xs, thefrequency distribution o

13、f the mean, in repeated randomsamples of size n tends to become normal as n increases (2).3.1.5 central tendency measurea statistic that measuresthe central location of the sample observations (4).3.1.6 concentration-response testingthe quantitative rela-tion between the amount of factor X and the m

14、agnitude of theeffect it causes is determined by performing parallel sets of1This practice is under the jurisdiction of ASTM Committee E47 on BiologicalEffects and Environmental Fate and is the direct responsibility of SubcommitteeE47.06 on Terminology and Technical Services.Current edition approved

15、 Dec. 10, 1996. Published February 1997.2For referenced ASTM standards, visit the ASTM website, www.astm.org, orcontact ASTM Customer Service at serviceastm.org. For Annual Book of ASTMStandards volume information, refer to the standards Document Summary page onthe ASTM website.3The boldface numbers

16、 given in parentheses refer to a list of references at theend of the text.1Copyright ASTM International, 100 Barr Harbor Drive, PO Box C700, West Conshohocken, PA 19428-2959, United States.operations with various known amounts, or doses, of the factorand measuring the result, that is called the resp

17、onse (5).3.1.7 continuous dataa variable that can assume a con-tinuum of possible outcomes (4).3.1.8 controlan experiment in which the subjects aretreated as in a parallel experiment except for omission of theprocedure or agent under test and that is used as a standard ofcomparison in judging experi

18、mental effects (6).3.1.9 dichotomous datavariates that have only 2 mutuallyexclusive outcomes, binary data, success or failure data (3).3.1.10 dispersion measurea statistic that measures thecloseness of the independent observations within groups, orrelative to a samples central value (4).3.1.11 dist

19、ributiona set of all the various values thatindividual observations may have and the frequency of theiroccurrence in the sample or population (1).3.1.12 duplicationthe execution of a treatment at leasttwice under similar conditions (1).3.1.13 experimental unita portion of the experimentalspace to wh

20、ich a treatment is applied or assigned in theexperiment (1).3.1.14 homogeneitylack of significant differences amongmean squares of an analysis (2).3.1.15 hypothesis testa decision rule (strategy, recipe)which, on the basis of the sample observations, either acceptsor rejects the null hypothesis (4).

21、3.1.16 independencehaving the property that the jointprobability (as of all events or samples) or the joint probabilitydensity function (as of random variables) equals the product ofthe probabilities or probability density functions of separateoccurrence (6).3.1.17 meana measure of central tendency

22、or location thatis the sum of the observations divided by the number ofobservations (1).3.1.18 modelan equation that is intended to provide afunctional description of the sources of information which maybe obtained from an experiment (1).3.1.19 nonparametric statistica statistic which has certaindes

23、irable properties that hold under relatively mild assump-tions regarding the underlying populations (4).3.1.20 normalityhaving the characteristics of a normaldistribution (2).3.1.21 outlieran outlying observation is one that appearsto deviate markedly from other members of the sample inwhich it occu

24、rs (see Practice E 178).3.1.22 parametric statistica statistic that estimates anunknown constant associated with a population (4).3.1.23 probit logitwhen the response Y in binary, theprobit/logit equation is as follows:p 5 PrY 5 0! 5 C 1 1 2 C! Fx*b! (1)where:b = vector of parameter estimates,F = cu

25、mulative distribution function (normal, logistic),x = vector of independent variables,p = probability of a response, andC = natural (threshold) response rate.The choice of the distribution function, F, (normal for theprobit model, logistic for the logit model) determines the typeof analysis (7).3.1.

26、24 regression analysisthe process of estimating theparameters of a model by optimizing the value of an objectivefunction (for example, by the method of least squares) and thentesting the resulting predictions for statistical significanceagainst an appropriate null hypothesis model (1).3.1.25 replica

27、tionthe repetition of the set of all thetreatment combinations to be compared in an experiment. Eachof the repetitions is called a replicate (1).3.1.26 residualYobsminus Ypred the difference betweenthe observed response variable value and the response variablevalue that is predicted by the model tha

28、t is fit to the data (8).3.1.27 scedasticityvariance (5).3.1.28 significance levelthe probability at which the nullhypothesis is falsely rejected, that is, rejecting the null hypoth-esis when in fact it is true (4).3.1.29 transformationthe transformation of the observa-tions Xij into another scale f

29、or purposes of allowing thestandard analysis to be used as an adequate approximation (2).3.1.30 treatmenta combination of the levels of each of thefactors assigned to an experimental unit (see TerminologyE 456).3.1.31 variancea measure of the squared dispersion ofobserved values or measurements expr

30、essed as a function ofthe sum of the squared deviations from the population mean orsample average (see Terminology E 456).4. Significance and Use4.1 The use of statistical analysis will enable the investiga-tor to make better, more informed decisions when using theinformation derived from the analys

31、es.4.1.1 The goals when performing statistical analyses, are tosummarize, display, quantify, and provide objective measuresfor assessing the relationships and anomalies in data. Statisticalanalyses also involve fitting a model to the data and makinginferences from the model. The type of data dictate

32、s the type ofmodel to be used. Statistical analysis provides the means to testdifferences between control and treatment groups (one form ofhypothesis testing), as well as the means to describe therelationship between the level of treatment and the measuredresponses (concentration effect curves), or

33、to quantify thedegree of uncertainty in the end-point estimates derived fromthe data.4.1.2 The goals of this practice are to identify and describecommonly used statistical procedures for toxicity tests. Fig. 1,Section 6, following statistical methods (Section 5), presents aflow chart and some recomm

34、ended analysis paths, with refer-ences. From this guideline, it is recommended that eachinvestigator develop a statistical analysis protocol specific tohis test results. The flow chart, along with the rest of thisguideline, may provide both useful direction, and service as aquality assurance tool, t

35、o help ensure that important steps in theanalysis are not overlooked.E 1847 96 (2003)25. Statistical Methods5.1 Exploratory Data AnalysisThe first step in any dataanalysis is to look at the data and become familiar with theircontent, structure, and any anomalies that might be present.5.1.1 Plots:5.1

36、1.1 Histograms are unidimensional plots that show thedistributional shapes in the data and the frequencies of indi-vidual values. These diagrams allow the investigator to checkfor unusual observations and also visually check the validity ofsome assumptions that are necessary for several statistical

37、analyses that may be used (9).5.1.1.2 Scatter plots of two or more variables demonstratethe relationships among the variables, so that correlations canbe observed and interactions can be studied. These plots arevery useful when looking for concentration effect relationships(9).5.1.1.3 Normality and

38、box plots are additional plots thatgive distributional information, quantiles and pictures of thedata, either as a whole or by treatment group (9).5.1.2 OutliersOn occasion, some data points in the histo-gram, scatter plot, or box plot, appear to be quite different fromthe majority of points. These

39、data, known as outliers, can betested to determine if they are truly different from the distri-bution of the experimental data (10). The Z or t scores areusually used for testing, with a confidence level chosen by theinvestigator. If they are different and can be attributed to anerror in the executi

40、on of the study (violation of protocol, dataentry error, and so forth), then they can be removed from theanalyses. However, if there is no legitimate reason to removethem, then they must be kept in the analyses. It is recom-mended that the analyses can be conducted on two data sets,the complete one

41、and one with the outliers removed. In thisway, the outliers influence on the analyses can be studied.5.1.3 Non-Detected Data:5.1.3.1 Data that fall below a chemical analysis thresholdlevel of detection, in an analytical technique used to measure avalue, are called non-detected. Values that occur abo

42、ve thedetection limit but are below the limit of quantitation, arecalled non-estimable. Occasionally, the two terms are usedinterchangeably. Essentially, these data are results for which noreliable number can be determined.5.1.3.2 In analyzing a data set containing one or morenon-detects, several me

43、thods can be used. If the amount ofnon-detects is below approximately 25 % of the entire data set,then the non-detects can be replaced by one half the detectionlimit (or quantitation limit, whichever is appropriate) andFIG. 1 Flow Chart for Practice for Statistical AnalysisE 1847 96 (2003)3FIG. 1 Fl

44、ow Chart for Practice for Statistical Analysis (continued)FIG. 1 Flow Chart for Practice for Statistical Analysis (continued)E 1847 96 (2003)4analysis proceeds (11). One half the detection or quantitationlimit is often used to prevent undue bias from entering theanalysis. In some cases, the full det

45、ection limit may be moreappropriate for the analyses, or substituting values derivedfrom a distribution function fit to the non-detected range, thatis appropriate given the distribution of the detected values.Zero is not usually used as a substitute because of the bias itintroduces to the analyses,

46、and potential underestimation of thestatistics involved. However, zero may be the most appropriatevalue in certain situations, as determined by best professionaljudgment. One example is the analysis of control samples, thatare known with a very high degree of confidence to be free ofthe chemical bei

47、ng analyzed, that is, zero concentration. Ifthere are more than approximately 25 % non-detects in the dataset, then the proportions of non-detects to the total sample sizefor each group are analyzed on a present/absent basis, and theanalysis is done on the proportions. If there are more thanapproxim

48、ately 50 % non-detects in the data set, the proportionscan be analyzed as above, or the data can be partitioned intodetects and non-detects. The detects group is then analyzed byitself, to reveal the information it holds.5.1.4 Descriptive StatisticsThe next step is to summarizethe information contai

49、ned in the data, by means of descriptivestatistics. First and foremost is the sample size or number ofobservations in the test, broken out by treatment groups,experimental units, or blocks, whatever is appropriate for thetest being analyzed. Other most common ones are measures ofcentral tendency and of dispersion within the data. Centraltendency measures are the mean, median (also known as the50th percentile), mode, and trimmed mean (also called Win-sorized mean). Dispersion measures are range, standard devia-tion, variance, and quantiles (

展开阅读全文