Categorical Data Analysis.ppt

上传人:orderah291 文档编号:379370 上传时间:2018-10-09 格式:PPT 页数:44 大小:178.50KB
下载 相关 举报
Categorical Data Analysis.ppt_第1页
第1页 / 共44页
Categorical Data Analysis.ppt_第2页
第2页 / 共44页
Categorical Data Analysis.ppt_第3页
第3页 / 共44页
Categorical Data Analysis.ppt_第4页
第4页 / 共44页
Categorical Data Analysis.ppt_第5页
第5页 / 共44页
亲,该文档总共44页,到这儿已超出免费预览范围,如果喜欢就下载吧!
资源描述

1、Categorical Data Analysis,Independent (Explanatory) Variable is Categorical (Nominal or Ordinal)Dependent (Response) Variable is Categorical (Nominal or Ordinal)Special Cases: 2x2 (Each variable has 2 levels) Nominal/Nominal Nominal/Ordinal Ordinal/Ordinal,Contingency Tables,Tables representing all

2、combinations of levels of explanatory and response variables Numbers in table represent Counts of the number of cases in each cell Row and column totals are called Marginal counts,Example EMT Assessment of Kids,Explanatory Variable Child Age (Infant, Toddler, Pre-school, School-age, Adolescent) Resp

3、onse Variable EMT Assessment (Accurate, Inaccurate),Source: Foltin, et al (2002),2x2 Tables,Each variable has 2 levels Explanatory Variable Groups (Typically based on demographics, exposure, or Trt) Response Variable Outcome (Typically presence or absence of a characteristic) Measures of association

4、 Relative Risk (Prospective Studies) Odds Ratio (Prospective or Retrospective) Absolute Risk (Prospective Studies),2x2 Tables - Notation,Relative Risk,Ratio of the probability that the outcome characteristic is present for one group, relative to the other Sample proportions with characteristic from

5、groups 1 and 2:,Relative Risk,Estimated Relative Risk:,95% Confidence Interval for Population Relative Risk:,Relative Risk,Interpretation Conclude that the probability that the outcome is present is higher (in the population) for group 1 if the entire interval is above 1 Conclude that the probabilit

6、y that the outcome is present is lower (in the population) for group 1 if the entire interval is below 1 Do not conclude that the probability of the outcome differs for the two groups if the interval contains 1,Example - Coccidioidomycosis and TNFa-antagonists,Research Question: Risk of developing C

7、occidioidmycosis associated with arthritis therapy?Groups: Patients receiving tumor necrosis factor a (TNFa) versus Patients not receiving TNFa (all patients arthritic),Source: Bergstrom, et al (2004),Example - Coccidioidomycosis and TNFa-antagonists,Group 1: Patients on TNFaGroup 2: Patients not on

8、 TNFa,Entire CI above 1 Conclude higher risk if on TNFa,Odds Ratio,Odds of an event is the probability it occurs divided by the probability it does not occur Odds ratio is the odds of the event for group 1 divided by the odds of the event for group 2 Sample odds of the outcome for each group:,Odds R

9、atio,Estimated Odds Ratio:,95% Confidence Interval for Population Odds Ratio,Odds Ratio,Interpretation Conclude that the probability that the outcome is present is higher (in the population) for group 1 if the entire interval is above 1 Conclude that the probability that the outcome is present is lo

10、wer (in the population) for group 1 if the entire interval is below 1 Do not conclude that the probability of the outcome differs for the two groups if the interval contains 1,Example - NSAIDs and GBM,Case-Control Study (Retrospective) Cases: 137 Self-Reporting Patients with Glioblastoma Multiforme

11、(GBM) Controls: 401 Population-Based Individuals matched to cases wrt demographic factors,Source: Sivak-Sears, et al (2004),Example - NSAIDs and GBM,Interval is entirely below 1, NSAID use appears to be lower among cases than controls,Absolute Risk,Difference Between Proportions of outcomes with an

12、outcome characteristic for 2 groups Sample proportions with characteristic from groups 1 and 2:,Absolute Risk,Estimated Absolute Risk:,95% Confidence Interval for Population Absolute Risk,Absolute Risk,Interpretation Conclude that the probability that the outcome is present is higher (in the populat

13、ion) for group 1 if the entire interval is positive Conclude that the probability that the outcome is present is lower (in the population) for group 1 if the entire interval is negative Do not conclude that the probability of the outcome differs for the two groups if the interval contains 0,Example

14、- Coccidioidomycosis and TNFa-antagonists,Group 1: Patients on TNFaGroup 2: Patients not on TNFa,Interval is entirely positive, TNFa is associated with higher risk,Fishers Exact Test,Method of testing for association for 2x2 tables when one or both of the group sample sizes is small Measures (condit

15、ional on the group sizes and number of cases with and without the characteristic) the chances we would see differences of this magnitude or larger in the sample proportions, if there were no differences in the populations,Example Echinacea Purpurea for Colds,Healthy adults randomized to receive EP (

16、n1.=24) or placebo (n2.=22, two were dropped) Among EP subjects, 14 of 24 developed cold after exposure to RV-39 (58%) Among Placebo subjects, 18 of 22 developed cold after exposure to RV-39 (82%) Out of a total of 46 subjects, 32 developed cold Out of a total of 46 subjects, 24 received EP,Source:

17、Sperber, et al (2004),Example Echinacea Purpurea for Colds,Conditional on 32 people developing colds and 24 receiving EP, the following table gives the outcomes that would have been as strong or stronger evidence that EP reduced risk of developing cold (1-sided test). P-value from SPSS is .079.,Exam

18、ple - SPSS Output,McNemars Test for Paired Samples,Common subjects being observed under 2 conditions (2 treatments, before/after, 2 diagnostic tests) in a crossover setting Two possible outcomes (Presence/Absence of Characteristic) on each measurement Four possibilities for each subjects wrt outcome

19、: Present in both conditions Absent in both conditions Present in Condition 1, Absent in Condition 2 Absent in Condition 1, Present in Condition 2,McNemars Test for Paired Samples,McNemars Test for Paired Samples,H0: Probability the outcome is Present is same for the 2 conditions HA: Probabilities d

20、iffer for the 2 conditions (Can also be conducted as 1-sided test),Example - Reporting of Silicone Breast Implant Leakage in Revision Surgery,Subjects - 165 women having revision surgery involving silicone gel breast implants Conditions (Each being observed on all women) Self Report of Presence/Abse

21、nce of Rupture/Leak Surgical Record of Presence/Absence of Rupture/Leak,Source: Brown and Pennello (2002),Example - Reporting of Silicone Breast Implant Leakage in Revision Surgery,H0: Tendency to report ruptures/leaks is the same for self reports and surgical records HA: Tendencies differ,Pearsons

22、Chi-Square Test,Can be used for nominal or ordinal explanatory and response variables Variables can have any number of distinct levels Tests whether the distribution of the response variable is the same for each level of the explanatory variable (H0: No association between the variables r = # of lev

23、els of explanatory variable c = # of levels of response variable,Pearsons Chi-Square Test,Intuition behind test statistic Obtain marginal distribution of outcomes for the response variable Apply this common distribution to all levels of the explanatory variable, by multiplying each proportion by the

24、 corresponding sample size Measure the difference between actual cell counts and the expected cell counts in the previous step,Pearsons Chi-Square Test,Notation to obtain test statistic Rows represent explanatory variable (r levels) Cols represent response variable (c levels),Pearsons Chi-Square Tes

25、t,Marginal distribution of response and expected cell counts under hypothesis of no association:,Pearsons Chi-Square Test,H0: No association between variables HA: Variables are associated,Example EMT Assessment of Kids,Observed Expected,Example EMT Assessment of Kids,Note that each expected count is

26、 the row total times the column total, divided by the overall total. For the first cell in the table:,The contribution to the test statistic for this cell is,Example EMT Assessment of Kids,H0: No association between variables HA: Variables are associated,Reject H0, conclude that the accuracy of asse

27、ssments differs among age groups,Example - SPSS Output,Ordinal Explanatory and Response Variables,Pearsons Chi-square test can be used to test associations among ordinal variables, but more powerful methods exist When theories exist that the association is directional (positive or negative), measure

28、s exist to describe and test for these specific alternatives from independence: Gamma Kendalls tb,Concordant and Discordant Pairs,Concordant Pairs - Pairs of individuals where one individual scores “higher” on both ordered variables than the other individual Discordant Pairs - Pairs of individuals w

29、here one individual scores “higher” on one ordered variable and the other individual scores “higher” on the other C = # Concordant Pairs D = # Discordant Pairs Under Positive association, expect C D Under Negative association, expect C D Under No association, expect C D,Example - Alcohol Use and Sic

30、k Days,Alcohol Risk (Without Risk, Hardly any Risk, Some to Considerable Risk) Sick Days (0, 1-6, 7) Concordant Pairs - Pairs of respondents where one scores higher on both alcohol risk and sick days than the other Discordant Pairs - Pairs of respondents where one scores higher on alcohol risk and t

31、he other scores higher on sick days,Source: Hermansson, et al (2003),Example - Alcohol Use and Sick Days,Concordant Pairs: Each individual in a given cell is concordant with each individual in cells “Southeast” of theirs Discordant Pairs: Each individual in a given cell is discordant with each indiv

32、idual in cells “Southwest” of theirs,Example - Alcohol Use and Sick Days,Measures of Association,Goodman and Kruskals Gamma:,Kendalls tb:,When theres no association between the ordinal variables, the population based values of these measures are 0. Statistical software packages provide these tests.,Example - Alcohol Use and Sick Days,

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 教学课件 > 大学教育

copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1