Categorical Data Analysis

Categorical Data Analysis

Categorical Data Analysis Categorical data arise whenever counts (as opposed to measurements) are made. Subjects (sample items) are classified as belonging to one of a set of categories and the numbers in the categories (the frequencies) are recorded.

Example Eye colours: eye colours of males visiting an optician, in four categories Colour Frequency observed A 89

B 66 C D 60 85 Example Tonsils: Relationship between nasal carrier status for Streptococcus pyogenes and size of tonsils

among 1398 children aged 0-15 years. Normal Enlarged Much enlarged Total Carriers

19 29 24 72

Noncarriers 497 560 269 1326

516 589 293 1398

Total Example Prussian cavalry deaths: numbers of cavalry soldiers killed by horsekicks in each of 14 units of the Prussian army over a 20-year period (1875-1894). Number killed 0

5 1 2 3 4

Frequency observed 144 91 32

11 2 Total 0 280

Often we wish to decide whether the categorical variables follow some well known distribution A chi-squared test will provide a method of testing the hypothesis that a data set follows a particular distribution. Often we wish to decide whether the

categorical variables follow some well known distribution A chi-squared test will provide a method of testing the hypothesis that a data set follows a particular distribution. It works by summing the quantity (Observed Expected)2/Expected The chi-squared test in the R program is fairly

limited it copes well with testing whether there is a significant relationship between nasal carrier status for Streptococcus pyogenes and size of tonsils among 1398 children aged 0-15 years (as in the second example) but gives us a problem with the other two. Consider now data from Standard and Poors 500 - an index of 500 of the largest, most

actively traded stocks on the New York Stock Exchange These data are available in R as sp500.R from the module website. Technique: To look at any one of the variables in a data frame such as sp.500, the $ sign is helpful.

Without attaching the data, typing adjclose produces nothing. Instead use >sp500$adjclose or >plot(sp500$adjclose) We are interested in the distribution of the

change in returns from day to day. We suspect that the logs of these changes may follow a normal distribution. These are placed in an R vector by using the command >d=diff(log(sp500$adjclose)) The chisq function that is pre-defined in R is

not powerful enough to test the values of d to see if they conform to a normal distribution, so a program is written instead. We wish to test whether a normal distribution with the same mean and standard deviation of d will look similar to this histogram. Calculate, for example, the approximate

expected number between -0.04 and -0.02 by This can be repeated and made more sophisticated with more than 4 comparisons by writing a program. The one considered has 100 comparisons.

Recently Viewed Presentations

  • Chapter 21: Protists and Fungi - Tenafly Public Schools

    Chapter 21: Protists and Fungi - Tenafly Public Schools

    Chapter 21: Protists and Fungi Section 21-4: Fungi What are Fungi? Heterotrophs - produce enzymes that digest food outside their bodies, then absorb the nutrients Most feed on decaying material in the soil, some are parasitic Cells walls made of...
  • 117626472v1 EBOLA EPIDEMIC IN THE COMMERCIAL SHIPPING CONTEXT
  • Cabang Dan Macam Aliran Seni Rupa

    Cabang Dan Macam Aliran Seni Rupa

    MACAM-MACAM ALIRAN SENI RUPA Aliran Klasikisme (Classicisme) Aliran Romantisme Aliran Realisme Aliran Naturalisme Aliran Impresionisme Aliran Ekspresionisme Aliran Kubisme Aliran Futurisme Aliran Fauvisme Aliran Dadaisme Aliran Suryalisme Aliran Abstrak Aliran Klzsikisme/classicisme/klasik: Aliran yang memiliki nilai klasik/ keunggulan ...
  • Agenda Topic - Ohio University

    Agenda Topic - Ohio University

    Final FY19 Payroll Expense Accounting Correction Forms are due no later than Wednesday, July 24. Any corrections received after this date may not processed. Payroll only has until Tuesday, July 30 to process them. Final FY19 Payroll Accounting Corrections.
  • CounterReformation

    CounterReformation

    The Counter Reformation The Defense of Tradition The Jesuits Artistic Reproductions Military Policies Failure of Church Reform Protestant Politics and Military Protestatio Smalkaldic Wars: Habsburg Hegemony Wars of Religion: French Real Politique Virgin of the Jesuits 1534 Established by Ignatius...
  • Community Interventions to Promote Health and Prevent Chronic ...

    Community Interventions to Promote Health and Prevent Chronic ...

    Obesity among WIC Children Age 2-5 by Race/ethnicity, Missouri, 2000-2013. Prevalence (%) ... > 150 mg/dL (1.7 mmol/L), or specific treatment for this lipid abnormality ... Community Interventions to Promote Health and Prevent Chronic Disease
  • Welcome to the PTAC Kernel Training Tuesday August 15, 2017 ...

    Welcome to the PTAC Kernel Training Tuesday August 15, 2017 ...

    Online Selling. Make account registration part of your kickoff. Pick a location that has access to Wifi and ask parents to bring a tablet or smartphone (have extra laptops available if possible)
  • RCIA with Children - iss.rcec.london.on.ca

    RCIA with Children - iss.rcec.london.on.ca

    penitential rites or scrutinies… and the celebration of the sacraments of initiation… corresponding to the periods of adult initiation are the periods of the children's catechetical formation that lead up to and follow the steps of the initiation" (no. 253).