Please enable JavaScript.
Coggle requires JavaScript to display documents.
Stats, 2
Aggr, Ego Fallacy, Simp Para, ICC, Design Effect, 3
Nullmodell…
Stats
1
Multilevel Daten
Multilevel Daten
- sind hierarchische Daten, deshalb der Ausdruck "Multilevel" (mehrere Ebenen).
- ofc dep on Studiendesign/Fragestellung, the hierarchy can be either or (one over the other)
-
-
-
- sind Daten, in denen Abhängigkeiten vorkommen. Damit ist die wichtige Voraussetzung der statistischen Unabhängigkeit verletzt
2
Aggr, Ego Fallacy, Simp Para, ICC, Design Effect
-
Simpson‘s Paradox – Nichtberücksichtigung von Gruppen (Level-2) kann zu völlig anderen Resultaten führen.
- Beispiel – Arbeitszufriedenheit
- Verantwortung (Prädiktorvariable) und Arbeitszufriedenheit (Zielvariable)
-
- Zusammenhang zwischen Verantwortung und Arbeitszufriedenheit
- ⇒ neg korrel ⇔ Je höher die Verantwortung, desto tiefer die Arbeitszufriedenheit.
- ⇒ Simpson's Paradox
- Simpson‘s Paradox – Nichtberücksichtigung von Gruppen (Level-2) kann zu völlig anderen Resultaten führen.
- "if you dont consider the upper grp levels, you get a diff correl line than if you do consider the grps"
- eg tog Streuung looks like a neg correl, but considering the grps you get a pos correl per grp - and the MLM will show the avg of the pos correls per grp
why?:
- Die drei Firmen unterscheiden sich deutlich bzgl. der durchschnittlichen Verantwortung. (Verantw levels are diff btw the Firmen)
- wie auch der mittleren Arbeitszufriedenheit! (the Firmen means)
- again nicht Unabh, bc those in the same Firma have Abh!
- so u gotta make them Unabh then..
- by separating the Firmen. But now that you separate the Firmen tho it becomes the opp:
- ⇒ pos korrel found! ⇔ Je höher die Verantwortung, desto höher die Arbeitszufriedenheit
-
=> MLMs solve the problem!
- ⇒ Why not just not use the MLM and instead just take the grand avg of the separate Firmen instead. (take the ausgezogene Linien and calc the grand avg line. skipping the MLM step)
- ⇒ can and does work (typically similar results as MLM). but theres infoverlust.
- thats why u should use MLM to keep all info
- eg: theres diff avg Verantw levels for each Firma
- ⇒ abh exists btw Firma and Verantw
-
- eg Detkoeff of lin Reg (prediction acc)
Ecological Fallacy – Auf der Level-2 Ebene gefundene Zusammenhänge werden fälschlicherweise auf die Level-1 Ebene übertragen.
eg
- Why not we take the avg/mean of each Firma, and make a lin Reg.
- Durchschnittswerte pro Firma
- but now were looking at the avgs, meaning on another (upper) level.
- ⇒ no longer micro-micro, but macro-macro
- ⇒ Ecological Fallacy
- we in turn answer a diff q. Not the q that was asked (namely per person - micro, and not per firma - macro)
-
-
- Imagine two neighborhoods:
- Neighborhood
- Uptown
- % College Graduates―80%
- Average Income―$100,000
- Midtown
- % College Graduates―40%
- Average Income―$50,000
- You might look at this and say:
- “People with more education earn more money.”
- But this is the ecological fallacy because you’re assuming that individuals with more education in the group are earning more, when all you actually know is that the group averages are correlated.
- 👉 What if, within each neighborhood, the highest earners are actually those without degrees? Then your individual-level assumption would be wrong, even though the group data suggested otherwise.
- what if the 20% that arent graduated are the ones with the most income! (eg self made men)
- ⇒ its those who didnt graudate with high incomes that pull the avg income of the city up, not the college grads - altho the city graduate ratio makes it look that way!
- findings: the more Einkommen, the more no votes for 13. AHV
- ⇒ again the case of Ecological Fallacy
- it only applies to level 2, macro grp pro Gemeinde
- and not level 1, pro person
- meaning q is diff. not the more Einkommen per person but, the more Einkommen per Gemeinde ⇒ the more no's to 13. AHV
- ⇒ again need MLM to consider grp size, etc etc. Alleviating Eco Fallacy
ICC
- ICC measures Abh = [0, 1]
- "Class" = "Klasse" = Level 2 = macro level = "Gruppen"
- Der ICC gibt im Wesentlichen an, wie gross die Varianz zwischen den Gruppen im Vergleich zur Varianz innerhalb der Gruppen ist.
ICC = 𝜌I
= Var_zw/(Var_zw+Var_inn)
= Var_zw/totaleVar
- ⇔ Var_zw ~= Var_inn ⇔ values inn einer Gruppe sind unabh
- since ICC very low ⇔ low abh (to which grp you belong ⇔ doesnt matter which grp/school you belong, the measurements are "fair") ⇔ "Unabh" ⇔ Unilevel models usable, MLM not necc
- low ICC ⇔ need more Messwdh to reach high Reliabilität
- if unabh/ICC = 0 u dont have to use MLM. U can tho
- But Unilevel models (the classic ANOVA and lin Regs) can be used. (since Unabh satsif as usual)
- ⇔ Var_zw >> Var_inn ⇔ values inn Gruppe sind ähnlich; general values of other grps are diff!
- ICC high ⇔ high Abh (towards which grp/school you belong )⇔ the means are diff placed ⇔ need MLM
- high ICC ⇔ need less Messwdh to reach high Reliabilität
Design Effect
- Design effect answers the following q:
- ⇔ to have the same Standardfehler SE as if doing it the unabh way/Unilevel model,
how much larger/by which factor (= "design effect") larger should my SP size be so that my MLM has the same precision/SE as the unabh way/Unilevel model?
-
- ⇔ how often should i measure a var to see stability - no more Schwankungen (measurements lead to similar results) - measurement/result consistency
Reliabilität ( 𝜆j )
𝑛j = Gruppengrösse
- incr nj ⇒ decr sigma^2 ⇒ incr Realibilität to 1
- => better to have fewer but larger grps, than have many but small grps (same total SP size)
- ⇔ formula very sim to ICC formula
- eg, having one exam to eval math skills is not very realiable. since u dont see the consistency. 10 exams and then avg it is more realible than one exam's realiability
- same eg with Stimmung (has natural low Abh / ICC, since Stimmung very var at every moment)
-
-
5
RIM mit lvl 2, Zentrierung, Kontexteffekt, BFLPE
RIM mit lvl 2
exactly the same as RIM, except that the predictor x is now lvl 2 (means of lvl 1) => ̅ x.j
instead of lvl 1 => xij
Der Koeffizient 𝛾01 gibt an, um welchen Betrag sich y ändert, wenn der Wert von ̅ 𝑥.j um eine Einheit zunimmt.
-
Var_inn no longer exists/trivially is the same per grp, entire participant of the same grp share the same grp mean, trivially
-
BFLPE
- eg of a gigantic Kontexteffekt (big "effect" diff btw level 1 and level 2):
- heart rate correl with sport activity
- a Kontexteffekt involving neg correl level 2 and pos correl level 1 is what you call a BFLP effect...:
- at level 2, neg correl ("effekt")
- the more sport in avg, the lower the heart rate in avg
- bc sport athletes are used to the intensity, thus their heart is calmer / the avg heart rate, the more sport they do is reduced
- at level 1, pos correl ("effekt")
- the more sport, the higher the heart rate
- Ein positiver Zusammenhang auf der Level-1 Stufe, kombiniert mit einem negativen auf der Level-2 Stufe ist eine Art von Kontexteffekt und wird in diesem Zusammenhang auch Big-Fish-Little-Pond (BFLP) Effekt genannt.
- ⇔ pos correl level 1, neg correl level 2
- ⇔ big Kontexteffect (involving different Vorzeichen)
-
=> BFLPE only describes pos correl lvl 1, neg correl lvl 2, not other way around!
-
-
-