Please enable JavaScript.
Coggle requires JavaScript to display documents.
Unit 4: Producing Data (Samples and Surveys (Bias (Definition (Bias is…
Unit 4: Producing Data
-
Samples and Surveys
Sampling Error
Convenience sampling
-
Bias: Generally over or under predicts the value since using data from one small location does not accurately predict for a large location
Example: A survey of one's Instagram followers would be simple, because they are all in the same place. However, this would lead to bias, as one's followers are typically a part of the same demographic.
-
Under-coverage
Definition: When some individuals are not included in the data since they are left out of the process of choosing the sample
-
Example: A list of students taken from the parking lot registration. This would be an example of under-coverage, because this population does not include the students who either get a ride to school or take the bus.
Non-Sampling Error
Non-response Bias
-
Example: If a survey is conducted on how many hours per week the an adult in Massachusetts sleeps on a sample of 10,000 randomly selected but 2,000 of the individuals do not respond, they data would not accurately predict the number of values
Wording Bias
-
Example: A survey question about whether a particular children's toy store should be shut down asks, "Do you want your children to live in a world without toys?" This wording would lead the surveyed people to the obvious answer, "No", even if they do not support this particular restaurant.
Response Bias
-
Example: A survey question has two answers, which vary only in the fact that one says "would" and the other says "wouldn't". The individual taking the survey moves too quickly and cannot see the difference, so just chooses one of the choices.
Bias
-
Sources
Bias in samples and surveys can be caused by the sampling and non-sampling errors, which are listed above.
Direction
Bias in samples and surveys can cause the sample to either overpredict or underpredict the actual population. Surveys should be representative.
Example: When trying to determine which presidential candidate is the most supported in the US, a poll was taken. However the poll was only taken in Massachusetts. This includes bias because it leaves out a large majority of voters from the rest of the country.
Good Sampling Techniques
Stratified SRS
Advantages: more representative of the population's different perspectives than simple random sampling, takes variation between different people into account
Disadvantages: Finding the characteristics to separate the population by can be difficult, difficult to put people into specific boxes, as a result of difficulty can be very time consuming
Definition: the population of individuals is classified into groups of similar individuals, then a separate SRS is chosen in each stratum and all SRSs are combined to form the full sample
Cluster Sampling
Advantages: more convenient than stratified SRS, different within each cluster
Disadvantages: small clusters may not represent whole population (ex. if clusters are formed by homerooms that are separated by grade and more senior homerooms are selected than freshman it would not accurately represent the entire school), convenience sampling error
Definition: when the population is divided into smaller groups that mirror the characteristics of the population, then clusters are chosen randomly to be surveyed
SRS
Advantages: doesn't favor any individuals, works well for small samples, uses tables of random digits to select data randomly
Disadvantages: time-consuming with large samples or if the individuals are spread out over a large area, doesn't evenly represent individuals (more males could be selected than females)
-
-