{"id":12070,"date":"2019-04-02T13:27:46","date_gmt":"2019-04-02T12:27:46","guid":{"rendered":"https:\/\/surveyinsights.org\/?p=12070"},"modified":"2023-07-13T08:54:08","modified_gmt":"2023-07-13T07:54:08","slug":"probability-and-nonprobability-sampling-representative-surveys-of-hard-to-reach-and-hard-to-ask-populations-current-surveys-between-the-poles-of-theory-and-practice","status":"publish","type":"post","link":"https:\/\/surveyinsights.org\/?p=12070","title":{"rendered":"Probability and Nonprobability Sampling: Representative Surveys of hard-to-reach and hard-to-ask populations. Current surveys between the poles of theory and practice"},"content":{"rendered":"<h1><strong>Introduction<\/strong><\/h1>\n<p>In times of increasing demand for evidence based data and accelerating social change, survey research faces numerous challenges. Survey research is still confronted with decreasing and stagnating response and cooperation rates. Furthermore, the emergence and diffusion of relatively new technologies in combination with different usage styles makes it necessary to adapt the framework in which surveys are administrated. Due to the rapid diffusion of the internet as communication platform, an increasing share of surveys is conducted online. This development intensifies the ongoing discourse about probability vs. non-probability sampling. While within survey research there is still a dispute between the representativeness of probability and non-probability sampling, research has long since been overtaken by reality. For example, serious and influential newspapers are now equally using non-probability samples for their surveys (which does not necessarily mean that this approach is valid). In this context research is needed to evaluate, under which conditions non-probability sample surveys can obtain valid and reliable information on characteristics such as attitudes and behaviors of a target population. A potentially valuable of non-probability samples could be the case of hard-to-reach populations. In this case, different non-probability sampling techniques, such as snowball sampling, respondent-driven sampling or quota sampling in non-probability web panels, could increase the integration of these hard-to-reach populations.<\/p>\n<p>Following Willis et al. (2014) typical hard-to-reach survey groups are: ethnic minorities, migrant populations, highly mobile populations, homeless and refugee populations, sexual minorities, populations affected by natural disasters or armed conflicts and stigmatized populations. Bonevski et al. (2014) found in a systematic literature review with 116 epidemiological studies, that the three most frequently investigated hard-to-reach groups are: ethnical\/racial groups, African American and substance users. Since those people are part of particularly vulnerable groups, it is of tremendous importance to collect valid data of their opinions, living conditions and needs. Therefore the question: \u201cHow do we reach the \u2018hard-to-reach\u2019?\u201d is crucial for survey methodologists. Various aspects in the survey process are affected by this question.<\/p>\n<p>In this sense Tourangeau (2014) summarized five theoretical categories:\u00a0Hard-to-reach survey populations are hard to reach because they are <strong>hard to sample, hard to identify, hard to find or contact, hard to persuade <\/strong>and <strong>hard to interview. <\/strong>Each of these reasons contribute to the Total Survey Error (Biemer, 2010). In this respect, populations which are considered hard to sample, hard to identify or hard to find\/contact can increase the coverage and selection error and thus undermine the representativeness of a survey. Furthermore hard-to-reach groups, which are considered hard to persuade can increase the non-response error and thus undermine the representativeness of a survey as well. Added to this, hard-to-interview survey groups can undermine the validity and reliability (if no appropriate adaptions are made in the measurement process) and thus increase the measurement error of a survey.<\/p>\n<h1><strong>Special Issue<\/strong><\/h1>\n<p>With the following collection of articles, \u2018Survey Methods: Insights from the Field\u2019 aims to give an overview about the current state of the hard-to-reach research and the ongoing dispute between the two above mentioned sampling methods. Moreover, this Special Issue attempts to combine theoretical discussions, methodological considerations with experiences from the fields. It offers insights into possible links between non-probability sampling and hard-to-reach populations on the one hand, and, on the other hand, different approaches to address the aforementioned problems via the praxis of each methodology.<\/p>\n<p>This special issue was inspired by a PUMA-Symposium 2017, which was organized by two of the guest editors of the Johannes Kepler University Linz (JKU), Johann Bacher and Andreas Quatember, within the PUMA-project of the Austrian social sciences. In this context different sampling issues and different solution attempts were discussed (<a href=\"https:\/\/puma.univie.ac.at\/home\/\">https:\/\/puma.univie.ac.at\/home\/<\/a>). The editorship by two researchers of the Robert Koch Institute (RKI), Johannes Lemcke und Patrick Schmich, was guided by the Institute\u00b4s interest to a better integration of the hard-to-reach groups into its health monitoring system. The RKI as the public health institute in Germany has the obligation to monitor the health status of the <em>whole population <\/em>in order to offer policy makers valid information within the decision process. As a result of this commitment, various feasibility studies were carried out, which are presented in this special issue alongside initial results. In this respect, the inclusion of elderly and people with migration background is a crucial task for the RKI.<\/p>\n<p><strong>The contributions can be divided into four groups<\/strong><\/p>\n<p>Of course, the assignment of a paper to a certain group is not distinct in every case. Nonetheless, the grouping below should provide some guidance.<\/p>\n<ol>\n<li>Two papers discuss the fundamental theoretical aspects of our topic.<\/li>\n<li>Two papers provide a comparison of probability and non-probability sampling from an applied perspective.<\/li>\n<li>Reports about experiences of concrete studies build the majority of the paper. Nine papers belong to this group. However, these are not only reports in a strict sense; they provide literature reviews, describe the designs and the underlying assumptions and reflect on the design. They cover migrants and refugees as hard-to-reach-groups mentioned by Willis et al. (2014) as well the elderly as one group sometimes ignored.<\/li>\n<li>The fourth group contains papers that can be labeled as \u201creflections and methodological proposals\u201d. One paper suggests a model to increase the recruitment of old people; the second one applies simulations methods.<\/li>\n<\/ol>\n<p>Short description of the papers:<\/p>\n<ol>\n<li><a href=\"https:\/\/surveyinsights.org\/?p=11203\"><strong>Quatember (University Linz) <\/strong><\/a>discusses the inferential quality of an available data set under the standard of the representativeness of a sample and focuses on the assumptions that are made, when calculating an estimator of a certain population characteristic using a specific sampling method. Thus the author offers a theoretical framing for the special issue. In the paper of <a href=\"https:\/\/surveyinsights.org\/?p=10981\"><strong>Kohler (University Potsdam)<\/strong><\/a>, the usability of non-probability and probability sampling is described under six different research scenarios, which are taken from the practice of empirical research in the social sciences. He considers the conditions, which allows the application of nonprobability sampling methods.<\/li>\n<li>In the second group\u00a0of papers, the article from <a href=\"https:\/\/surveyinsights.org\/?p=10433\"><strong>Geary et al.\u00a0<\/strong><strong>(London School of Hygiene and Tropical Medicine)\u00a0<\/strong><\/a>gives an impression about the shortcomings of probability sampling, when small populations are of interest (in this case men who have sex with men and people of Black African ethnicity in Britain). In this study the research tried to boost the sample size of these groups by utilizing non-probability sampling of a non-probability web-panel.\u00a0The contribution from <a href=\"https:\/\/surveyinsights.org\/?p=11959\"><strong>Marks (RTI International (formerly))\u00a0<\/strong><strong>&amp; Rhodes (RTI International) <\/strong><\/a>used a similar approach in a different hard-to-reach population (Africans, African Americans, Cambodians, Hispanics, Koreans, and Whites in Los Angeles in Study 1 and African American and White households with individual(s) who have been incarcerated in Study 2). In their paper the authors present the sampling strategy of these two studies.<\/li>\n<li>In the third group, the paper of <strong><a href=\"https:\/\/surveyinsights.org\/?p=10812\">Koschollek et al. (Robert Koch Institute, Berlin)\u00a0<\/a><\/strong>describes a project, in which a sample of migrants from sub-Saharan Africa is surveyed in Germany on highly stigmatized topic of HIV. The chosen approach involved members of the hard-to-reach target group throughout the entire process from the planning of the study to data collection. <a href=\"https:\/\/surveyinsights.org\/?p=10780\"><strong>Zeisl<\/strong><strong>er et al.\u00a0<\/strong><strong>(Robert Koch Institute, Berlin<\/strong><\/a> report about a multilingual feasibility study conducted in two German federal states. The target populations were people with migration background. Different modes of administration and interventions (study hotline, home visits) were used sequentially and evaluated whether they are able to increase participation. In the article from <a href=\"https:\/\/surveyinsights.org\/?p=10641\"><strong>Prandner &amp; Weichbold (University Linz &amp; University Salzburg)<\/strong><\/a>, the authors present lessons learned in the Austrian Immigrant Survey. In this study the research tackled the sampling issue of immigrants in Austria by building a sampling frame via an onomastic approach.\u00a0<strong><a href=\"https:\/\/surveyinsights.org?p=12000\">Hipp et al. (Berlin Social Science Center &amp; University Potsdam)<\/a> <\/strong>evaluated another sampling technique. The authors highlight recent findings from an ongoing research project, utilizing respondent-driven sampling. All the contributions in the following section deal with the problem of insufficient or missing sampling frames. In their contribution <a href=\"https:\/\/surveyinsights.org\/?p=10741\"><strong>Steinhauer et al. (Leibniz Institute for Educational Trajectories, Bamberg) <\/strong><\/a>also describe a sampling procedure for migrants. They provide insights of how to sample refugee Kindergarten children and students in secondary education. The contribution of <a href=\"https:\/\/surveyinsights.org\/?p=11416\"><strong>K\u00fchne et al. (University Bielefeld)\u00a0<\/strong><\/a>reports on a project to survey a nationwide sample of refugee households in Germany. As in other papers that focus on immigrants or refugees, problems such as the coverage gap of the Central Register of Foreigners or the high mobility of the target group are discussed. Moreover, the paper presents alternative survey instruments applied to this hard-to-reach subpopulation, a group that is hard to interview. The last three papers in the group discuss problems of hard-to-reach-population for older people. <a href=\"https:\/\/surveyinsights.org\/?p=10807\"><strong>Gaertner et al. (Robert Koch Institute, Berlin)<\/strong><\/a>, within a sequential mixed-mode design, compare different approaches to include elderly in nursing homes. Amongst the applied methods, face-to-face contact appears to be the only feasible contact mode if postal contact fails. In a second paper, <strong><a href=\"https:\/\/surveyinsights.org\/?p=10841\">Gaertner et al. (Robert Koch Institute, Berlin)<\/a> <\/strong>analyses the effects of their mixed-mode design in more detail and identify different reasons for non-participation. The last article in this group from <a href=\"https:\/\/surveyinsights.org?p=11064&amp;preview=true&amp;preview_id=11064\"><strong>Kutschar &amp; Weichbold (<\/strong><strong>Paracelsus Medical University &amp; University Salzburg<\/strong><strong>) <\/strong><\/a>addresses the aspect of interviewing elderly in nursing homes (and thus the category of hard-to-interview in the hard-to-reach framework). In their contribution, the authors evaluate the data quality in a survey of elderly people and examine, which respondent-, survey-, and item characteristics predict item non-response, as one indicator of data quality.<\/li>\n<li>Two papers were assigned to the last group of papers. Based on their experiences of a quantitative and qualitative study of older people, <strong><a href=\"https:\/\/surveyinsights.org\/?p=11822\">Kammerer et al. (Institute for Gerontological Research, Berlin) <\/a><\/strong>develop a model for recruitment to increase the participation of hard-to-persuade and hard-to-interview people. They label their model TIBAR, refereeing to the four important elements of the model, namely building trust, offering incentives, identifying barriers and being responsive. In the last paper of the Special Issue, <a href=\"https:\/\/surveyinsights.org\/?p=11275\"><strong>Schanze &amp; Zins (Leibniz Institute for the Social Sciences, Mannheim) <\/strong><\/a>used the framework of a simulation study to evaluate the risk of biased estimates, if elderly institutionalized populations are excluded from the sampling frame or are just insufficiently integrated. Thus this study gives an idea of what to expect if the used sampling frame suffers from undercoverage and describes a method to analyze the effects of sampling procedure by simulation studies.<\/li>\n<\/ol>\n<h2>In summary, the papers enable the following conclusions:<\/h2>\n<ul>\n<li>Random sampling is still the gold standard of survey methods. However, there are situations and scenarios where it is (i) not possible to draw a random sample, where (ii) other sampling techniques are appropriate for the analyzed research question and where (iii) other sampling techniques provide acceptable results \u2013 in some cases better results \u2013 in practice. Examples for all three circumstances are given in this volume.<\/li>\n<li>Random sampling is based on a set of assumptions. It is important to take these assumptions not for granted. They must be tested in the concrete research project. This requires to collect appropriate data for these task (see below).<\/li>\n<li>Sampling and non-random sampling are not mutually exclusive. In many cases, multi-stage or mixed-mode designs prove to be most successful; random and not-random sampling methods can be combined in praxis.<\/li>\n<li>It is important to document the research process and to collect additional information (e.g. metadata) that enables one to evaluate the quality of the realized sample. It should be kept in mind, that the hard-to-reach-problem is multi-dimensional. It cannot be reduced to sampling. Hence, information to all aspects that influence quality should be gathered.<\/li>\n<li>Finally, \u201cgood\u201d surveys require resources. Surveys of hard-to-reach-populations require more resources in order to reach the same quality as in surveys of the general population.<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h2><strong>Some remarks on the term \u201chard-to-reach\u201d<\/strong><\/h2>\n<p>At the end of the introduction the editors want to emphasize that the term \u201chard-to-reach\u201d is currently considered controversial by some researchers. To cite Nicola Brackertz: \u201dThe problem with using the term \u201dhard-to-reach\u201d is that it implies a homogeneity within distinct groups, which does not necessarily exist\u201d (Brackertz, 2007, p. 1). Furthermore this term could imply that the problem of reachability is one within the group of hard-to-reach populations and not within the researcher\u2019s approaches to survey hard-to-reach groups (Smith, 2006). There are certainly different social contexts and personality traits which make it hard to survey certain populations. This, however, \u00a0does not mean that the researches have a passive role. The opposite is true. It becomes just more expensive and resource demanding to integrate these populations. Willis et acknowledges \u00a0al. (2014) this fact with the remark: \u201cWe must recognize that, from the respondent\u2019s point of view, he or she may not be at all \u2018hard-to-reach.\u2019\u201d (Willis et al., 2014, p. 175).<\/p>\n<p>For these reasons, some researches offered different terms to describe the phenomenon of underrepresented survey groups. For example Atkinson &amp; Flint (2001) use \u201chidden populations\u201d as alternative. This term highlights the sampling issues in such populations. Nevertheless, we still use the term hard to reach in the special issue. We understand the term \u201chard to reach\u201d multidimensionally. This means that hard to reach groups are difficult to reach with conventional survey methods.<\/p>\n<p>&nbsp;<\/p>\n<p>Linz and Berlin in April 2019<\/p>\n<p><em>Johann Bacher<\/em><\/p>\n<p><em>Johannes Lemcke<\/em><\/p>\n<p><em>Andreas Quatember<\/em><\/p>\n<p><em>Patrick Schmich<\/em><\/p>\n<p><em>Editors of the special issue<\/em><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introduction In times of increasing demand for evidence based data and accelerating social change, survey research faces numerous challenges. Survey research is still confronted with decreasing and stagnating response and cooperation rates. Furthermore, the emergence and diffusion of relatively new technologies in combination with different usage styles makes it necessary to adapt the framework in [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[444],"tags":[],"class_list":["post-12070","post","type-post","status-publish","format-standard","hentry","category-probability-and-nonprobability-sampling"],"acf":[],"_links":{"self":[{"href":"https:\/\/surveyinsights.org\/index.php?rest_route=\/wp\/v2\/posts\/12070","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/surveyinsights.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/surveyinsights.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/surveyinsights.org\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/surveyinsights.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=12070"}],"version-history":[{"count":52,"href":"https:\/\/surveyinsights.org\/index.php?rest_route=\/wp\/v2\/posts\/12070\/revisions"}],"predecessor-version":[{"id":18831,"href":"https:\/\/surveyinsights.org\/index.php?rest_route=\/wp\/v2\/posts\/12070\/revisions\/18831"}],"wp:attachment":[{"href":"https:\/\/surveyinsights.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=12070"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/surveyinsights.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=12070"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/surveyinsights.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=12070"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}