7: No. 1, January 2010
Optimized Probability Sampling of Study Sites to Improve Generalizability in a Multisite Intervention Trial
Jennifer L. Kraschnewski, MD; Thomas C. Keyserling, MD, MPH; Shrikant I. Bangdiwala, PhD; Ziya Gizlice, PhD; Beverly A. Garcia, MPH; Larry F. Johnston, MA; Alison Gustafson, RD, MPH; Lindsay Petrovic; Russell E. Glasgow, PhD; Carmen D. Samuel-Hodge, PhD, MS, RD
Suggested citation for this article: Kraschnewski JL, Keyserling TC, Bangdiwala SI, Gizlice Z, Garcia BA, Johnston LF, et al. Optimized probability sampling of study sites to improve generalizability in a multisite intervention trial. Prev Chronic Dis 2010;7(1):A10.
jan/09_0002.htm. Accessed [date].
Studies of type 2 translation, the adaption of evidence-based interventions to real-world settings, should include representative study sites and staff to improve external validity. Sites for such studies are, however, often selected by convenience sampling, which limits generalizability. We used an optimized probability sampling protocol to select an unbiased, representative sample of study sites to prepare for a randomized trial of a weight loss intervention.
We invited North Carolina health departments within 200 miles of the research center to participate (N = 81). Of the 43 health departments that were eligible, 30 were interested in participating. To select a representative and feasible sample of 6 health departments that met inclusion criteria, we generated all combinations of 6 from the 30 health departments that were eligible and interested. From the subset of combinations that met inclusion criteria, we selected 1 at random.
Of 593,775 possible combinations of 6 counties, 15,177 (3%) met inclusion criteria. Sites in the selected subset were similar to all eligible sites in terms of health department characteristics and county demographics.
Optimized probability sampling improved generalizability by ensuring an unbiased and representative sample of study sites.
Back to top
Community-based research is vital for successful type 2 translation — adapting evidence-based interventions to real-world settings (1-4). However, study design and methods can limit the generalizability or external validity of many community-based randomized controlled trials, which often focus on the efficacy of the intervention (efficacy trials) (2,5,6). In contrast, practical clinical trials (PCTs) evaluate the applicability and generalizability of research by including
representative participants, multiple and diverse settings, and a focus on measures relevant to decision makers (eg, cost, quality of life, participant reach, setting adoption) (7). PCTs can assess efficacious interventions for common conditions, such as obesity,
because they provide information relevant to type 2 translation (5,6,8).
One essential element of a PCT is the use of diverse and representative settings and staff in the delivery of the intervention (9). Setting-level representativeness is as necessary for PCTs as patient-level representativeness, although it is ignored in most study reports (5,9). This feature of PCTs is often absent in community-based research because sites are frequently chosen by convenience sampling, on the basis of perceived site motivation or interest, proximity, or staff quality or
resources, as opposed to probability sampling (8,10). In this way, convenience sampling can jeopardize conclusions regarding intervention effectiveness (11). Additionally, as opposed to regular clinic staff with competing demands and without special training, interventionists in non-PCTs are typically paid research staff, which further limits external validity (11).
We describe the process and outcomes of selecting sites for a research study designed to evaluate the type 2 translation of an intensive behavioral weight loss intervention designed for low-income women and conducted in county health departments. To improve study generalizability and meet PCT criteria, an optimized probability sampling protocol was used to select a representative sample of study sites for this project.
Back to top
The study was divided into 2 phases: an assessment and preparation period (phase I) and a randomized controlled trial (phase II). The goals of phase I were to 1) identify, recruit, and select representative study sites; 2) evaluate stakeholder characteristics, resources, and experience relevant to weight loss interventions; 3) train staff at each of the sites to deliver the intervention; and 4) evaluate the process of preparing each of the participating sites. The primary aim of phase II, a
randomized trial conducted at 6 county health departments with approximately 40 participants per site, was to assess the effectiveness of the intervention when implemented by health department staff in a community setting. This study reports on the first goal of phase I. Before site recruitment for phase I began, this component of the study was approved by the University of North Carolina institutional review board.
Health department recruitment
The intervention in this study was designed for delivery by county health department staff, so our goal was to recruit a representative sample of health departments. North Carolina has 100 counties; most are served by county health departments (n = 79), and some are served by regional health districts (n = 21). For logistic reasons, participation was limited to counties whose health department was located within 200 miles of Chapel Hill and whose population was more than 10,000, which
yielded 81 potential study sites (12).
Our recruitment efforts began with a presentation about the study at a meeting of North Carolina Public Health Incubator Collaboratives (http://nciph.sph.unc.edu/incubator/), which was attended by most county health directors. Application packets were distributed at this meeting (n = 17) or mailed to the health directors (n = 64) and included an informational brochure about the study, a memorandum of agreement, and an application form. We also mailed an invitation to the director of nursing
at each potential site. Additionally, we circulated a program announcement through e-mail lists to health directors, nursing directors, health educators, and health departments that participate in the North Carolina Breast and Cervical Cancer Control Program and WISEWOMAN (Well-Integrated Screening and Evaluation for Women Across the Nation). Approximately 3 weeks after we distributed application packets, we contacted each health department via telephone to confirm receipt and answer
Health departments were given approximately 6 weeks to complete the application form. We asked all 81 potential sites to respond to the application, even if they decided not to apply. For departments that did not return the packet, we attempted to follow up by telephone or e-mail at least 2 more times. Of the 81 potential sites, 13 did not respond, 25 indicated that they were not eligible to apply, 13 indicated that they were eligible but not interested, and 30 completed the application form
and signed the memorandum of agreement (Table 1).
Selecting study sites
Given the small number of sites (n = 6) that would make up the sample for the randomized trial, we felt that randomly selecting sites might not yield a representative sample of those eligible and interested or a logistically feasible sample (if many were located far from Chapel Hill). To ensure a representative and feasible study sample, we used an optimized probability sampling protocol to ensure the 6 health departments would have the following characteristics:
- No more than 1 health department from the same health district (21 counties are organized into large health districts that share staff; except in the case of health districts, health departments are organized at the county level, so these terms are used interchangeably).
- No more than 1 site with a bachelor’s-level health educator (vs dietitian, registered nurse, or master’s-level health educator) serving as the interventionist (only 4 of 30 counties had a bachelor’s-level health educator, so we did not want to oversample this type of interventionist).
- At least 3 sites with at least a 30% racial/ethnic minority population (to ensure a reasonably large minority population in at least 50% of participating sites).
- Two sites from each tertile of county population (we wanted sites to be representative of small, medium, and large counties).
- No more than 1 health department located more than 150 miles from Chapel Hill (logistically, it would be difficult to conduct the study with several sites located more than 150 miles from Chapel Hill).
Generating the probability sampling protocol
Using a SAS macro program (TS 498 Generating Combinations and Permutations, http://support.sas.com/techsup/technote/ts498.html), we generated all combinations of 6 counties from the 30 that agreed to participate (13,14). We then created a data set that listed only optimal combinations by including only the combinations that met all of the criteria outlined above. We used this set of combinations as the sampling frame and randomly chose 1 combination of counties by using SAS version 9.1.3
(SAS Institute, Inc, Cary, North Carolina), after specifying an initial seed for random number generation (14). If 1 of the selected health departments did not agree to participate or was not successful in enrolling the minimum number of participants, our plan was to identify the other optimal combinations that included the 5 participating health departments and select 1 combination at random from among them.
Meeting with study sites
After the 6 study sites were selected, we scheduled an on-site meeting with the interventionist at each health department to provide an overview of the study, describe what participation would involve, and review compensation for participation. We also obtained their written consent to participate in a research study and asked that they complete 2 written surveys. The Health Department Capacity Survey is a 9-item written questionnaire administered to the health director or a designee. The
survey asked questions about the health department’s staffing and services, programs specific to adult weight management, and other resources. The Interventionist Survey asked about the interventionist’s education and work experience, adult weight management experience, and perceived training needs. After this meeting, all 6 sites agreed to participate.
Back to top
Health departments most commonly cited inadequate target population size as the reason that they were not eligible (Table 1). The most common reasons that health departments were not interested in participating were too many competing demands, self-assessed inadequate resources or capacity for program implementation, and self-assessed inadequate staffing.
From 30 eligible and interested sites, we calculated 593,775 possible combinations of samples of 6 sites (30!/[30 − 6]!/6!) (14,15). After applying the 7 criteria, 15,177 combinations were considered optimal and retained in the sampling frame, approximately 3% of the original possible combinations
(Table 2). The most limiting criterion was having no more than 1 county 150 miles away. The least limiting criterion was requiring no more than 1 county in a health district.
Differences between departments by eligibility, interest, and selection for the study were generally small
(Table 3). Not interested and not eligible sites were closer to Chapel Hill than were interested sites and nonresponders. Interested sites had larger populations on average than did the other groups. The mean percentage minority population was lower in nonresponders than in the other groups. However, the mean per capita income, percentage below poverty, and percentage enrolled in
Medicaid varied minimally across groups. Nonresponding health departments were less likely to participate in the North Carolina Breast and Cervical Cancer Control Program or WISEWOMAN. These health departments also had smaller staffs on average and the smallest average county population.
The 6 selected sites’ characteristics varied minimally from the 30 total sites that were eligible and interested (Table 3). The mean distance from Chapel Hill was shorter for selected sites than overall. The mean county population was also less, as was the mean number of health department staff. The staff positions were similar, with the exception that fewer of the selected sites had a registered dietitian. The mean percentage minority, per capita income, and percentage enrolled in
Medicaid were similar between the groups.
Most of the selected sites (n = 5) offered patient education in diabetes, hypertension, and cholesterol in a group format. Additionally, most of the selected sites (n = 5) offered some type of adult weight management program, through either individual (n = 5) or group-based counseling (n = 4). Three sites reported collaborating or partnering with another agency to provide adult weight management services. Collaborating agencies included the Expanded Food and Nutrition Education Program (n =
5), faith-based organizations (n = 4), other state or local government agencies (n = 4), businesses (n = 3), employee groups (n = 3), hospitals or medical centers (n = 1), community health centers or clinics (n = 1), and YMCA/YWCA (n = 1).
All 6 interventionists had bachelor’s degrees or higher. Half of the interventionists had substantial experience working in public health (Table 4). Similarly, the interventionists had been employed at their respective health departments for different periods: 3 were established (14-20 y), and 3 were new (1-3 y). Only 1 had received special training in adult weight management, although 4 had developed, implemented, or evaluated a weight management intervention. One-third had not been
involved in a weight management program previously. Most had worked with the target population, low-income women aged 40 to 64 years, through health screening programs, minority health activities, or women’s health promotion activities.
Interventionists were also asked to rate training topics. Topics that were rated most important included behavior change principles, weight management counseling, weight management program development, and community organization and mobilization. Least important topics included body mass index measurement and general physical activity and weight management recommendations and guidelines for adults. The most salient perceived barrier to implementing a weight management program at their
respective sites was a lack of client interest (reported by 5 interventionists).
Back to top
Using an optimized probability sampling method, we selected 6 study sites that were representative of the larger sample of 30 potential study sites. The SAS macro used to accomplish this has been described in the literature for obtaining balance in cluster randomized trials (13-15). One study that used this method was part of the Aid First Initiative in Baltimore, Maryland (14). This trial measured incidence rate of admission to treatment facilities for drug dependence after an intervention.
Using the covariate-based constrained randomization allowed the investigators to obtain balance between census tracts (the unit of randomization) in terms of factors that could affect the outcome of interest, including geographic location, the percentage of vacant housing, and percentage of men employed (14). We have extended this approach to show that it is useful in selecting a probability sample of sites for participation in a type 2 translation clinical trial.
The major strength of using this technique is to improve external validity by increasing the representativeness of study sites and interventionists. This approach is distinctively different from convenience sampling (nonrandom site selection by the investigative team), which is most commonly used in multisite trials. Selection bias at the patient level is a risk that is often minimized in randomized controlled trials, but little research addresses this bias at the site level. The method
described here addresses this bias by allowing for random selection from a set of eligible, interested sites.
An additional strength of the proposed approach is that it allows for the selection of a combination of sites that meet prespecified criteria (for example, distance from research center, percentage minority), ensuring study sites have desired characteristics and are logistically feasible. This method also allows an alternative site to be randomly selected if an initial site withdraws or is unable to enroll enough participants. Although this approach is similar to stratification, it allows
more opportunity for similar sites to be chosen together by not forcing sites into strict strata. Stratification is also more difficult to implement when several factors define strata, especially when a small number of units is selected.
A major limitation in this study, and more generally in all clinical trials that focus on type 2 translation and enroll participants at multiple sites, is the lack of willingness of eligible and representative sites to participate. In this study, 43 of 81 potential counties were eligible, and of these, 30 (70%) were willing to participate. Because 30% of eligible sites did not agree to participate, our sample may not be fully representative of all potential study sites. An additional
limitation is that only 3% of all combinations of 6 sites met our inclusion criteria. However, our approach ensures that from the identified 15,177 acceptable combinations of 6 study sites, an unbiased set was selected.
Enhanced external validity is key to type 2 translational studies and practical clinical trials (2,6,11). Translational studies should look not only at the representativeness of the participants but also at the participating settings and intervention staff. The optimized probability sampling method described here is useful in identifying an unbiased and representative sample of study sites.
Back to top
This study was supported through funding by the Centers for Disease Control and Prevention (CDC) grant no. 5R18DP001144-02. Other support was provided by the University of North Carolina Prevention Research Center (Center for Health Promotion and Disease Prevention) through funding by CDC cooperative agreement no. U48/DP000059. Dr Kraschnewski is supported by the Health Resources and Services Administration through a National Research Service Award Primary Care Research Fellowship (5T32
Back to top
Corresponding Author: Carmen D. Samuel-Hodge, PhD, MS, RD, University of North Carolina at Chapel Hill, 1700 Martin Luther King Jr Blvd, CB 7426, Chapel Hill, NC 27599. Telephone: 919-966-0360. E-mail:
Author Affiliations: Jennifer L. Kraschnewski, Thomas C. Keyserling, Shrikant I. Bangdiwala, Ziya Gizlice, Beverly A. Garcia, Larry F. Johnston, Alison Gustafson, Lindsay Petrovic, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina; Russell E. Glasgow, Kaiser Permanente Colorado, Denver, Colorado.
Back to top
- Glasgow RE. Translating research to practice: lessons learned, areas for improvement, and future directions. Diabetes Care 2003;26:2451-6.
- Glasgow RE, Lichtenstein E, Marcus AC.
Why don’t we see more translation of health promotion research to practice? Rethinking the efficacy-to-effectiveness transition. Am J Public Health 2003;93:1261-7.
- Sung NS, Crowley WF Jr, Genel M, Salber P, Sandy L, Sherwood LM, et al.
Central challenges facing the national clinical research enterprise. JAMA 2003;289:1278-87.
- Woolf SH. The meaning of translational research and why it matters. JAMA 2008;299:211-3.
- Glasgow RE. RE-AIMing research for application: ways to improve evidence for family medicine. J Am Board Fam Med 2006;19:11-9.
- Tunis SR, Stryer DB, Clancy CM.
Practical clinical trials: increasing the value of clinical research for decision making in clinical and health policy. JAMA 2003;290:1624-32.
- Glasgow RE, Emmons KM.
How can we increase translation of research into practice? Types of evidence needed. Annu Rev Public Health 2007;28:413-33.
- Glasgow RE. What types of evidence are most needed to advance behavioral medicine? Ann Behav Med 2008;35:19-25.
- Glasgow RE, Klesges LM, Dzewaltowski DA, Bull SS, Estabrooks P.
The future of health behavior change research: what is needed to improve translation of research into health promotion practice? Ann Behav Med 2004;27:3-12.
- Dzewaltowski DA, Estabrooks PA, Klesges LM, Bull S, Glasgow RE.
Behavior change intervention research in community settings: how generalizable are the results? Health Promot Int 2004;19:235-45.
- Glasgow RE, Magid DJ, Beck A, Ritzwoller D, Estabrooks PA.
Practical clinical trials for translating research to practice: design and measurement recommendations. Med Care 2005;43:551-7.
- 2005 Revised county estimates. North Carolina Office of State Budget and Management. http://www.osbm.state.nc.us/ncosbm/facts_and_figures/socioeconomic_data/ population_estimates/county_estimates.shtm. Accessed August 14, 2009.
- Chaudhary MA, Moulton LH.
A SAS macro for constrained randomization of group-randomized designs. Comput Methods Programs Biomed 2006;83:205-10.
- Moulton LH. Covariate-based constrained randomization of group-randomized trials. Clin Trials 2004;1:297-305.
- Raab GM, Butcher I.
Balance in cluster randomized trials. Stat Med 2001;20:351-65.
Back to top