This section describes the code needed to specify sample design parameters in SAS Survey procedures. The example uses the SAS Survey procedure proc surveymeans, and the dataset is named Demo4_nh2. Proc surveymeans serves only as a generic example; the strata, cluster, and weight statements apply to all SAS Survey procedures.
When using SAS Survey procedures, the input dataset must be identified. Unlike in SUDAAN, however, the dataset does not have to be presorted by the sample design variables. Instead, the design variables (strata and PSU) are specified in subsequent steps.
The strata statement (SAS also accepts the keyword stratum) names the variables that form the strata. In NHANES II, the variable that identifies the sample strata is named N2AH0324.
The cluster statement names the variables that identify the clusters in a clustered sample design such as NHANES II. Because NHANES II analyses also require a strata statement, SAS Survey procedures treat the clusters as nested within the strata.
In NHANES II, the variable that identifies the sample clusters (primary sampling units, or PSUs) is named N2AH0326.
In NHANES II, a sample weight is assigned to each sample participant. The sample weight is a measure of the number of individuals in the target population that the sampled individual represents. Sample weights are needed to obtain unbiased estimates of population parameters when the sample participants are chosen with unequal probabilities (see module on Weighting for more details).
The weight statement, which identifies the sample weight, is required in SAS Survey procedures for all NHANES analyses. In this example, the examined sample weight (N2AH0282) is used.
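To make the role of the sample weight concrete, a weighted estimate of a population mean can be sketched as follows. The symbols are not from the original text and are introduced here only for illustration: w_i is the sample weight of participant i (N2AH0282 in this example), y_i is that participant's value of the analysis variable, and n is the number of sampled participants.

```latex
\hat{\bar{y}} = \frac{\sum_{i=1}^{n} w_i \, y_i}{\sum_{i=1}^{n} w_i}
```

When all weights are equal, this reduces to the ordinary sample mean. When they are unequal, participants who represent more individuals in the target population contribute proportionally more to the estimate.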
The following table shows how to combine the statements described above to specify the sample design parameters and sample weights in SAS Survey procedures. Proc surveymeans is used as an example, but the strata, cluster, and weight statements are used in the same manner in all SAS Survey procedures. These steps cover only the most basic statements needed to account for the complex sample design of NHANES II. Procedure options can be added to customize the variance estimates, statistics, and output to suit individual analytic needs. Consult the SAS/STAT manual for the options available in each SAS Survey procedure.
Use the SAS Survey procedure proc surveymeans to calculate means and standard errors, and specify the dataset (Demo4_nh2).
Use the strata statement to specify the strata (N2AH0324). This accounts for the effect of stratification in the sample design.
Use the cluster statement to specify the primary sampling unit (N2AH0326). This accounts for the effect of clustering in the sample design.
Use the weight statement to account for the unequal probability of sampling, survey nonresponse, and adjustments to population control totals. In this example, the examined sample weight (N2AH0282) is used.
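Put together, the four steps above might look like the following minimal sketch. The dataset and variable names are the ones given in this section. The sketch assumes that Demo4_nh2 is already available in the current SAS session (for example, in the WORK library). No analysis variable is named in this section, so no var statement is shown.

```sas
/* Minimal sketch: basic design statements for NHANES II.            */
/* Assumes the dataset Demo4_nh2 already exists in the SAS session.  */
proc surveymeans data=Demo4_nh2;
    strata  N2AH0324;   /* sample strata                              */
    cluster N2AH0326;   /* PSUs, treated as nested within the strata  */
    weight  N2AH0282;   /* examined sample weight                     */
    /* A var statement naming the analysis variable(s) would usually */
    /* go here; this section does not name one.                      */
run;
```

In practice, a var statement would normally be added to restrict the output to the variables of interest. The same strata, cluster, and weight statements should carry over unchanged to the other SAS Survey procedures.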
Reference: SAS Institute Inc., SAS/STAT User's Guide, Version 9.1; see: Survey Means Procedure