Methodology
This methodology provides detailed information on data collection, weighting, and imputation for the Small Business Credit Survey, and covers both employer and nonemployer data.
Data collection
The SBCS uses a convenience sample of establishments. A diverse set of partner organizations that serve the small business community contact businesses in their networks to invite participation in the survey.1 The Federal Reserve Banks also directly contact prior SBCS participants and other small businesses from a variety of email lists.2 The survey instrument is an online questionnaire that typically takes 8 to 12 minutes to complete, depending on the intensity of a firm’s search for financing. The questionnaire uses question branching and flows based on responses to survey questions. For example, financing applicants receive a different line of questioning than nonapplicants. Therefore, the number of observations for each question varies by how many firms receive and complete a particular question.
Weighting
Samples for the SBCS are not selected randomly; thus, the SBCS may be subject to biases that random sample surveys would not be subject to. For example, there are likely firms that are not on our contact lists, and this could lead to noncoverage bias. To control for some known potential biases, the sample data are weighted so that the weighted distribution of firms in the SBCS matches the distribution of the small firm population in the United States across a set of firm and owner characteristics.
Due to the structure and availability of US Census Bureau data used to compute the weights, employer firms and nonemployer firms are weighted in separate processes. Both employers and nonemployers are weighted by age, industry, geographic location (urban or rural location), gender of owner(s), and race or ethnicity of owner(s). Employer firms are additionally weighted by firm size (measured by number of employees) and census division.
We first limit the sample in each year to only employer or nonemployer firms. We then post-stratify respondents by their firm characteristics. Using a statistical technique known as “raking,” we compare the share of employer or nonemployer businesses in each category of each stratum (for example, within the industry stratum, the share of firms in the sample that are manufacturers) to the share of employer or nonemployer firms in the nation that are in that category. As a result, underrepresented firms in the convenience sample are up-weighted and overrepresented firms are down-weighted. We iterate this process several times for each stratum to derive a sample weight for each respondent. This weighting methodology was developed in collaboration with the National Opinion Research Center (NORC) at the University of Chicago. See the table below for sources of population distribution estimates used to construct the weights.
Race/ethnicity and gender imputation
Not every respondent provides complete information on the gender, race, and/or ethnicity of their business’s owner(s). We need this information to correct for differences between the sample and the population data. To avoid losing these observations, we use a series of statistical models to impute the missing data. Generally, when the models predict a given characteristic with an accuracy rate of around 80 percent, we use the predicted values from the models for the missing data. When the model’s accuracy rate is below that threshold, those data are not imputed, and the responses are dropped. After the models impute the data, we compare descriptive statistics of key survey questions with and without imputed data to ensure stability of estimates.
Comparisons across survey years
Changes to the questionnaire and weighting methodology limit over-time comparability on certain metrics. The time series data in the most recent SBCS reports supersede and are not necessarily comparable with the time series data in earlier publications. Please see the methodology sections from the annual releases of the Report on Employer Firms for a more complete explanation of the survey methodology and changes to the questionnaire by survey year.
| Employer Firms | ||
|---|---|---|
| Source | Weighting variable | Strata |
| US Census Bureau Business Dynamics Statistics (BDS) | Age | 0-2 years, 3-5 years, 6-10 years, 11-20 years, 21+ years |
| US Census Bureau Annual Business Survey (ABS) |
Race/ethnicity | Hispanic, non-Hispanic Asian, non-Hispanic Black or African American, non-Hispanic Native American, non-Hispanic white |
| Gender | Equally owned by men and women, men-owned, women-owned | |
| US Census Bureau County Business Pattern (CBP) | Industry | Business support and consumer services, finance and insurance, healthcare and education, leisure and hospitality, manufacturing, nonmanufacturing goods production and associated services, professional services and real estate, retail |
| Geography | Rural, urban | |
| Firm size | 1-4 employees, 5-9 employees, 10-19 employees, 20-49 employees, 50-499 employees | |
| Nonemployer Firms | ||
| Source | Weighting variable | Strata |
| US Census Bureau Survey of Business Owners (SBO) | Age | 0-2 years, 3-4 years, 5-12 years, 13+ years |
| US Census Bureau Nonemployer Statistics-Demographics (NES-D) | Race/ethnicity | Hispanic, non-Hispanic Asian, non-Hispanic Black or African American, non-Hispanic Native American, non-Hispanic white |
| Gender | Equally owned or men-owned, women-owned | |
| US Census Bureau Nonemployer Statistics (NES) | Industry | Business support and consumer services, finance and insurance, healthcare and education, leisure and hospitality, manufacturing, nonmanufacturing goods production and associated services, professional services and real estate, retail |
| Geography | Rural, urban | |
Endnotes
- For a full list of partner organizations, please visit our Partner Organizations page. Return to 1
- For more on email lists used in a specific survey year, see the methodology page in that year’s respective Report on Employer Firms. Return to 2