SPSS Tutorials Creating a Codebook
A codebook summarizes key information about the variables in a research project. This tutorial shows how to create a codebook from an existing SPSS datafile.
A codebook is a document containing information about each of the variables in your dataset, such as:
- The name assigned to the variable
- What the variable represents (i.e., its label)
- How the variable was measured (e.g. nominal, ordinal, scale)
- How the variable was actually recorded in the raw data (i.e. numeric, string; how many characters wide it is; how many decimal places it has)
- For scale variables: The variable's units of measurement
- For categorical variables: If coded numerically, the numeric codes and what they represent
Codebooks can also contain documentation about when and how the data was created. A good codebook allows you to communicate your research data to others clearly and succinctly, and ensures that the data is understood and interpreted properly.
Many codebooks are created manually; however, in SPSS, it's possible to generate a codebook from an existing SPSS datafile.
Creating a Codebook from an SPSS Datafile
This codebook method prints most of the information found in the Variable View window. It gives the names, labels, measurement levels, widths, formats, and any assigned missing values labels for every variable in the dataset. It also prints a table with the assigned value labels for categorical variables.
- Open the SPSS datafile.
- Click File > Display Data File Information > Working File.
- The codebook will print to the Output Viewer window.
This codebook method includes all of the same information as the simple method, but also includes options for printing summary statistics as well. Unlike the simple method, you can choose which variables are included in the codebook, and you can choose which variable properties are included in the summary. Also unlike the simple method, the summary information for each variable will be printed in its own table.
- Open the SPSS datafile.
- Click Analyze > Reports > Codebook.
- In the Variables tab: Add the variables you want in the codebook to the Codebook Variables box. To include all variables, click inside the Variables box, press Ctrl + A, then click the arrow button.
- In the Output tab: (Optional) Choose what variable and datafile properties you want to be included in the codebook:
- Variable information: By default, includes Position, Label, Type, Format, Measurement level, Role, Value labels, Missing values, and Custom attributes.
- File information: None included by default.
- Variable display order: By default, ordered identically to how the variables are ordered in the file. Can also order alphabetically, by file, or by measurement level.
- Maximum number of categories: By default, limits to 200 categories.
- In the Statistics tab: (Optional) Choose what statistics you want in the codebook. By default, counts and percents will be printed for nominal and ordinal variables, and mean, standard deviation, and quartiles will be printed for scale variables.
- When finished, click OK.
CODEBOOK <variables-names-here> /VARINFO POSITION LABEL TYPE FORMAT MEASURE ROLE VALUELABELS MISSING ATTRIBUTES /FILEINFO NAME CASECOUNT /OPTIONS VARORDER=VARLIST SORT=ASCENDING MAXCATS=200 /STATISTICS COUNT PERCENT MEAN STDDEV QUARTILES.
Note: When listing the variable names in the syntax, the assigned measurement level must be given in brackets after each variable name: [s] for scale, [n] for nominal, [o] for ordinal.