LibGuides: SPSS Tutorials: Independent Samples t Test

Independent Samples t Test

The Independent Samples t Test compares the means of two independent groups in order to determine whether there is statistical evidence that the associated population means are significantly different. The Independent Samples t Test is a parametric test.

This test is also known as:

Independent t Test
Independent Measures t Test
Independent Two-sample t Test
Student t Test
Two-Sample t Test
Uncorrelated Scores t Test
Unpaired t Test
Unrelated t Test

The variables used in this test are known as:

Dependent variable, or test variable
Independent variable, or grouping variable

Common Uses

The Independent Samples t Test is commonly used to test the following:

Statistical differences between the means of two groups
Statistical differences between the means of two interventions
Statistical differences between the means of two change scores

Note: The Independent Samples t Test can only compare the means for two (and only two) groups. It cannot make comparisons among more than two groups. If you wish to compare the means across more than two groups, you will likely want to run an ANOVA.

Data Requirements

Your data must meet the following requirements:

Dependent variable that is continuous (i.e., interval or ratio level)
Independent variable that is categorical (i.e., nominal or ordinal) and has exactly two categories
Cases that have nonmissing values for both the dependent and independent variables
Independent samples/groups (i.e., independence of observations)
- There is no relationship between the subjects in each sample. This means that:
  - Subjects in the first group cannot also be in the second group
  - No subject in either group can influence subjects in the other group
  - No group can influence the other group
- Violation of this assumption will yield an inaccurate p value
Random sample of data from the population
Normal distribution (approximately) of the dependent variable for each group
- Non-normal population distributions, especially those that are thick-tailed or heavily skewed, considerably reduce the power of the test
- Among moderate or large samples, a violation of normality may still yield accurate p values
Homogeneity of variances (i.e., variances approximately equal across groups)
- When this assumption is violated and the sample sizes for each group differ, the p value is not trustworthy. However, the Independent Samples t Test output also includes an approximate t statistic that is not based on assuming equal population variances. This alternative statistic, called the Welch t Test statistic¹, may be used when equal variances among populations cannot be assumed. The Welch t Test is also known an Unequal Variance t Test or Separate Variances t Test.
No outliers

Note: When one or more of the assumptions for the Independent Samples t Test are not met, you may want to run the nonparametric Mann-Whitney U Test instead.

Researchers often follow several rules of thumb:

Each group should have at least 6 subjects, ideally more. Inferences for the population will be more tenuous with too few subjects.
A balanced design (i.e., same number of subjects in each group) is ideal. Extremely unbalanced designs increase the possibility that violating any of the requirements/assumptions will threaten the validity of the Independent Samples t Test.

¹Welch, B. L. (1947). The generalization of "Student's" problem when several different population variances are involved. Biometrika, 34(1–2), 28–35.

Hypotheses

The null hypothesis (H₀) and alternative hypothesis (H₁) of the Independent Samples t Test can be expressed in two different but equivalent ways:

H₀: µ₁ = µ₂ ("the two population means are equal")
H₁: µ₁ ≠ µ₂ ("the two population means are not equal")

H₀: µ₁ - µ₂ = 0 ("the difference between the two population means is equal to 0")
H₁: µ₁ - µ₂ ≠ 0 ("the difference between the two population means is not 0")

where µ₁ and µ₂ are the population means for group 1 and group 2, respectively. Notice that the second set of hypotheses can be derived from the first set by simply subtracting µ₂ from both sides of the equation.

Levene’s Test for Equality of Variances

Recall that the Independent Samples t Test requires the assumption of homogeneity of variance -- i.e., both groups have the same variance. SPSS conveniently includes a test for the homogeneity of variance, called Levene's Test, that can be requested when you run an independent samples t test.

Prior to SPSS Statistics version 31, Levene's Test was produced automatically as part of the Independent Samples t Test output. As of SPSS Statistics version 31, Levene's Test is now optional output that must be requested.

The hypotheses for Levene’s test are:

H₀: σ₁² - σ₂² = 0 ("the population variances of group 1 and 2 are equal")
H₁: σ₁² - σ₂² ≠ 0 ("the population variances of group 1 and 2 are not equal")

This implies that if we reject the null hypothesis of Levene's Test, it suggests that the variances of the two groups are not equal; i.e., that the homogeneity of variances assumption is violated.

The output in the Independent Samples Test table includes two rows: Equal variances assumed and Equal variances not assumed. If Levene’s test indicates that the variances are equal across the two groups (i.e., p-value large), you will rely on the first row of output, Equal variances assumed, when you look at the results for the actual Independent Samples t Test (under the heading t-test for Equality of Means). If Levene’s test indicates that the variances are not equal across the two groups (i.e., p-value small), you will need to rely on the second row of output, Equal variances not assumed, when you look at the results of the Independent Samples t Test (under the heading t-test for Equality of Means).

The difference between these two rows of output lies in the way the independent samples t test statistic is calculated. When equal variances are assumed, the calculation uses pooled variances; when equal variances cannot be assumed, the calculation utilizes un-pooled variances and a correction to the degrees of freedom.

Test Statistic

The test statistic for an Independent Samples t Test is denoted t. There are actually two forms of the test statistic for this test, depending on whether or not equal variances are assumed. SPSS produces both forms of the test, so both forms of the test are described here. Note that the null and alternative hypotheses are identical for both forms of the test statistic.

Equal variances assumed

When the two independent samples are assumed to be drawn from populations with identical population variances (i.e., σ₁² = σ₂²) , the test statistic t is computed as:

$$ t = \frac{\overline{x}_{1} - \overline{x}_{2}}{s_{p}\sqrt{\frac{1}{n_{1}} + \frac{1}{n_{2}}}} $$

with

$$ s_{p} = \sqrt{\frac{(n_{1} - 1)s_{1}^{2} + (n_{2} - 1)s_{2}^{2}}{n_{1} + n_{2} - 2}} $$

Where

$\bar{x}_{1}$ = Mean of first sample
$\bar{x}_{2}$ = Mean of second sample
$n_{1}$ = Sample size (i.e., number of observations) of first sample
$n_{2}$ = Sample size (i.e., number of observations) of second sample
$s_{1}$ = Standard deviation of first sample
$s_{2}$ = Standard deviation of second sample
$s_{p}$ = Pooled standard deviation

The calculated t value is then compared to the critical t value from the t distribution table with degrees of freedom df = n₁ + n₂ - 2 and chosen confidence level. If the calculated t value is greater than the critical t value, then we reject the null hypothesis.

Note that this form of the independent samples t test statistic assumes equal variances.

Because we assume equal population variances, it is OK to "pool" the sample variances (s_p). However, if this assumption is violated, the pooled variance estimate may not be accurate, which would affect the accuracy of our test statistic (and hence, the p-value).

Equal variances not assumed

When the two independent samples are assumed to be drawn from populations with unequal variances (i.e., σ₁² ≠ σ₂²), the test statistic t is computed as:

$$ t = \frac{\overline{x}_{1} - \overline{x}_{2}}{\sqrt{\frac{s_{1}^{2}}{n_{1}} + \frac{s_{2}^{2}}{n_{2}}}} $$

where

The calculated t value is then compared to the critical t value from the t distribution table with degrees of freedom

$$ df = \frac{ \left ( \frac{s_{1}^2}{n_{1}} + \frac{s_{2}^2}{n_{2}} \right ) ^{2} }{ \frac{1}{n_{1}-1} \left ( \frac{s_{1}^2}{n_{1}} \right ) ^{2} + \frac{1}{n_{2}-1} \left ( \frac{s_{2}^2}{n_{2}} \right ) ^{2}} $$

and chosen confidence level. If the calculated t value > critical t value, then we reject the null hypothesis.

Note that this form of the independent samples t test statistic does not assume equal variances. This is why both the denominator of the test statistic and the degrees of freedom of the critical value of t are different than the equal variances form of the test statistic.

Data Set-Up

Your data should include two variables (represented in columns) that will be used in the analysis. The independent variable should be categorical and include exactly two groups. (Note that SPSS restricts categorical indicators to numeric or short string values only.) The dependent variable should be continuous (i.e., interval or ratio). SPSS can only make use of cases that have nonmissing values for the independent and the dependent variables, so if a case has a missing value for either variable, it cannot be included in the test.

The number of rows in the dataset should correspond to the number of subjects in the study. Each row of the dataset should represent a unique subject, person, or unit, and all of the measurements taken on that person or unit should appear in that row.

Run an Independent Samples t Test

To run an Independent Samples t Test in SPSS, click Analyze > Compare Means and Proportions > Independent-Samples T Test.

The Independent-Samples T Test window opens where you will specify the variables to be used in the analysis. All of the variables in your dataset appear in the list on the left side. Move variables to the right by selecting them in the list and clicking the blue arrow buttons. You can move a variable(s) to either of two areas: Grouping Variable or Test Variable(s).

A Test Variable(s): The dependent variable(s). This is the continuous variable whose means will be compared between the two groups. You may run multiple t tests simultaneously by selecting more than one test variable.

B Grouping Variable: The independent variable. The categories (or groups) of the independent variable will define which samples will be compared in the t test. The grouping variable must have at least two categories (groups); it may have more than two categories but a t test can only compare two groups, so you will need to specify which two groups to compare. You can also use a continuous variable by specifying a cut point to create two groups (i.e., values at or above the cut point and values below the cut point).

C Define Groups: Click Define Groups to define the category indicators (groups) to use in the t test. If the button is not active, make sure that you have already moved your independent variable to the right in the Grouping Variable field. You must define the categories of your grouping variable before you can run the Independent Samples t Test procedure.

You will not be able to run the Independent Samples t Test until the levels (or cut points) of the grouping variable have been defined. The OK and Paste buttons will be unclickable until the levels have been defined.

You can tell if the levels of the grouping variable have not been defined by looking at the Grouping Variable box: if a variable appears in the box but has two question marks next to it, then the levels are not defined:

Define Groups

Clicking the Define Groups button (C) opens the Define Groups window:

1 Use specified values: If your grouping variable is categorical, select Use specified values. Enter the values for the categories you wish to compare in the Group 1 and Group 2 fields. If your categories are numerically coded, you will enter the numeric codes. If your group variable is string, you will enter the exact text strings representing the two categories. If your grouping variable has more than two categories (e.g., takes on values of 1, 2, 3, 4), you can specify two of the categories to be compared (SPSS will disregard the other categories in this case).

Note that when computing the test statistic, SPSS will subtract the mean of the Group 2 from the mean of Group 1. Changing the order of the subtraction affects the sign of the results, but does not affect the magnitude of the results.

2 Cut point: If your grouping variable is numeric and continuous, you can designate a cut point for dichotomizing the variable. This will separate the cases into two categories based on the cut point. Specifically, for a given cut point x, the new categories will be:

Group 1: All cases where grouping variable > x
Group 2: All cases where grouping variable < x

Note that this implies that cases where the grouping variable is equal to the cut point itself will be included in the "greater than or equal to" category. (If you want your cut point to be included in a "less than or equal to" group, then you will need to use Recode into Different Variables or use DO IF syntax to create this grouping variable yourself.) Also note that while you can use cut points on any variable that has a numeric type, it may not make practical sense depending on the actual measurement level of the variable (e.g., nominal categorical variables coded numerically). Additionally, using a dichotomized variable created via a cut point generally reduces the power of the test compared to using a non-dichotomized variable.

D Options: The Options section is where you can set your desired confidence level for the confidence interval for the mean difference, and specify how SPSS should handle missing values. Clicking the Options button opens the Options window:

The Confidence Interval Percentage box allows you to specify the confidence level for a confidence interval. Note that this setting does NOT affect the test statistic or p-value or standard error; it only affects the computed upper and lower bounds of the confidence interval. You can enter any value between 1 and 99 in this box (although in practice, it only makes sense to enter numbers between 90 and 99).

The Missing Values section allows you to choose if cases should be excluded "analysis by analysis" (i.e. pairwise deletion) or excluded listwise. This setting is not relevant if you have only specified one dependent variable; it only matters if you are entering more than one dependent (continuous numeric) variable. In that case, excluding "analysis by analysis" will use all nonmissing values for a given variable. If you exclude "listwise", it will only use the cases with nonmissing values for all of the variables entered. Depending on the amount of missing data you have, listwise deletion could greatly reduce your sample size.

E Homogeneity of variance test: Enabling this option will add Levene's Test to the output.

F Estimate effect sizes: Enabling this option will add effect size estimates, such as Cohen's d, to the output.

When finished, click OK to run the Independent Samples t Test, or click Paste to have the syntax corresponding to your specified settings written to an open syntax window. (If you do not have a syntax window open, a new window will open for you.)

Example: Independent samples T test when variances are not equal

Problem Statement

In our sample dataset, students reported their typical time to run a mile, and whether or not they were an athlete. Suppose we want to know if the average time to run a mile is different for athletes versus non-athletes. This involves testing whether the sample means for mile time among athletes and non-athletes in your sample are statistically different (and by extension, inferring whether the means for mile times in the population are significantly different between these two groups). You can use an Independent Samples t Test to compare the mean mile time for athletes and non-athletes.

The hypotheses for this example can be expressed as:

H₀: µ_non-athlete − µ_athlete = 0 ("the difference of the means is equal to zero")
H₁: µ_non-athlete − µ_athlete ≠ 0 ("the difference of the means is not equal to zero")

where µ_athlete and µ_non-athlete are the population means for athletes and non-athletes, respectively.

In the sample data, we will use two variables: Athlete and MileMinDur. The variable Athlete has values of either “0” (non-athlete) or "1" (athlete). It will function as the independent variable in this T test. The variable MileMinDur is a numeric duration variable (h:mm:ss), and it will function as the dependent variable. In SPSS, the first few rows of data look like this:

Before the Test

Before running the Independent Samples t Test, it is a good idea to look at descriptive statistics and graphs to get an idea of what to expect. Running Compare Means (Analyze > Compare Means and Proportions > Means) to get descriptive statistics by group tells us that the standard deviation in mile time for non-athletes is about 2 minutes; for athletes, it is about 49 seconds. This corresponds to a variance of 14803 seconds for non-athletes, and a variance of 2447 seconds for athletes¹. Running the Explore procedure (Analyze > Descriptives > Explore) to obtain a comparative boxplot yields the following graph:

If the variances were indeed equal, we would expect the total length of the boxplots to be about the same for both groups. However, from this boxplot, it is clear that the spread of observations for non-athletes is much greater than the spread of observations for athletes. Already, we can estimate that the variances for these two groups are quite different. It should not come as a surprise if we run the Independent Samples t Test and see that Levene's Test is significant.

Additionally, we should also decide on a significance level (typically denoted using the Greek letter alpha, α) before we perform our hypothesis tests. The significance level is the threshold we use to decide whether a test result is significant. For this example, let's use α = 0.05.

¹When computing the variance of a duration variable (formatted as hh:mm:ss or mm:ss or mm:ss.s), SPSS converts the standard deviation value to seconds before squaring.

Running the Test

To run the Independent Samples t Test:

Click Analyze > Compare Means and Proportions > Independent-Samples T Test.
Move the variable Athlete to the Grouping Variable field, and move the variable MileMinDur to the Test Variable(s) area. Now Athlete is defined as the independent variable and MileMinDur is defined as the dependent variable.
Click Define Groups, which opens a new window. Use specified values is selected by default. Since our grouping variable is numerically coded (0 = "Non-athlete", 1 = "Athlete"), type “0” in the first text box, and “1” in the second text box. This indicates that we will compare groups 0 and 1, which correspond to non-athletes and athletes, respectively. Click Continue when finished.
If you are using SPSS Statistics version 27 or later: Click the box next to Estimate effect sizes so that it is selected.
If you are using SPSS Statistics version 31 or later: Click the box next to Homogeneity of variance test so that it is selected.
Click OK to run the Independent Samples t Test. Output for the analysis will display in the Output Viewer window.

Syntax

Most SPSS Versions

T-TEST GROUPS=Athlete(0 1)
   /MISSING=ANALYSIS
   /VARIABLES=MileMinDur
   /CRITERIA=CI(.95).

SPSS Versions 27 and Later

T-TEST GROUPS=Athlete(0 1)
  /MISSING=ANALYSIS
  /VARIABLES=MileMinDur
  /ES DISPLAY(TRUE)
  /CRITERIA=CI(.95).

SPSS Versions 31 and Later

T-TEST GROUPS=Athlete(0 1)
  /MISSING=ANALYSIS
  /VARIABLES=MileMinDur
  /ES DISPLAY(TRUE)
  /HOMOGENEITY DISPLAY(TRUE)
  /CRITERIA=CI(.95).

Output

Tables

Assuming that you enabled the Effect Size and Homogeneity of Variance Test options, the output will have four sections (boxes): Group Statistics, Independent Samples Test, Homogeneity of Variance Test, and Independent Samples Effect Sizes.

Group Statistics

The first section, Group Statistics, provides basic information about the group comparisons, including the sample size (n), mean, standard deviation, and standard error for mile times by group. In this example, there are 166 athletes and 226 non-athletes. The mean mile time for athletes is 6 minutes 51 seconds, and the mean mile time for non-athletes is 9 minutes 6 seconds.

Independent Samples Test

The second section, Independent Samples Test, displays the results most relevant to the Independent Samples t Test. Depending on your version of SPSS Statistics, you may see Levene's Test results in this table.

From left to right:

t is the computed test statistic, using the formula for the equal-variances-assumed test statistic (first row of table) or the formula for the equal-variances-not-assumed test statistic (second row of table)
df is the degrees of freedom, using the equal-variances-assumed degrees of freedom formula (first row of table) or the equal-variances-not-assumed degrees of freedom formula (second row of table)
Significance contains the one-sided and two-sided p-values corresponding to the given test statistic and degrees of freedom
Mean Difference is the difference between the sample means, i.e. x₁ − x₂; it also corresponds to the numerator of the test statistic for that test
Std. Error Difference is the standard error of the mean difference estimate; it also corresponds to the denominator of the test statistic for that test
Confidence Interval of the Difference: This part of the t-test output complements the significance test results. Typically, if the CI for the mean difference contains 0 within the interval -- i.e., if the lower boundary of the CI is a negative number and the upper boundary of the CI is a positive number -- the results are not significant at the chosen significance level

Note that the mean difference is calculated by subtracting the mean of the second group from the mean of the first group. In this example, the mean mile time for athletes was subtracted from the mean mile time for non-athletes (9:06 minus 6:51 = 02:14). The sign of the mean difference corresponds to the sign of the t value. The positive t value in this example indicates that the mean mile time of the first group, non-athletes, is greater than the mean mile time of the second group, athletes.

Which row of the table should we look at? It depends on whether we believe the variance of the dependent variable is the same for both groups. Based on Levene's test (below), we would focus on the "Equal variances not assumed" row. In that case, the two-sided p-value is less than .001, which is below our chosen significance level α = .05, so we reject the null and conclude in favor of the alternative hypothesis: that the mean mile run time of the athletes is significantly different than the mean mile run time of the non-athletes.

Homogeneity of Variance Test

The third section, Homogeneity of Variance test, contains the Levene's Test results.

From left to right:

F is the test statistic of Levene's test
Sig. is the p-value corresponding to this test statistic.

The p-value of Levene's test is printed as p < 0.001 -- i.e., p very small -- so we we reject the null of Levene's test and conclude that the variance in mile time of athletes is significantly different than that of non-athletes. This suggests that we should look at the "Equal variances not assumed" row for the t test (and corresponding confidence interval) results. (If this test result had not been significant -- that is, if we had observed p > .05 -- then we would have used the "Equal variances assumed" output.)

Independent Samples Effect Sizes

The fourth section, Independent Samples Effect Sizes, contains the calculated effect sizes.

The primary column of interest in this table is the Point Estimate column.

Effect sizes are measures of the magnitude of the difference between groups or the strength of association between groups. While p-values tell us whether a statistically significant difference between groups exists (i.e., observed differences are not likely due to chance), effect sizes tell us how small or large that difference is by estimating how many standard deviation units the group means are from each other. Effect sizes are useful as they give us a more practical understanding of the significant difference or association between groups.

As the Independent Samples Effect Sizes table shows, there are three effect size measures for independent samples t tests: Cohen’s d, Hedges’ correction (also known as Hedges’ g), and Glass’s delta.

Cohen’s d and Hedges’ Correction

Both formulas are used to measure the standardized (i.e., standard deviation units) separation of two groups. A measure of 1.0 indicates that the group means are separated by one standard deviation, 2.0 means a separation of two standard deviations, etc. Effect sizes range from:

0.20 = small effect

0.50 = medium effect

0.80+ = large effect

Cohen’s d is calculated by subtracting the mean of group 2 (athletes) from the mean of group 1 (non-athletes) and dividing the difference by the pooled standard deviation. For our mile run time example, the first number in the Point Estimate column is Cohen’s d, which is 1.377. This means that the mean mile run time of non-athletes is separated from the mean mile run time of athletes by 1.377 standard deviations. Referring to the threshold chart above, this means there is a large effect.

Hedges’ correction is calculated by multiplying Cohen’s d by a correction factor. This correction factor was created to calculate a more conservative estimate of effect size, particularly in the case of small sample sizes. Generally, Hedges’ correction will be very close to Cohen’s d, as seen in the Point Estimate column in the table above. Hedges’ correction is 1.375, meaning that the mean mile run time of non-athletes is separated from the mean mile run time of athletes by 1.375 standard deviations. This is a large effect size.

Glass’s delta

Also measures the magnitude of separation between two groups, primarily when one group is a control group and the other is a treatment group. In SPSS, what we identify as Group 1 (in this example, “Non-athlete”) is the treatment group and Group 2 (“Athlete”) is the control group. Glass’s delta is calculated by subtracting the control group mean from the treatment group mean and dividing the difference by the standard deviation of the control group. This tends to produce a different estimate than Cohen’s d and Hedges’ correction—the standard deviation of a control group will usually be different than a pooled standard deviation, meaning Glass’s delta uses a different denominator than Cohen’s d and Hedges’ correction. Directionality also matters when interpreting Glass’s delta; a negative value means the treatment group’s mean is higher than the control, while a positive value means the treatment group’s mean is lower than the control. However, a positive or negative value does not influence the size of the effect, as effect sizes are interpreted based on the absolute value.

The Point Estimate column shows us that Glass’s delta is 2.725, meaning that the mean mile run time of non-athletes is higher than the mean mile run time of athletes and the means are separated by 2.725 standard deviation units. But we must ask ourselves if, given our example, it makes sense to interpret Glass’s delta? Recall that our hypothesis stated that the means of the two groups are not equal, but we did not specify that one group was a control and that the other would be exposed to a treatment that would be expected to produce a different mean. Therefore, our hypothetical research design does not justify the interpretation of Glass’s delta.

Decision and Conclusions

Since p < .001 is less than our chosen significance level α = 0.05, we can reject the null hypothesis, and conclude that the that the mean mile time for athletes and non-athletes is significantly different.

Based on the results, we can state the following:

There was a significant difference in mean mile time between non-athletes and athletes (t_315.846 = 15.047, p < .001).
The average mile time for athletes was 2 minutes and 14 seconds lower than the average mile time for non-athletes.
Cohen's d indicates that the group means are separated by 1.377 standard deviations, meaning there is a large effect.

Library Locations at the Kent Campus

Regional Campus Libraries

SPSS Tutorials: Independent Samples t Test

Sample Data Files

Independent Samples t Test

Common Uses

Data Requirements

Hypotheses

Levene’s Test for Equality of Variances

Test Statistic

Equal variances assumed

Equal variances not assumed

Data Set-Up

Run an Independent Samples t Test

Define Groups

Example: Independent samples T test when variances are not equal

Problem Statement

Before the Test

Running the Test

Syntax

Most SPSS Versions

SPSS Versions 27 and Later

SPSS Versions 31 and Later

Output

Tables

Group Statistics

Independent Samples Test

Homogeneity of Variance Test

Independent Samples Effect Sizes

Cohen’s d and Hedges’ Correction

Glass’s delta

Decision and Conclusions

Street Address

Mailing Address

Contact Us

Quick Links

Information