How to use hypothesis testing to compare groups

Question

I have a data frame with name, type, and Turnover per game. A sample of that df is given below.

Name    Type    Turnover per game
kevin   A       5
duke    B       10
jonas   A       12
angly   A       2
anjelo  B       10
wily    A       4
nick    A       8

What I want to do is implement a hypothesis test to check, Type A players have average less turnovers than Type B players..

What I tried :

Firstly, group by Type:

df.groupby('Type').mean()

But I don't know how to implement a hypothesis test to check the above condition.

Stef · Accepted Answer

Hypothesis testing can be done with ttest_ind:

import pandas as pd
from scipy import stats

data = {'Name': ['kevin', 'duke', 'jonas', 'angly', 'anjelo', 'wily', 'nick'],
        'Type': ['A', 'B', 'A', 'A', 'B', 'A', 'A'],
        'Turnover': [5, 10, 12, 2, 10, 4, 8]}
df = pd.DataFrame(data)

t,p = stats.ttest_ind(df.Turnover[df.Type.eq('A')], df.Turnover[df.Type.eq('B')], 
                      equal_var=False, alternative='less')

if p < 0.05:
    print('Type A players have average less turnovers than Type B players')
else:
    print('Null hypothesis (equal means) cannot be rejected.')

In your example, the null hypothesis that type A and B players have equal turnovers will be reject and the alternative hypothesis that type A players have average less turnovers than type B player will be accepted. See the section Interpretation in the above linked Wikepedia article for details.

How to use hypothesis testing to compare groups

Answers (2)

Related Questions