Create random groupings from list

Question

I need to take a list of over 500 people and place them into groups of 15. The groups should be randomized so that we don't end up with groups where everyone's last name begins with "B", for example. But I also need the gender balance within each group of 15 to be as close to parity as possible. The list is in a 'students.csv' file with this structure:


Last, First, ID, Sport, Gender, INT
James, Frank, f99087, FOOT, m, I
Smith, Sally, f88329, SOC, f, 
Cranston, Bill, f64928, ,m,

I was looking for some kind of solution in pandas, but I have limited coding knowledge. The code I've got so far just explores the data a bit.

import pandas as pd
data = pd.read_csv('students.csv', index_col='ID', skipinitialspace=True)  # the sample has spaces after the commas
print(data)

print(data.Gender.value_counts())

user10325516 · Accepted Answer

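An approach using pandas: shuffle the rows, then cut them into groups of 15 members. Any remaining rows end up in the very last group, which may be smaller. Gender is not balanced explicitly; each group's ratio is only roughly the overall ratio, as close as the random shuffle happens to make it.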

import pandas as pd

df = pd.read_csv('1.csv', skipinitialspace=True) # 1.csv contains sample data from the question

# shuffle data / pandas way
df = df.sample(frac=1).reset_index(drop=True)

# group size
SIZE = 15

# create column with group number
df['group'] = df.index // SIZE

# list of groups, groups[0] is dataframe with the first group members
groups = [
    df[df['group'] == num]
    for num in range(df['group'].max() + 1)]
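
Since the shuffle leaves the gender balance to chance, it may be worth checking the result before using the groups. A minimal sketch, assuming the `group` column created above and the `Gender` column from the question; the helper name `gender_counts` is made up for illustration:

```python
import pandas as pd

def gender_counts(df, group_col="group", gender_col="Gender"):
    """Return a table with one row per group and one column per gender value."""
    # crosstab counts how often each (group, gender) pair occurs
    return pd.crosstab(df[group_col], df[gender_col])
```

Calling `print(gender_counts(df))` after the grouping step shows at a glance whether any group came out lopsided; if so, re-running the shuffle is the simplest remedy.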

Save dataframe to file:

# one csv-file
df.to_csv('2.csv')

# many csv-files
for num, group_df in enumerate(groups, 1):
    group_df.to_csv('group_{}.csv'.format(num))
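
If the balance needs to be tighter than a plain shuffle gives, one option is to deal the rows out per gender. This is a different technique from the code above, sketched under a few assumptions: the column is named `Gender` as in the question, the function name `assign_balanced_groups` and its parameters are made up for illustration, and it is acceptable for the groups to be of nearly equal size (for example 14 or 15) instead of having one small leftover group.

```python
import math
import pandas as pd

def assign_balanced_groups(df, size=15, by="Gender", seed=None):
    """Return a shuffled copy of df with a 'group' column balanced on `by`."""
    # Smallest number of groups such that no group exceeds `size` members
    n_groups = max(1, math.ceil(len(df) / size))
    # Shuffle all rows first so the order within each gender is random
    shuffled = df.sample(frac=1, random_state=seed)
    # A stable sort keeps that random order inside each gender block
    ordered = shuffled.sort_values(by, kind="mergesort").reset_index(drop=True)
    # Deal rows out like cards: row i goes to group i % n_groups, so each
    # gender's count differs by at most one between any two groups
    ordered["group"] = ordered.index % n_groups
    return ordered
```

With the question's file this could be called as `assign_balanced_groups(pd.read_csv('students.csv', skipinitialspace=True))`. The result has the same `group` column as above, so the saving code works unchanged on it.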
