Python 3.4, error when creating a DataFrame with panda

Question

I'm attempting to create a DataFrame with the following:

from pandas import DataFrame, read_csv

import matplotlib.pyplot as plt
import pandas as pd
import sys

# The inital set of baby names and birth rates
names =['Bob','Jessica','Mary','John','Mel']
births = [968, 155, 77, 578, 973]

#Now we wil zip them together
BabyDataSet = zip(names,births)
    ##we have to add the 'list' for version 3.x
print (list(BabyDataSet))

#create the DataFrame
df = DataFrame(BabyDataSet, columns = ['Names', 'Births'] )
print (df)

when I run the program I get the following error: 'data type can't be an iterator' I read the following, 'What does the "yield" keyword do in Python?', but I do not understand how that applies to what I'm doing. Any help and further understanding would be greatly appreciated.

chrisb · Accepted Answer

In python 3, zip returns an iterator, not a list like it does in python 2. Just convert it to a list as you construct the DataFrame, like this.

df = DataFrame(list(BabyDataSet), columns = ['Names', 'Births'] )

Python 3.4, error when creating a DataFrame with panda

Answers (2)

Related Questions