Pandas reading csv with a datetime period

Question

I currently have a dataframe loaded from a csv as such:

   name                  week  content
0  Dan  2012-07-09/2012-07-15      4.0
1  Jim  2012-07-09/2012-07-15      1.0
2  Joe  2012-07-09/2012-07-15      3.0
3  Sam  2012-07-16/2012-07-22     18.0
4  Tom  2012-07-16/2012-07-22      7.0

The datetime period week is stored as a string. How would I go about converting this to a datetime period?

jezrael · Accepted Answer

You can convert first of datetimes by to_datetime and then use Series.dt.to_period:

df['week'] = pd.to_datetime(df['week'].str.split('/').str[0]).dt.to_period('W')

print (df)
  name                   week  content
0  Dan  2012-07-09/2012-07-15      4.0
1  Jim  2012-07-09/2012-07-15      1.0
2  Joe  2012-07-09/2012-07-15      3.0
3  Sam  2012-07-16/2012-07-22     18.0
4  Tom  2012-07-16/2012-07-22      7.0

print (df['week'])
0    2012-07-09/2012-07-15
1    2012-07-09/2012-07-15
2    2012-07-09/2012-07-15
3    2012-07-16/2012-07-22
4    2012-07-16/2012-07-22
Name: week, dtype: period[W-SUN]

If want parse values in read_csv use converters with lambda function:

import pandas as pd
from io import StringIO

temp="""name;week;content
0;Dan;2012-07-09/2012-07-15;4.0
1;Jim;2012-07-09/2012-07-15;1.0
2;Joe;2012-07-09/2012-07-15;3.0
3;Sam;2012-07-16/2012-07-22;18.0
4;Tom;2012-07-16/2012-07-22;7.0"""
#after testing replace 'pd.compat.StringIO(temp)' to 'filename.csv'

f = lambda x: pd.to_datetime(x.split('/')[0]).to_period('W')
df = pd.read_csv(StringIO(temp), sep=";", converters={'week': f})

print (df)
  name                   week  content
0  Dan  2012-07-09/2012-07-15      4.0
1  Jim  2012-07-09/2012-07-15      1.0
2  Joe  2012-07-09/2012-07-15      3.0
3  Sam  2012-07-16/2012-07-22     18.0
4  Tom  2012-07-16/2012-07-22      7.0

print (df.dtypes)
name              object
week       period[W-SUN]
content          float64
dtype: object

Pandas reading csv with a datetime period

Answers (1)

Related Questions