parse dates with milliseconds in pandas read_csv

Question

My .csv looks like this:

     date      time  
0    20190101  181555700  
1    20190101  181545515

where the format is YYYYMMDD for date and HHMMSSMMM for time (last MMM are milliseconds). For example the first row would be 2019-01-01 18:15:55.700

Is there a way to parse this directly from pd.read_csv() without having to convert it later? Using only parse_dates does not work as it doesn't recognize the format. What I would like is to have a single column in my dataframe, with the timestamp correctly parsed like

    timestamp
0   2019-01-01 18:15:55.700

PenutChen · Accepted Answer

I think this is close to what you need:

import pandas as pd
import datetime as dt

data = pd.read_csv(
   './a.csv',
   delimiter='	',
   index_col=0,
   parse_dates=[1],
   converters={'time': lambda t: dt.datetime.strptime(t, '%H%M%S%f').time()}
)

Output:

        date             time
0 2019-01-01  18:15:55.700000
1 2019-01-01  18:15:45.515000

After some survey I found this:

data = pd.read_csv(
   './a.csv',
   delimiter='	',
   index_col=1,
   parse_dates={'datetime': [1, 2]},
   converters={'time': lambda t: dt.datetime.strptime(t, '%H%M%S%f').time()}
)

And the output is:

                 datetime
0 2019-01-01 18:15:55.700
1 2019-01-01 18:15:45.515

parse dates with milliseconds in pandas read_csv

Answers (2)

Related Questions