Pandas read_csv - How to handle a comma inside double quotes that are themselves inside double quotes

Question

The difference is that in that question: "ABC,DEF" was breaking the code.

Here, "ABC "DE" ,F" is breaking the code.

The whole string should be parsed in as 'ABC "DE", F'. Instead the inside double quotes are leading to the below-mentioned issue.

I am working with a csv file that contains the following type of entries:

header1, header2, header3,header4

2001-01-01,123456,"abc def",V4

2001-01-02,789012,"ghi "jklm" n,op",V4

The second row of data is breaking the code, with the following error:

ParserError: Error tokenizing data. C error: Expected 4 fields in line 1234, saw 5

I have tried playing with various sep, delimiter & quoting etc. arguments but nothing seems to work.

Can someone please help with this? Thank you!

Answers (1)