Universal newline mode in csv.reader makes csv.writer write unwanted line breaks to the file

Question

When I read a CSV file in universal newline mode ("rU"), csv.reader keeps the line breaks that occur inside a field, and csv.writer then writes them out as new lines. How can I make csv.writer ignore those line breaks? I have to use "rU" in the reader because my files contain newline characters.

This is the code I use:

import csv

dict={}
with open('training_data.csv','rU') as f:
    reader = csv.reader(f,skipinitialspace=True)
    for line in reader:
        try:
            dict[line[2]].append(line[3])
        except:
            dict[line[2]]=[line[3]]

with open('training_result.csv','w') as f:
    writer = csv.writer(f, delimiter='|',dialect='excel-tab')
    for key in dict:
        writer.writerow([key,','.join(dict[key])])

The input looks like this:

username, some of tweet that
want to be processed
by machine , label

Because the field contains line breaks and universal newline mode is enabled, the data I read still contains them, so what csv.writer writes has the same line breaks.

The output I want looks like this:

username, some of tweet that want to be processed by machine , label

Should I remove all of the line breaks in the CSV file first? The file is large: around 150 MB and 700 thousand rows. Is there another approach?

I have already experimented with reader options such as skipinitialspace and dialect, but they did not solve the problem.

Lalith J. · Accepted Answer

We can achieve this by replacing the line breaks inside each field with ", " and starting each newly appended value on a new line. If you do not want any new lines at all, drop the "\n" + prefix from the append call:

dict[line[2]].append(line[3].replace("\n", ", "))

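To see what the replace call operates on, here is a small sketch in Python 3 syntax. It assumes the multi-line tweet is enclosed in double quotes in the file, since csv.reader only keeps a line break inside a quoted field. The sample row and the variable names are made up for illustration, and a space is used as the separator instead of ", " to match the output the question asks for.

```python
import csv
import io

# Hypothetical sample row: the quoted second field spans three physical lines.
sample = 'username,"some of tweet that\nwant to be processed\nby machine",label\n'

# newline='' leaves line endings untouched so the csv module can handle them.
rows = list(csv.reader(io.StringIO(sample, newline=''), skipinitialspace=True))

# The reader returns one logical row; the line breaks stay inside the field.
raw_field = rows[0][1]

# Same kind of substitution as in the answer, with " " as the replacement.
flat_field = raw_field.replace("\n", " ")
print(flat_field)
```

The three physical lines come back as a single row, so the cleanup can be done per field while reading, without preprocessing the whole file.
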
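Here is the code:

import csv

# Note: the name dict shadows the built-in; it is kept to match the question's code.
dict = {}
with open('training_data.csv', 'rU') as f:
    reader = csv.reader(f, skipinitialspace=True)
    for line in reader:
        # Replace line breaks inside the field with ", "
        text = line[3].replace("\n", ", ")
        try:
            # Later values for the same key start on a new line
            dict[line[2]].append("\n" + text)
        except KeyError:
            # First value seen for this key
            dict[line[2]] = [text]

with open('training_result.csv', 'w') as f:
    writer = csv.writer(f, delimiter=',', dialect='excel-tab')
    for key in dict:
        writer.writerow([key, ','.join(dict[key])])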
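
For newer Python versions, the same idea can be sketched as follows. This is a hedged variant, not the code above: it assumes Python 3, where the "rU" mode was removed in 3.11 and the csv documentation recommends opening files with newline='' instead. The function name group_tweets and the sample rows are hypothetical, and line breaks are collapsed to a single space rather than ", ".

```python
import csv
import io
from collections import defaultdict


def group_tweets(src, dst):
    """Group column 3 by column 2, flattening line breaks inside fields."""
    grouped = defaultdict(list)
    for row in csv.reader(src, skipinitialspace=True):
        if len(row) < 4:
            continue  # skip short or blank rows instead of raising IndexError
        # split() + join collapses \n, \r\n and repeated spaces into one space
        grouped[row[2]].append(" ".join(row[3].split()))
    writer = csv.writer(dst, lineterminator="\n")
    for key, values in grouped.items():
        writer.writerow([key, ",".join(values)])


# Hypothetical in-memory input standing in for training_data.csv.
src = io.StringIO(
    'id1,user1,positive,"first line\nsecond line"\n'
    'id2,user2,positive,single line\n',
    newline='',
)
dst = io.StringIO()
group_tweets(src, dst)
result = dst.getvalue()
print(result)
```

With real files, both would be opened with newline='', for example open('training_data.csv', newline='') for reading and open('training_result.csv', 'w', newline='') for writing. Rows are read one at a time, so only the grouped column is held in memory.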
