SQL: OPENROWSET not returning data correctly

Question

I am trying to use OPENROWSET to query a csv file which works well 90% of the time but for some reasons some .csv files were returning this error:

Msg 4863, Level 16, State 1, Line 1
Bulk load data conversion error (truncation) for row 1, column 5 (Status Description).

or this error:

Msg 4832, Level 16, State 1, Line 1
Bulk load: An unexpected end of file was encountered in the data file.

My Query looks like this:

select * from OPENROWSET(BULK 'E:\File.csv', FORMATFILE= 'E:\schema.xml') AS a

My format file looks like this:

I found that if I copy the contents of my .csv into a brand new file and save it then run again the query will complete successfully. But this is not ideal so after tweaking the format file and running the same query I now get this as a result:

Column 1 Column2 Column3 Column4
ÿþD             
m               
m               
m               
m               
m

When my original data looks like this:

 Column 1 Column2   Column3 Column4
 Abc      elephant  Yes     Job has finished.               
 def      tiger     Yes     Job has finished.
 xyz      monkey    Yes     Job has finished.   
 ghi      dog       Yes     Job has finished.

It seems that now the query is completing but is returning garbage data.

Does anyone know how to fix this so that I can return accurate results?

Tony Hinkle · Accepted Answer

ÿþ is a byte-order mark, which tells me that it's a Unicode encoded file. Whatever is reading the file isn't smart enough to handle Unicode files, so it is unable to read it.

You'll need to modify whatever is creating the file to use ANSI, or modify what you're using to read the file to handle Unicode.

To work around the issue, you can convert the file to ANSI using the type command and redirect the output to a new file:

cmd /a /c type myfile.csv > myansifile.csv

SQL: OPENROWSET not returning data correctly

Answers (1)

Related Questions