Mistaken UTF-8 Conversion of PDF File (C#)

Question

A developer was tasked with pushing PDF files to an FTP site. By accident, each PDF was read as a string, encoded to a UTF-8 byte array, and then pushed to the FTP site. This caused problems, since PDF files are NOT TEXT.

Below is the code that was executed:

//method passed in a filepath to use for the upload
var filePath = @"C:\temp\myFile.pdf";
byte[] pdfBytes;
using (var sr = new StreamReader(filePath))
{
    pdfBytes = Encoding.UTF8.GetBytes(sr.ReadToEnd());
}
//byte array was then uploaded

My question: Is there any way to REVERSE this type of corruption on a per-file basis? Can you take the corrupt PDF, read its bytes, and somehow turn it back into a "PDF string"? (I know PDFs are not strings. I'm just trying to see if it's possible to reverse the corruption.)

NOTE: We've already fixed the code, and are getting the bytes as below. Just wanting to know if there is a way to UNDO what was done.

var pdfBytes = File.ReadAllBytes(filePath);
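
To make the corruption concrete, the round trip from the original code can be reproduced in memory on a few sample bytes. This is only a minimal sketch: the class name Utf8RoundTripDemo, the helper CorruptLikeUpload, and the six sample bytes are hypothetical, and it assumes StreamReader's default settings (UTF-8 decoding with byte-order-mark detection), which is what the constructor used above applies.

```csharp
using System;
using System.IO;
using System.Text;

public static class Utf8RoundTripDemo
{
    // Mirrors the original upload code in memory: decode raw bytes as text
    // using StreamReader's defaults, then re-encode that text as UTF-8.
    public static byte[] CorruptLikeUpload(byte[] original)
    {
        using (var ms = new MemoryStream(original))
        using (var sr = new StreamReader(ms))
        {
            return Encoding.UTF8.GetBytes(sr.ReadToEnd());
        }
    }

    public static void Main()
    {
        // Hypothetical sample: ASCII "%PDF" followed by two bytes
        // (0x80 and 0xFF) that are not valid UTF-8 on their own.
        byte[] original = { 0x25, 0x50, 0x44, 0x46, 0x80, 0xFF };
        byte[] corrupted = CorruptLikeUpload(original);

        Console.WriteLine(BitConverter.ToString(original));
        // prints 25-50-44-46-80-FF
        Console.WriteLine(BitConverter.ToString(corrupted));
        // prints 25-50-44-46-EF-BF-BD-EF-BF-BD
    }
}
```

In this sample the ASCII bytes pass through unchanged, while 0x80 and 0xFF both come out as EF BF BD, the UTF-8 encoding of the replacement character U+FFFD. Two different input bytes map to the same output, so for those positions the original values cannot be read back from the corrupted file alone.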
