How do I find the character encoding for a file?

Question

I have an XML that does not include the encoding (charset / Character encoding / character set / character map / codeset / code page). This is an example for one that does:

The XML is being generated by a Perl script and the following is an excerpt:

$fileName = $exportDirectory . $fileName;
open FILE, ">$fileName" or die;

The questions:

In this case, is there an easy way to find the encoding for the generated XML?
The script querying other sources of information (like Oracle database) and appends the data to the XML file. Is the charset encoding dictated by the source of information? Or by the open file operation?
In general, is there an easy way to find the encoding of arbitrary file?

I tried to use LibXML:

perl -MXML::LibXML -e 'XML::LibXML->load_xml(location => "2.xml")' 2.xml:1364531: parser error : Input is not proper UTF-8, indicate encoding ! Bytes: 0xBF 0x30 0x39 0x20 female presented in spring �09 due t ^

I hope I supplied sufficient information. Please let me know if further information is needed.

How do I find the character encoding for a file?

Answers (1)

Related Questions