Reputation: 391
I am writing a Java (ok, Groovy, but that doesn't matter here) migration script to copy BLOB fields from one Oracle 10g database to another. The data was created by an MS Access application. The files seem to have an incorrect encoding, and I am guessing that MS Access or the ODBC driver manipulates the files in some way.
Using the query
SELECT * FROM NLS_DATABASE_PARAMETERS WHERE PARAMETER = 'NLS_CHARACTERSET';
I found out that the source database has the character set WE8MSWIN1252.
The source table is defined as:
CREATE TABLE CTR_DOCUMENTS (
CTR_ID NUMBER(11) NOT NULL,
CTR_A_ID NUMBER(11),
CTR_FILENAME VARCHAR2(260 Char) NOT NULL,
CTR_COMMENT VARCHAR2(255 Char),
CTR_DATE DATE,
CTR_DATA BLOB
)
I access the blobs like so:
def blob = sourceDB.firstRow("SELECT CTR_DATA FROM CTR_DOCUMENTS WHERE CTR_ID = ?",
[id]).CTR_DATA
def blobSize = blob.length() as int
def blobStream = blob.getBinaryStream()
byte[] byteArray = new byte[blobSize]
// InputStream.read() may return fewer bytes than requested,
// so keep reading until the whole BLOB is in the buffer
int offset = 0
while (offset < blobSize) {
    offset += blobStream.read(byteArray, offset, blobSize - offset)
}
I saved some of the blobs as files; the content looks weird and the files cannot be opened by their associated programs. The second byte is always 00:
0000000: 2500 5000 4400 4600 2d00 3100 2e00 3500 %.P.D.F.-.1...5.
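Here is a minimal Java sketch of such a dump (the class and method names are just for illustration); it prints a byte array in the same xxd-style layout and reproduces the line above for the sample bytes:
public class BlobDump {
    // Print the first (up to) 16 bytes of data in xxd style:
    // offset, hex digits grouped in 2-byte pairs, then an ASCII column
    static void dumpFirstLine(byte[] data) {
        StringBuilder hex = new StringBuilder();
        StringBuilder ascii = new StringBuilder();
        int n = Math.min(16, data.length);
        for (int i = 0; i < n; i++) {
            hex.append(String.format("%02x", data[i]));
            if (i % 2 == 1) {
                hex.append(' ');
            }
            ascii.append(data[i] >= 0x20 && data[i] < 0x7f ? (char) data[i] : '.');
        }
        System.out.printf("%07x: %-40s %s%n", 0, hex, ascii);
    }

    public static void main(String[] args) {
        // the first 16 bytes observed in the saved files
        byte[] sample = {0x25, 0x00, 0x50, 0x00, 0x44, 0x00, 0x46, 0x00,
                         0x2d, 0x00, 0x31, 0x00, 0x2e, 0x00, 0x35, 0x00};
        dumpFirstLine(sample);  // prints the line shown above
    }
}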
I also observe the same behavior when accessing the BLOBs with an SQL client (SQL Workbench/J, SQL Developer, TOAD).
To me, it looks like I have to convert the files from Windows-1252 to UTF-8, but that does not work (roughly as sketched below). Am I missing something here?
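A sketch of that re-encoding attempt (the names are illustrative; byteArray is the array read above):
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class ReencodeAttempt {
    // Decode the raw bytes as Windows-1252 and re-encode them as UTF-8.
    // For a binary format like PDF this cannot work: the bytes are not
    // text, so the round-trip mangles them, and it does not remove the
    // interleaved 0x00 bytes either.
    static byte[] reencode(byte[] byteArray) {
        return new String(byteArray, Charset.forName("windows-1252"))
                .getBytes(StandardCharsets.UTF_8);
    }
}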
Upvotes: 2
Views: 343
Reputation: 123419
Where I would start, for diagnostic purposes if nothing else, would be to scan the byte arrays from a sample of input BLOBs to see if every second byte is in fact 0x00, and if so write every other (non-zero) byte to a bytesOut byte array. If successful, I would write the bytesOut arrays to files and see if they are now valid PDF documents. For example:
import java.io.FileOutputStream;
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class BlobFixTest {

    public static void main(String[] args) {
        try {
            // example uses a SQL Server connection; adjust the JDBC URL
            // (and driver) for your Oracle source database
            String connectionUrl =
                    "jdbc:sqlserver://localhost;" +
                    "instanceName=SQLEXPRESS;" +
                    "databaseName=myDb;" +
                    "integratedSecurity=true";
            Connection con = DriverManager.getConnection(connectionUrl);
            String SQL =
                    "SELECT CTR_ID, CTR_FILENAME, CTR_DATA " +
                    "FROM CTR_DOCUMENTS " +
                    "WHERE CTR_ID BETWEEN 1 AND 5";
            Statement stmt = con.createStatement();
            ResultSet rs = stmt.executeQuery(SQL);
            while (rs.next()) {
                boolean writeFile = true;
                byte[] bytesIn = rs.getBytes("CTR_DATA");
                // scan the input byte array and copy every second byte to the
                // output byte array; (length + 1) / 2 also covers an odd-length input
                byte[] bytesOut = new byte[(bytesIn.length + 1) / 2];
                for (int i = 0; i < bytesIn.length; i++) {
                    if ((i % 2) == 1) {
                        // every odd offset must be 0x00 for this theory to hold
                        if (bytesIn[i] != 0x00) {
                            System.out.println(String.format(
                                    "Darn. bytesIn value at offset %d is not 0x00. Skipping...", i));
                            writeFile = false;
                            break;
                        }
                    } else {
                        bytesOut[i / 2] = bytesIn[i];
                    }
                }
                if (writeFile) {
                    String outFile =
                            "C:\\__tmp\\pdfTest\\" + rs.getString("CTR_FILENAME");
                    FileOutputStream fos = new FileOutputStream(outFile);
                    fos.write(bytesOut);
                    fos.close();
                    System.out.println(String.format("\"%s\" created.", outFile));
                }
            }
            rs.close();
            con.close();
        } catch (Exception e) {
            System.out.println(e.getMessage());
            System.exit(0);
        }
    }
}
The reasoning is that if, somewhere along the way, some process took what it thought was a "string" of single-byte characters (e.g., Windows-1252) and converted it to Unicode (e.g., UCS-2LE) by simply inserting 0x00 after every character (without otherwise mangling the actual data bytes), then the most straightforward solution would be to take those 0x00 bytes out again.
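As a quick sanity check of that theory, encoding the expected PDF header as UTF-16LE (which coincides with UCS-2LE for these characters) reproduces exactly the byte pattern from the question's hex dump:
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class HeaderCheck {
    public static void main(String[] args) {
        // "%PDF-1.5" encoded as UTF-16LE: each ASCII byte followed by 0x00
        byte[] encoded = "%PDF-1.5".getBytes(StandardCharsets.UTF_16LE);
        byte[] expected = {0x25, 0x00, 0x50, 0x00, 0x44, 0x00, 0x46, 0x00,
                           0x2d, 0x00, 0x31, 0x00, 0x2e, 0x00, 0x35, 0x00};
        System.out.println(Arrays.equals(encoded, expected)); // prints: true
    }
}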
Upvotes: 2