Reputation: 173
I'm working with a Kafka cluster that replicates data from a MySQL DB to an old Oracle DB. To achieve that, I've connected the Source DB to the Debezium MySQL connector and the Sink DB to the Debezium connector for JDBC.
I'm trying to replicate every table of a DB and I'm running into an issue at the Sink level.
Basically, on the Source server there's a table column with a NOT NULL constraint, and whoever inserted the data on that DB worked around the requirement by inserting empty "" strings.
The Debezium JDBC Sink connector correctly reads the schema of the source table and recreates it on the Sink DB with all of its constraints, but unfortunately Oracle interprets an empty string as a NULL value, so the INSERT gets rejected by the DB and Debezium crashes (of course).
This is a typical payload that Debezium (Source) writes into the Kafka cluster; as you can see, the third field contains an empty string:
...
"payload": {
    "before": null,
    "after": {
        "field1": 852,
        "field2": 480,
        "field3": ""
    },
...
Is there an easy way to handle this problem at a general level, i.e. a solution that works for any field of any table that might contain an empty string, without having to specify the field's name?
Unfortunately, the Single Message Transform that replaces a field's value with another value requires the field's name, and that defeats the purpose of having Debezium take care of everything.
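For illustration, this is roughly how such a per-field transform would have to be configured (here Kafka's built-in MaskField with its replacement option; the transform and field names are just placeholders). It needs every field spelled out, and it would also overwrite non-empty values:
"transforms": "blankField3",
"transforms.blankField3.type": "org.apache.kafka.connect.transforms.MaskField$Value",
"transforms.blankField3.fields": "field3",
"transforms.blankField3.replacement": " "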
This is the current configuration of the Debezium JDBC Sink connector:
{
    "connector.class": "io.debezium.connector.jdbc.JdbcSinkConnector",
    "tasks.max": "1",
    "connection.url": "jdbc:oracle:thin:@ldap://ldpap.com:3060/testdb,cn=OracleContext,dc=domain,dc=priv",
    "connection.username": "username",
    "connection.password": "password",
    "insert.mode": "insert",
    "schema.evolution": "basic",
    "database.time_zone": "UTC",
    "topics.regex": "(DBNAME\\.DBNAME\\.([^.]+)$)",
    "quote.identifiers": "true"
}
The connector correctly binds the Kafka topics containing the data and writes a few tables of the Source DB to the Sink DB, until it reaches the offending table and crashes. I have also tried setting "insert.mode": "upsert", with Debezium taking care of the primary keys.
Upvotes: 0
Views: 545
Reputation: 35603
I'm afraid there really is no simple solution to this that I can find.
Apparently, you might be able to create your own custom SMT that intercepts the data and replaces empty strings with some other acceptable character or string - but (a) this is outside my expertise, and (b) I think it may be too risky and would require extensive testing. (Plus this info was gained through an AI, so that may be even more risky.)
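For what it's worth, here is a minimal sketch of what such a custom SMT might look like (the class name and the replacement value are my own invention, and it is untested - treat it as a starting point, not a working solution):
package com.example.kafka.smt;

import java.util.Map;

import org.apache.kafka.common.config.ConfigDef;
import org.apache.kafka.connect.connector.ConnectRecord;
import org.apache.kafka.connect.data.Field;
import org.apache.kafka.connect.data.Schema;
import org.apache.kafka.connect.data.Struct;
import org.apache.kafka.connect.transforms.Transformation;

/**
 * Hypothetical SMT that replaces empty string fields with a single space
 * so that Oracle does not treat them as NULL.
 *
 * Limitation: it only walks the top level of the value Struct, so with the
 * raw Debezium envelope (before/after) you would need to recurse into
 * nested Structs, or apply it after an unwrap transform.
 */
public class ReplaceEmptyStrings<R extends ConnectRecord<R>> implements Transformation<R> {

    private static final String REPLACEMENT = " "; // pick whatever Oracle should store

    @Override
    public R apply(R record) {
        // Pass tombstones and schemaless records through untouched
        if (!(record.value() instanceof Struct)) {
            return record;
        }
        Struct value = (Struct) record.value();
        Struct updated = new Struct(value.schema());
        for (Field field : value.schema().fields()) {
            Object v = value.get(field);
            if (field.schema().type() == Schema.Type.STRING && "".equals(v)) {
                updated.put(field, REPLACEMENT);
            } else {
                updated.put(field, v);
            }
        }
        return record.newRecord(record.topic(), record.kafkaPartition(),
                record.keySchema(), record.key(),
                value.schema(), updated, record.timestamp());
    }

    @Override
    public ConfigDef config() {
        return new ConfigDef();
    }

    @Override
    public void configure(Map<String, ?> configs) {
    }

    @Override
    public void close() {
    }
}
You would package this as a jar, put it on the Connect plugin path and reference it from the sink connector's "transforms" configuration.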
So the only relatively simple approach I can think of is to locate all the tables/columns in MySQL that will give rise to this issue and to address it in MySQL before the transfer, e.g. by removing the NOT NULL constraint or by updating the empty strings to something else.
So, find the NOT NULL string columns and test whether they contain empty strings:
DELIMITER //
CREATE PROCEDURE FindEmptyStringColumns()
BEGIN
    DECLARE done INT DEFAULT FALSE;
    DECLARE tableName CHAR(64);
    DECLARE columnName CHAR(64);
    -- All NOT NULL string columns in the current schema
    DECLARE cur CURSOR FOR
        SELECT TABLE_NAME, COLUMN_NAME
        FROM INFORMATION_SCHEMA.COLUMNS
        WHERE TABLE_SCHEMA = DATABASE()
          AND DATA_TYPE IN ('char', 'varchar', 'text')
          AND IS_NULLABLE = 'NO';
    DECLARE CONTINUE HANDLER FOR NOT FOUND SET done = TRUE;
    OPEN cur;
    read_loop: LOOP
        FETCH cur INTO tableName, columnName;
        IF done THEN
            LEAVE read_loop;
        END IF;
        -- Count empty (or whitespace-only) values in this column;
        -- identifiers are backtick-quoted in case of reserved words
        SET @s = CONCAT('SELECT "', tableName, '.', columnName,
                        '" AS table_column, COUNT(*) AS empty_string_count ',
                        'FROM `', tableName,
                        '` WHERE TRIM(`', columnName, '`) = ""');
        PREPARE stmt FROM @s;
        EXECUTE stmt;
        DEALLOCATE PREPARE stmt;
    END LOOP;
    CLOSE cur;
END//
DELIMITER ;
then:
CALL FindEmptyStringColumns();
Of course this is just a suggestion, it will be up to you to decide what to do with any columns detected.
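For example, once a column is flagged, you might fix it up in MySQL before the transfer (table/column names and the replacement value below are placeholders):
-- Option 1: replace the empty strings with a value Oracle will accept
UPDATE some_table
SET some_column = ' '
WHERE TRIM(some_column) = '';

-- Option 2: drop the NOT NULL constraint instead
-- (MODIFY must repeat the full column definition; VARCHAR(255) is a placeholder)
ALTER TABLE some_table
MODIFY some_column VARCHAR(255) NULL;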
Upvotes: 0