Reputation: 2238
I am trying to set up a column masking framework which allows enabling / disabling masking of column contents through a tag-based approach.
Each "relevant" column gets a tag indicating whether it's supposed to be masked or not. The tag also functions as an enable / disable toggle for masking: I can simply change the tag value to Y/N to indicate whether masking is enabled at that point in time, other factors notwithstanding.
In addition to this, I set up a masking function on the relevant columns which, among other things, checks whether the masking tag is ON or not.
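For concreteness, the setup looks roughly along these lines (table, column, tag and function names are just placeholders for illustration):
-- tag the column; the tag value (Y/N) is the masking on/off toggle
ALTER TABLE dbrx.default.customers ALTER COLUMN ssn SET TAGS ('mask_enabled' = 'Y');
-- masking function; ideally it would check the mask_enabled tag of the
-- column it is attached to, but inside the function I don't know which
-- table / column that is
CREATE OR REPLACE FUNCTION dbrx.default.mask_if_enabled(val STRING)
RETURNS STRING
RETURN CASE WHEN true THEN '***MASKED***' ELSE val END;  -- placeholder condition
ALTER TABLE dbrx.default.customers ALTER COLUMN ssn SET MASK dbrx.default.mask_if_enabled;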
What I am seeing is that the column masking function doesn't really know "its own context" - in other words, while the function is executing it is aware of a few things, such as current_catalog() and current_schema(), but it doesn't know the table / column combination for which it was triggered. I need the table and column names as well (so that I can look up the tag names/values for that column).
So far it seems that the Databricks landscape doesn't allow getting that from "context" within function execution. I also can't pass the context in from the function call (i.e. from the column DDL spec) unless I define everything as a column in that table - not the cleanest way.
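To illustrate the "define everything as a column" workaround I mean: the extra parameters of a column mask have to be other columns of the same table, e.g. a hypothetical mask_flag column, which is what I'd like to avoid:
-- hypothetical: the toggle has to live in a mask_flag column of the table itself
CREATE OR REPLACE FUNCTION dbrx.default.mask_with_flag(val STRING, mask_flag STRING)
RETURNS STRING
RETURN CASE WHEN mask_flag = 'Y' THEN '***MASKED***' ELSE val END;
ALTER TABLE dbrx.default.customers ALTER COLUMN ssn
  SET MASK dbrx.default.mask_with_flag USING COLUMNS (mask_flag);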
I might be overreaching the capabilities of the tool here, but my use case is to get a generic data masking framework in place - one which allows a relatively simple toggle for masking of columns with sensitive information.
Thoughts?
Ideas?
Upvotes: 1
Views: 286
Reputation: 3250
As you mentioned, you need a column masking framework which allows enabling / disabling masking of column contents through a tag-based approach.
I created a table that stores information about each column and whether it is sensitive:
CREATE TABLE column_tags (
    table_name STRING,
    column_name STRING,
    is_sensitive BOOLEAN
);
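For the example below I assumed the customers table has email and ssn marked as sensitive; sample rows along these lines (adjust to your own tables):
INSERT INTO column_tags VALUES
  ('customers', 'email', true),
  ('customers', 'ssn',   true);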
I have tried the below approach:
from pyspark.sql import functions as F

def mask_columns(schema_name, table_name, enable_masking=True):
    # Get the column names of the table
    columns = spark.sql(f"SHOW COLUMNS IN {schema_name}.{table_name}").collect()
    column_names = [col["col_name"] for col in columns]

    # Get sensitivity metadata for this table and build a lookup dictionary
    column_tags = spark.sql(
        f"SELECT column_name, is_sensitive FROM column_tags WHERE table_name = '{table_name}'"
    )
    sensitive_columns = {row["column_name"]: row["is_sensitive"] for row in column_tags.collect()}

    # Get the data types of the columns using DESCRIBE
    describe_df = spark.sql(f"DESCRIBE {schema_name}.{table_name}")
    column_types = {
        row["col_name"]: row["data_type"]
        for row in describe_df.collect()
        if "col_name" in row and "data_type" in row
    }

    # Build the SELECT expressions with the masking logic
    select_expr = []
    for col_name in column_names:
        is_sensitive = sensitive_columns.get(col_name, False)
        if is_sensitive and enable_masking:
            # Mask based on the column type
            col_type = column_types.get(col_name, "string")
            if col_type == "string":
                select_expr.append(F.lit("MASKED").alias(col_name))
            else:
                select_expr.append(F.lit(None).cast(col_type).alias(col_name))
        else:
            select_expr.append(F.col(col_name))

    # Apply the masking logic using the select expressions
    df = spark.table(f"{schema_name}.{table_name}").select(*select_expr)
    return df
df_masked = mask_columns("dbrx.default", "customers", enable_masking=True)
df_masked.show()
+-----------+-------+------+------+-----------+
|customer_id| name| email| ssn| address|
+-----------+-------+------+------+-----------+
| 3|narayan|MASKED|MASKED|789 Pine St|
| 1| dilip|MASKED|MASKED| 123 Elm St|
| 2| raj|MASKED|MASKED| 456 Oak St|
+-----------+-------+------+------+-----------+
The approach works as follows:
- Get the column names of the table.
- Get the sensitivity metadata for this table from the column_tags table and build a dictionary mapping sensitive columns.
- Get the data types of the columns using DESCRIBE and collect the column names and data types into a dictionary.
- Generate the SELECT expressions with the masking logic: if a column is marked as sensitive in the metadata, mask it based on its column type; otherwise keep it as-is.
- Apply the masking logic by selecting those expressions from the table.
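To toggle masking for a single column on or off, you can flip its flag in column_tags (assuming it is a Delta table) and re-run mask_columns, or pass enable_masking=False to disable masking entirely. For example, something like:
-- disable masking for the email column only
UPDATE column_tags
SET is_sensitive = false
WHERE table_name = 'customers' AND column_name = 'email';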
Upvotes: 0