user10724884
user10724884

Reputation:

apache nifi - split line of json by field value

In Apache Nifi, i want to split a line of a json file based on the content of a field delemited by comma.

This is an example of my input flowfile :

{
    "name":"app",
    "os":"linux",
    "instance":"instance1,instance2,instance3,instance4"
}

And this is want as output :

{
    "name":"app",
    "os":"linux",
    "instance":"instance1"
},
{
    "name":"app",
    "os":"linux",
    "instance":"instance2"
},
{
    "name":"app",
    "os":"linux",
    "instance":"instance3"
},
{
    "name":"app",
    "os":"linux",
    "instance":"instance4"
}

I need to know if it's possible to realise this task with the joltTransformJson processor or if i must do that with a script, in this case can you please show some similar examples scripts.

Thanks

Upvotes: 0

Views: 597

Answers (1)

daggett
daggett

Reputation: 28634

don't know about jolt.

with ExecuteGroovyProcessor you can do this transformation in a following way:

import groovy.json.*

def ff=session.get()
if(!ff)return
//read stream, convert to reader, parse to objects
def json=ff.read().withReader("UTF-8"){r-> new JsonSlurper().parse(r) }
//transform json
json = json.instance.split(',').collect{e-> json+[instance:e] }
//write
ff.write("UTF-8"){w-> new JsonBuilder(json).writeTo(w)}
//transfer to success
REL_SUCCESS<<ff

The same but for ExecuteScript processor:

import groovy.json.*

def ff=session.get()
if(!ff)return

ff = session.write(ff, {inputStream, outputStream ->
    def json=inputStream.withReader("UTF-8"){r-> new JsonSlurper().parse(r) }
    json = json.instance.split(',').collect{e-> json+[instance:e] }
    outputStream.withWriter("UTF-8"){w-> new JsonBuilder(json).writeTo(w)}
} as StreamCallback)

session.transfer(ff, REL_SUCCESS)

Upvotes: 2

Related Questions