Nifi obtain filename from csv column value

Question

I am reading from a database and making a csv out of it using QueryDatabaseRecord and ConvertRecord processors.

I want one of the columns in the csv to be extracted and used to name my CSV file that will be stored locally on my system via PutFile or in S3.

CSV looks like,

cola,colb,colc,date
A1,123,vin9,2020-02-04
A2,456,vin9,2020-02-04
A3,789,vin9,2020-02-04

I want to extract just the first row's colc and date field to produce a filename called vin9-2020-02-04.csv for my output dump.

Which processor can help me achieve this? Thanks!

notNull · Accepted Answer

We can do that using ExecuteStreamCommand + UpdateAttribute Processor!

Flows:

Option1:

1.QueryDatabaseRecord

2.ConvertRecord

3.ExecuteStreamCommand

Sample Shell Script:

This script gets file content and get only second line into attr attribute to the flowfile.

$ cat second_line.sh
#!/bin/sh
cat $1 |head -2|tail -1

4.UpdateAttribute:

Add new property:

filename

${attr:replaceAll('.+?,.+?,(.*)','$1'):replace(',','-')}.csv

Output flowfile from Updateattribute processor will have the desired filename.

Option2:

Another way would be using Extract Text processor:

Add new property as:

attr

^.* ? (.*)

once we get attr value as second line data then use UpdateAttribute processor to change the filename.

Answers (2)