Applying a transform to one output tag

Question

I think I have a function which produces two outputs (please correct me if I'm wrong):

PCollection words = ...;

final TupleTag shortWordsTag = new TupleTag(){};

PCollectionTuple results =
     words.apply(
         ParDo
         .of(new DoFn() {
             @ProcessElement
             public void processElement(ProcessContext context) {
                 String word = context.element();
                 if (word.length() < 5) {
                     context.output(shortWordsTag, word);
                 } else {
                     context.output(word);
             }

Now I'd like to call another function, but only apply it one of those outputs. Something like this:

results.apply(
    ParDo
    .of(new DoFn() {
        @ProcessElement
        public void processElement(ProcessContext context) {
            String word = context.element();
            // do stuff, but should only have words with length < 5 here
    }
)

I can see some examples that use withOutputTags but this method seems to take more than one tag (a tag, and a list of tags), and I'm not sure how to use it for my scenario.

How can I specify my results.apply to be only called for the data which is outputted to shortWordsTag tag?

Applying a transform to one output tag

Answers (1)

Related Questions