Plastikfan

Reputation: 4022

How to 'stream' input from 1 command to another in the same PowerShell pipeline without caching items

There is more to this question than it initially appears, but there is only so much text you can put into a title.

I have an existing PowerShell command that processes items from the pipeline using the begin/process and end blocks; this all works as expected. This command is an 'internal' command, meant to be invoked from another command as opposed to being an end-user function invoked interactively.

I now wish to write a second command that makes use of the first command and that also accepts input from the pipeline. The second function needs to use the same single pipeline as the first. However, the first function IS designed to be used interactively and is effectively a wrapper around the second function, which the user should not be concerned with.

Idiomatically, you would do something like this:

1..4 | first-command | second-command

but as I said before, second-command is a complicated command that would be clunky to use interactively. So I intend the user to do this instead:

1..4 | first-command

Where first-command handles the interaction with second-command as an internal implementation matter, all within a SINGLE pipeline. Also, you will note that I mentioned the word 'stream' in the title, meaning that first-command should not cache pipeline items, since the pipeline could be quite large.

I know what I'm actually asking may not be possible, but PowerShell is packed with surprises, which is why I ask the question.

I have written the following Pester test cases which illustrate what I am trying to achieve.

  Context 'given: pipeline variable defined as a scalar value' {
    It 'should: invoke pipeline in a single pass' {
      function invoke-first {
        param(
          [Parameter(ValueFromPipeline = $true)]
          [int]$pipelineItem
        )

        begin { Write-Host '>>> invoke-first [SCALAR] >>>'; }
        process { $pipelineItem | invoke-second }
        end { Write-Host '<<< invoke-first [SCALAR] <<<'; }
      }

      function invoke-second {
        param(
          [Parameter(ValueFromPipeline = $true)]
          [int]$pipelineItem
        )

        begin { Write-Host '>>> invoke-second [SCALAR] >>>'; }
        process { Write-Host "  [+] SECOND $($pipelineItem * 2)"; }
        end { Write-Host '<<< invoke-second [SCALAR] <<<'; }
      }
      # How to correctly 'stream' the items through in the same pipeline, instead of
      # 1 at a time, without caching the items?
      #
      1..4 | invoke-first
    }
  }

This displays the following:

>>> invoke-first [SCALAR] >>>
>>> invoke-second [SCALAR] >>>
  [+] SECOND 2
<<< invoke-second [SCALAR] <<<
>>> invoke-second [SCALAR] >>>
  [+] SECOND 4
<<< invoke-second [SCALAR] <<<
>>> invoke-second [SCALAR] >>>
  [+] SECOND 6
<<< invoke-second [SCALAR] <<<
>>> invoke-second [SCALAR] >>>
  [+] SECOND 8
<<< invoke-second [SCALAR] <<<
<<< invoke-first [SCALAR] <<<

The problem this shows is that this line of code (in invoke-first):

process { $pipelineItem | invoke-second }

is creating a new, separate single-item pipeline for each item it receives. This is denoted by the fact that we see '>>> invoke-second [SCALAR] >>>' and '<<< invoke-second [SCALAR] <<<' for every pipeline item. This is not what was intended.

I also tried changing the definition of pipelineItem to an array ("[int[]]$pipelineItem"), but this does NOT make the desired difference.

As previously stated, in order to achieve the outcome I require, we would need to invoke invoke-second on the command line (but this is exactly what I'm trying to avoid):

  Context 'given: pipeline variable defined as a piped scalar value' {
    It 'should: invoke pipeline in a single pass' {
      function invoke-first {
        param(
          [Parameter(ValueFromPipeline = $true)]
          [int]$pipelineItem
        )

        begin { Write-Host '>>> invoke-first [PIPED-SCALAR] >>>'; }
        process { $pipelineItem }
        end { Write-Host '<<< invoke-first [PIPED-SCALAR] <<<'; }
      }

      function invoke-second {
        param(
          [Parameter(ValueFromPipeline = $true)]
          [int]$pipelineItem
        )

        begin { Write-Host '>>> invoke-second [PIPED-SCALAR] >>>'; }
        process { Write-Host "  [+] SECOND $($pipelineItem * 2)"; }
        end { Write-Host '<<< invoke-second [PIPED-SCALAR] <<<'; }
      }
      # Don't like this because invoke-second is a complicated internal function
      # that the user should not need to know about and would be cumbersome in
      # an interactive session.
      #
      1..4 | invoke-first | invoke-second
    }
  }

produces this as output:

>>> invoke-first [PIPED-SCALAR] >>>
>>> invoke-second [PIPED-SCALAR] >>>
  [+] SECOND 2
  [+] SECOND 4
  [+] SECOND 6
  [+] SECOND 8
<<< invoke-first [PIPED-SCALAR] <<<
<<< invoke-second [PIPED-SCALAR] <<<

... and '>>> invoke-second [PIPED-SCALAR] >>>' and '<<< invoke-second [PIPED-SCALAR] <<<' are each displayed just once, indicating there is only 1 pipeline.

If we cache the pipeline items in the first command:

  Context 'given: cached pipeline variable defined as a scalar value' {
    It 'should: invoke pipeline in a single pass' -Tag 'Current' {
      function invoke-first {
        param(
          [Parameter(ValueFromPipeline = $true)]
          [int]$pipelineItem
        )
        # This method cheats, because it caches the items into a collection
        #
        begin { $coll = @(); Write-Host '>>> invoke-first [CACHED-SCALAR] >>>'; }
        process { $coll += $pipelineItem }
        end { $coll | invoke-second; Write-Host '<<< invoke-first [CACHED-SCALAR] <<<'; }
      }

      function invoke-second {
        param(
          [Parameter(ValueFromPipeline = $true)]
          [int]$pipelineItem
        )

        begin { Write-Host '>>> invoke-second [CACHED-SCALAR] >>>'; }
        process { Write-Host "  [+] SECOND $($pipelineItem * 2)"; }
        end { Write-Host '<<< invoke-second [CACHED-SCALAR] <<<'; }
      }
      1..4 | invoke-first;
    }
  }

which produces this output:

>>> invoke-first [CACHED-SCALAR] >>>
>>> invoke-second [CACHED-SCALAR] >>>
  [+] SECOND 2
  [+] SECOND 4
  [+] SECOND 6
  [+] SECOND 8
<<< invoke-second [CACHED-SCALAR] <<<
<<< invoke-first [CACHED-SCALAR] <<<

So the root of my problem is the interaction of the pipeline from the first command:

process { $pipelineItem | invoke-second }

How can we pipe the items to invoke-second, without caching and without forcing the user to invoke invoke-second? Again, I realise I may be barking up the wrong tree here, but I'm hoping there is a different technique that I can use that I'm not aware of.

EDIT: Integrating the pipelines of invoke-first and invoke-second

Currently, both commands define a parameter that accepts input from the pipeline, e.g.:

[Parameter(ParameterSetName = 'InvokeScriptBlock', Mandatory, ValueFromPipeline = $true)]
[Parameter(ParameterSetName = 'InvokeFunction', Mandatory, ValueFromPipeline = $true)]
[System.IO.FileSystemInfo]$pipelineItem,

At the moment invoke-first has its own begin/process/end blocks, but it caches items into a temporary collection in the process block and then, in the end block, pipes them into invoke-second all at once. This is what I want to get rid of:

process-block(invoke-first):

$collection += $pipelineItem;

end-block(invoke-first):

$collection | Invoke-Second @parameters

Somehow, I need to feed the pipeline of invoke-second from invoke-first, via the proxy.

With the proxy command, we now have 3 sets of begin/process/end blocks which need to be integrated into a single pipeline, without making any changes to the code in invoke-second, because it is used in other contexts.
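For context, the mechanism that makes this feasible is the runtime's steppable pipeline: invoke-first can build a pipeline for invoke-second once, then drive its begin/process/end blocks itself, so items stream through without being cached. A minimal hand-written sketch (not generated proxy code, and using the simple [int] signatures from the question):

```powershell
function invoke-first {
    param(
        [Parameter(ValueFromPipeline = $true)]
        [int]$pipelineItem
    )

    begin {
        # Build a pipeline for invoke-second once, then step it manually
        $scriptCmd = { invoke-second }
        $steppable = $scriptCmd.GetSteppablePipeline()
        $steppable.Begin($PSCmdlet)
    }
    process {
        # Feed each incoming item into invoke-second's process block; no caching
        $steppable.Process($pipelineItem)
    }
    end {
        # Runs invoke-second's end block exactly once
        $steppable.End()
    }
}
```

With this version, `1..4 | invoke-first` should print invoke-second's begin and end banners once each, which is the single-pass behaviour the test asserts.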

Upvotes: 1

Views: 995

Answers (1)

Mathias R. Jessen

Reputation: 174825

This can be solved with a proxy command - a proxy command is, as the name implies, a way to "wrap" a command in a new one, while retaining the pipeline behavior you'd expect when invoking the target command directly.

Implementing this pattern can be a bit convoluted, so instead of writing it from scratch I'm gonna show you how to generate the source code for a proxy function using the built-in [ProxyCommand] helper class!

In the following, I'll be using the terms "proxy"/"proxy command"/"proxy function" interchangeably to refer to the function that'll replace Invoke-First from your example, and "target command"/"target function" to refer to the internal function we're trying to wrap, Invoke-Second in this case.

Generating a proxy command

These are the steps required to generate a proxy function in PowerShell:

using namespace System.Management.Automation

# Start by making sure the target function is discoverable (note on module-scoped proxies below)
function Invoke-Second {
    param(
      [Parameter(ValueFromPipeline = $true)]
      [int]$pipelineItem,

      [Parameter()]
      [ValidateRange(1,100)]
      [int]$Factor = 2
    )

    begin { Write-Host '>>> invoke-second [PIPED-SCALAR] >>>'; }
    process { Write-Host "  [+] SECOND $($pipelineItem * $factor)"; }
    end { Write-Host '<<< invoke-second [PIPED-SCALAR] <<<'; }
}

# Discover corresponding CommandInfo object using Get-Command
$InvokeSecondCmdInfo = Get-Command Invoke-Second

# Extract metadata for the CommandInfo object
$InvokeSecondCmdMeta = [CommandMetaData]::new($InvokeSecondCmdInfo)

# Create plain proxy command stub
$InvokeSecondProxyCmd = [ProxyCommand]::Create($InvokeSecondCmdMeta)

# (optional) Write proxy function definition to file
$InvokeSecondProxyCmd |Set-Content ".\Invoke-SecondProxy.ps1"

(The parameterization of $Factor is intentional, it'll make sense in a second, I promise)

The resulting function will act exactly as if the caller had invoked Invoke-Second directly in a nested pipeline:

PS C:\Users\Plastikfan> 1..4 |.\Invoke-SecondProxy.ps1
>>> invoke-second [PIPED-SCALAR] >>>
  [+] SECOND 2
  [+] SECOND 4
  [+] SECOND 6
  [+] SECOND 8
<<< invoke-second [PIPED-SCALAR] <<<

That would be it for the sample function you posted in the question - we're basically done - but I'm gonna assume that in your real-life use case you will not have a 1-to-1 relationship between the parameters you want to expose publicly and the parameters accepted by the internal target command - so we'll tackle that below as well.

Module-scoped proxies

Executing the code above will result in a proxy function that will work as expected for a simple script module that defines the same Invoke-Second function.

That being said, I'd strongly recommend executing the code to generate the proxy function inside the target module scope. This is simply to ensure exact and appropriate discoverability of the target function at runtime, since the proxy function itself will also be executing in module scope.

To execute a script or script block in a specific module scope, pass the module as the first argument to the & call operator:

$targetModule = Get-Module MyModule
& $targetModule .\GenerateProxy.ps1

... where GenerateProxy.ps1 is the same as the example above, but without the inline function Invoke-Second ... definition - we want Get-Command to discover the "real one" from the $targetModule.
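A minimal GenerateProxy.ps1 along those lines might look like this (the file names and module name are illustrative):

```powershell
# GenerateProxy.ps1 -- run inside the target module's scope via:
#   & (Get-Module MyModule) .\GenerateProxy.ps1
using namespace System.Management.Automation

# Resolve the real Invoke-Second from the module scope we're executing in
$cmdInfo = Get-Command Invoke-Second
$cmdMeta = [CommandMetadata]::new($cmdInfo)

# Emit the proxy function body to a file for further editing
[ProxyCommand]::Create($cmdMeta) |Set-Content .\Invoke-SecondProxy.ps1
```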


Modifying user-facing parameter sets

Let's first take a look at the param and begin blocks of the generated proxy function definition - this is where we have a chance to modify explicitly bound parameter arguments before invoking the target function.

[CmdletBinding()]
param(
    [Parameter(Position=0, ValueFromPipeline=$true)]
    [int]
    ${pipelineItem},

    [Parameter(Position=1)]
    [ValidateRange(1, 100)]
    [int]
    ${Factor})
begin
{
    try {
        $outBuffer = $null
        if ($PSBoundParameters.TryGetValue('OutBuffer', [ref]$outBuffer))
        {
            $PSBoundParameters['OutBuffer'] = 1
        }
        $wrappedCmd = $ExecutionContext.InvokeCommand.GetCommand('invoke-second', [System.Management.Automation.CommandTypes]::Function)
        $scriptCmd = {& $wrappedCmd @PSBoundParameters }
        $steppablePipeline = $scriptCmd.GetSteppablePipeline()
        $steppablePipeline.Begin($PSCmdlet)
    } catch {
        throw
    }
}

As you can see, the param block is effectively just a 1-to-1 copy of the target command's param block!

We can modify the declared parameters as we see fit, we just need to make sure we don't pass anything off to the target command that it doesn't accept.

Moving on to the begin block, the first 4 lines inside the try block ($outBuffer = ...) are meant to prevent the runtime processor from "double buffering" the output from the target command in the wrapper; you can ignore those and just leave them as-is. I'd personally suggest adding our custom parameter modification code immediately after this block, but anywhere in the begin block prior to the definition of $scriptCmd will work just fine.
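For completeness, the remainder of the generated stub just steps the pipeline once per input item and then finalizes it. The blocks emitted by [ProxyCommand]::Create typically look roughly like this:

```powershell
process
{
    try {
        # Forward each incoming pipeline item to the wrapped command's process block
        $steppablePipeline.Process($_)
    } catch {
        throw
    }
}

end
{
    try {
        # Run the wrapped command's end block exactly once
        $steppablePipeline.End()
    } catch {
        throw
    }
}
```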

Pruning proxy parameters

If you want to prevent the caller from controlling specific parameter arguments passed to the target command, simply remove them from the proxy function's param block:

param(
    [Parameter(Position=0, ValueFromPipeline=$true)]
    [int]
    ${pipelineItem})

If you want to pass a specific argument value for the parameter in question, add it to the $PSBoundParameters dictionary inside the begin block:

$PSBoundParameters['Factor'] = 10

If you don't want to pass the parameter to the target command at all, no modification is needed beyond its removal from param.

Adding extra parameters to the proxy

You might also want to accept parameter arguments that should not be passed to the target command. In this case, we first need to add the new parameter definition to the param block in the proxy:

param(
    [Parameter(Position=0, ValueFromPipeline=$true)]
    [int]
    ${pipelineItem},

    [Parameter(Position=1)]
    [ValidateRange(1, 100)]
    [int]
    ${Factor},

    [Parameter(Position=2)]
    [int]
    ${NotRelevantToInvokeSecond})

The caller can now pass -NotRelevantToInvokeSecond 123 to the proxy, so we need to remove it from $PSBoundParameters in begin - otherwise the target command will throw a parameter binding exception:

if($PSBoundParameters.ContainsKey('NotRelevantToInvokeSecond')){
  # $null = ... suppresses the boolean that Remove() would otherwise emit
  $null = $PSBoundParameters.Remove('NotRelevantToInvokeSecond')
}

If you have multiple extra proxy parameters like this, you might want to use a simple loop to take care of removal:

'ExtraParam1','ExtraParam2','AnotherParam' |ForEach-Object {
  if($PSBoundParameters.ContainsKey($_)){
    $null = $PSBoundParameters.Remove($_)
  }
}

Next steps

This should get you started, but you can do whatever you want with the proxy function as long as you ensure the splatting arg passed to $wrappedCmd in the $scriptCmd block is valid for the target command - your imagination sets the limit here :)
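To tie the pieces together, here's a sketch of what a finished proxy might look like once you've pruned -Factor from the public surface, pinned its value, and added a proxy-only switch (the -Quiet parameter and the pinned value 10 are illustrative choices, not part of the generated output):

```powershell
function Invoke-First {
    [CmdletBinding()]
    param(
        [Parameter(Position = 0, ValueFromPipeline = $true)]
        [int]$pipelineItem,

        # Proxy-only parameter; stripped before invoking the target
        [Parameter()]
        [switch]$Quiet
    )

    begin {
        try {
            # Pin the target's -Factor and drop the proxy-only switch
            $PSBoundParameters['Factor'] = 10
            $null = $PSBoundParameters.Remove('Quiet')

            $wrappedCmd = $ExecutionContext.InvokeCommand.GetCommand(
                'Invoke-Second', [System.Management.Automation.CommandTypes]::Function)
            $scriptCmd = { & $wrappedCmd @PSBoundParameters }
            $steppablePipeline = $scriptCmd.GetSteppablePipeline()
            $steppablePipeline.Begin($PSCmdlet)
        } catch { throw }
    }
    process { try { $steppablePipeline.Process($_) } catch { throw } }
    end     { try { $steppablePipeline.End() } catch { throw } }
}
```

Pipeline input still streams through one item at a time via Process($_), while the explicitly bound (non-pipeline) arguments are splatted once in begin.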

Upvotes: 3
