Snakemake: how to produce multiple outputs from one input

Question

I am trying to make a snakemake workflow for Earth Observation applications and I have to download data from S3. First, I have a rule that query the data I need based on parameters in a file. The output of this rule is a list containing the data I need to download.

localrules: all

rule all:
    input:
        'results/test.csv'

rule query:
    input: 'input/{file}.csv'
    output: 'results/{file}.csv'
    shell: 'python search_catalog.py {input} {output}'

Now, I need to download those data. How can I make a rule to read the list and download each data listed and have it as the output of rule? Where can I read the content of results/something.csv and declare them in DATASET?

rull download:
    input: 'results/{file}.csv'
    output: expand('data/{file}' file=DATASET)
    shell: 'aws s3 cp s3://eodata/Sentinel-2/MSI/L2A/2024/01/15/{output}'

Snakemake: how to produce multiple outputs from one input

Answers (1)

Related Questions