Running a snakemake pipeline for multiple datasets

Question

I have a Snakemake pipeline with rules that call other programs as well as custom R and Python scripts.

I need to run this same pipeline on multiple datasets. Usually I would make a separate folder for each dataset, put a dataset-specific config file in it, and run the pipeline on each one individually.

As I have 20+ datasets this time, I was wondering if there is a more automated way to do this. Four parameters change between the datasets: input file location, primer, quality control parameter, and output directory for results. Is there a way to have a 'master' config file that holds these 4 parameters for every dataset, plus a Snakefile that calls the second Snakefile once per dataset?

The whole idea looks to me like a for loop over arrays of these 4 parameters, but I can't figure out how to implement it in Snakemake.

Any suggestions and ideas are welcome! Thanks, Hena


Answers (1)
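
A minimal sketch of one possible approach, not a definitive implementation: keep a single master config with one entry per dataset and derive everything else from it, so the "for loop" becomes a list of targets that Snakemake resolves itself. The dataset names, paths, primer values, the `report.txt` file name, and the helper functions `dataset_param` and `all_targets` are all hypothetical and only serve as illustration. The dictionary is written inline here so the example runs as plain Python; in practice it would more likely live in a YAML file loaded with the `configfile:` directive.

```python
# Hypothetical master config: one entry per dataset, holding the four
# parameters that change between datasets (all values are made up).
MASTER_CONFIG = {
    "datasets": {
        "dataset_a": {
            "input": "data/a/reads.fastq",
            "primer": "PRIMER_A",
            "qc": 20,
            "outdir": "results/a",
        },
        "dataset_b": {
            "input": "data/b/reads.fastq",
            "primer": "PRIMER_B",
            "qc": 30,
            "outdir": "results/b",
        },
    }
}


def dataset_param(config, dataset, key):
    """Look up one parameter (input, primer, qc, outdir) for one dataset."""
    return config["datasets"][dataset][key]


def all_targets(config, filename="report.txt"):
    """Build one final target path per dataset.

    This replaces the explicit for loop: requesting every path in this
    list asks the workflow to produce results for every dataset.
    """
    return [
        f"{params['outdir']}/{filename}"
        for params in config["datasets"].values()
    ]


if __name__ == "__main__":
    for target in all_targets(MASTER_CONFIG):
        print(target)
```

Inside a Snakefile, the `config` dictionary filled by `configfile:` would take the place of `MASTER_CONFIG`. The list from `all_targets(config)` could serve as the input of a top-level `rule all`, and input or `params` functions, which receive a `wildcards` object, could call something like `dataset_param(config, wildcards.dataset, "primer")` to pick the per-dataset values. This assumes the output paths contain a `{dataset}` wildcard, which may mean the per-dataset output directories need to follow a common pattern.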
