How to do a partial expand in Snakemake?

后端 未结 3 1344
眼角桃花
眼角桃花 2021-01-13 13:29

I\'m trying to first generate 4 files, for the LETTERS x NUMS combinations, then summarize over the NUMS to obtain one file per element in LETTERS:

LETTERS =         


        
相关标签:
3条回答
  • 2021-01-13 13:40

    Partial expand is possible using allow_missing=True.

    For example:

    expand("text_{letter}_{num}.txt", num=NUMS, allow_missing=True)
    
    
    ["text_{letter}_1.txt", "text_{letter}_2.txt"]
    
    0 讨论(0)
  • 2021-01-13 13:42

    Update (25/11/2020): As per this answer, partial expands are now possible without multi-bracketing, thanks to the allow_missing argument of expand.


    It seems that this is not a limitation of expand, but a limitation of my familiarity with the way string-formatting works in python. I need to use double brackets for the non-expanded wildcard:

    LETTERS = ["A", "B"]
    NUMS = ["1", "2"]
    
    
    rule all:
        input:
            expand("combined_{letter}.txt", letter=LETTERS)
    
    rule generate_text:
        output:
            "text_{letter}_{num}.txt"
        shell:
            """
            echo "test" > {output}
            """
    
    rule combine text:
        input:
            expand("text_{{letter}}_{num}.txt", num=NUMS)
        output:
            "combined_{letter}.txt"
        shell:
            """
            cat {input} > {output}
            """
    

    Executing this snakefile now generates the expected following files:

    text_A_2.txt
    text_A_1.txt
    text_B_2.txt
    text_B_1.txt
    combined_A.txt
    combined_B.txt
    
    0 讨论(0)
  • 2021-01-13 13:57

    Indeed, braces need to be escaped when you want to ignore them in expand. It relies on str.format, and hence any rules from format apply to expand as well.

    0 讨论(0)
提交回复
热议问题