python regex repetition with capture question

后端 未结 2 1290
庸人自扰
庸人自扰 2021-01-23 03:34

using python3\'s regex capabilities, is it possible to capture variable numbers of capture blocks, based on the number of the repetitions found? for instance, in the following

相关标签:
2条回答
  • 2021-01-23 04:10

    This will match all the numbers before the dot:

    s = "zzz98zzz67zzz89zzz45vdvd55lplp111.mp3"
    res = re.findall("[0-9]+(?=.*\\.)", s)
    print(res)
    
    0 讨论(0)
  • 2021-01-23 04:11

    Most or all regular expression engines in common use, including in particular those based on the PCRE syntax (like Python's), label their capturing groups according to the numerical index of the opening parenthesis, as the regex is written. So no, you cannot use capturing groups alone to extract an arbitrary, variable number of subsequences from a string.

    The closest you can get (as far as I know) is to manually write out a certain number of capturing groups, something like this:

    s = ...
    res = re.match(r'\D*' + 25 * r'(\d+)\D+')
    numbers = [r for r in res.groups() if r is not None]
    

    This will get you up to 25 groups of digits. If you need more, replace 25 with some higher number.

    I wouldn't be surprised if this were less efficient than the iterative approach with findall(), although I haven't tested it.

    0 讨论(0)
提交回复
热议问题