How to set ignorecase flag for part of regular expression in Python?

后端 未结 3 1418
臣服心动
臣服心动 2020-12-19 07:54

Is it possible to implement in Python something like this simple one:

#!/usr/bin/perl
my $a = \'Use HELLO1 code\';
if($a =~ /(?i:use)\\s+([A-Z0-9]+)\\s+(?i:c         


        
相关标签:
3条回答
  • 2020-12-19 08:23

    According to the docs, this is not possible. The (?x) syntax only allows you to modify a flag for the whole expression. Therefore, you must split this into three regexp and apply them one after the other or do the "ignore case" manually: /[uU][sS][eE]...

    0 讨论(0)
  • 2020-12-19 08:37

    Since python 3.6 you can use flag inside groups :

    (?imsx-imsx:...)

    (Zero or more letters from the set 'i', 'm', 's', 'x', optionally followed by '-' followed by one or more letters from the same set.) The letters set or removes the corresponding flags: re.I (ignore case), re.M (multi-line), re.S (dot matches all), and re.X (verbose), for the part of the expression.

    Thus (?i:use) is now a correct syntaxe. From a python3.6 terminal:

    >>> import re
    >>> regex = re.compile('(?i:use)\s+([A-Z0-9]+)\s+(?i:code)')
    >>> regex.match('Use HELLO1 code')
    <_sre.SRE_Match object; span=(0, 15), match='Use HELLO1 code'>
    >>> regex.match('use HELLO1 Code')
    <_sre.SRE_Match object; span=(0, 15), match='use HELLO1 Code'>
    
    0 讨论(0)
  • 2020-12-19 08:43

    As far as I could find, the python regular expression engine does not support partial ignore-case. Here is a solution using a case-insensitive regular expression, which then tests if the token is uppercase afterward.

    #! /usr/bin/env python
    
    import re
    
    token_re = re.compile(r'use\s+([a-z0-9]+)\s+code', re.IGNORECASE)
    def find_token(s):
        m = token_re.search(s)
        if m is not None:
            token = m.group(1)
            if token.isupper():
                return token
    
    if __name__ == '__main__':
        for s in ['Use HELLO1 code',
                  'USE hello1 CODE',
                  'this does not match',
                 ]:
            print s, '->',
            print find_token(s)
    

    Here is the program's output:

    Use HELLO1 code -> HELLO1
    USE hello1 CODE -> None
    this does not match -> None
    
    0 讨论(0)
提交回复
热议问题