Regex : Split on comma , but exclude commas within parentheses and quotes(Both single & Double)

前端 未结 3 968
甜味超标
甜味超标 2021-01-15 13:55

I have one string

5,(5,5),C\'A,B\',\'A,B\',\',B\',\'A,\',\"A,B\",C\"A,B\" 

I want to split it on comma but need to exclude commas within pa

相关标签:
3条回答
  • 2021-01-15 14:13

    You can use this regex:

    String input = "5,(5,5),C'A,B','A,B',',B','A,',\"A,B\",C\"A,B\"";
    String[] toks = input.split( 
                    ",(?=(([^']*'){2})*[^']*$)(?=(([^\"]*\"){2})*[^\"]*$)(?![^()]*\\))" );
    for (String tok: toks)
        System.out.printf("<%s>%n", tok);
    

    Output:

    <5>
    <(5,5)>
    <C'A,B'>
    <'A,B'>
    <',B'>
    <'A,'>
    <"A,B">
    <C"A,B">
    

    Explanation:

    ,                         # Match literal comma
    (?=(([^']*'){2})*[^']*$)  # Lookahead to ensure comma is followed by even number of '
    (?=(([^"]*"){2})*[^"]*$)  # Lookahead to ensure comma is followed by even number of "
    (?![^()]*\\))             # Negative lookahead to ensure ) is not followed by matching
                              # all non [()] characters in between
    
    0 讨论(0)
  • 2021-01-15 14:25

    Instead of splitting the string, consider matching instead.

    String s  = "5,(5,5),C'A,B','A,B',',B','A,',\"A,B\",C\"A,B\"";
    Pattern p = Pattern.compile("(?:[^,]*(['\"])[^'\"]*\\1|\\([^)]*\\))|[^,]+");
    Matcher m = p.matcher(s);
    while (m.find()) {
      System.out.println(m.group());
    }
    

    Output

    5
    (5,5)
    C'A,B'
    'A,B'
    ',B'
    'A,'
    "A,B" 
    C"A,B"
    
    0 讨论(0)
  • 2021-01-15 14:26
    ,(?![^(]*\))(?![^"']*["'](?:[^"']*["'][^"']*["'])*[^"']*$)
    

    Try this.

    See demo.

    For java

    ,(?![^(]*\\))(?![^"']*["'](?:[^"']*["'][^"']*["'])*[^"']*$)
    
    0 讨论(0)
提交回复
热议问题