trim lines and shrink whitespaces using regex for multi line string

前端 未结 5 1845
轮回少年
轮回少年 2021-01-22 05:53

I\'m using a php function want to create a function to trim all unnecessary white spaces from a multi line string.

The regex that it\'s not working is the one tha

相关标签:
5条回答
  • 2021-01-22 06:28
     preg_replace('/*(.*) +?\n*$/', $content)
    

    Live Demo

    0 讨论(0)
  • 2021-01-22 06:31

    Use a two step approach:

    <?php
    
    $text = " first  line... abc   
     second  is  here... def   
      <-- blank space here
     fourth  line... hi  there   
    
     sith  is  here....   ";
    
    // get rid of spaces at the beginning and end of line
    $regex = '~^\ +|\ +$~m';
    $text = preg_replace($regex, '', $text);
    
     // get rid of more than two consecutive spaces
    $regex = '~\ {2,}~';
    $text = preg_replace($regex, ' ', $text);
    echo $text;
    
    ?>
    

    See a demo on ideone.com.

    0 讨论(0)
  • 2021-01-22 06:32

    You need to /gm instead of just /m

    The code should become: (this code won't work, the update one will)

    $patterns[] = ['/ +$/mg', ''];
    

    Working example here: https://regex101.com/r/z3pDre/1

    Update:

    The g identifier, don't work like this. We need to replace preg_match with preg_match_all

    Use the regex without g, like this:

    $patterns[] = ['/ +$/m', ''];
    
    0 讨论(0)
  • 2021-01-22 06:45

    You may redefine where $ matches using the (*ANYCRLF) verb.

    See the following PHP demo:

    $s = " ffffd    \r\n  bbb     ";
    $n = preg_replace('~(*ANYCRLF)\h+$~m', '', $s); // if the string can contain Unicode chars,
    echo $n;                                        // also add "u" modifier ('~(*ANYCRLF)\h+$~um')
    

    Details:

    • (*ANYCRLF) - specifies a newline convention: (*CR), (*LF) or (*CRLF)
    • \h+ - 1+ horizontal whitespace chars
    • $ - end of line (now, before CR or LF)
    • ~m - multiline mode on ($ matches at the end of a line).

    If you want to allow $ to match at any Unicode line breaks, replace (*ANYCRLF) with (*ANY).

    See Newline conventions in the PCRE reference:

    (*CR)        carriage return
    (*LF)        linefeed
    (*CRLF)      carriage return, followed by linefeed
    (*ANYCRLF)   any of the three above
    (*ANY)       all Unicode newline sequences
    

    Now, if you need to

    • Trim the lines from both start and end
    • Shrink whitespaces inside the lines into just a single space

    use

    $s = " Ł    ę  d    \r\n  Я      ёb     ";
    $n = preg_replace('~(*ANYCRLF)^\h+|\h+$|(\h){2,}~um', '$1', $s);
    echo $n;
    

    See the PHP demo.

    0 讨论(0)
  • 2021-01-22 06:48

    preg_replace ( mixed $pattern , mixed $replacement , mixed $subject [, int $limit = -1 [, int &$count ]] )

    so you want preg_replace('/[\s]+$/m', '', $string)

    0 讨论(0)
提交回复
热议问题