trim lines and shrink whitespaces using regex for multi line string

前端未结

关注

 5  1845

轮回少年

I~~\'m using a php function~~ want to create a function to trim all unnecessary white spaces from a multi line string.

The regex that it\'s not working is the one tha

相关标签:

5条回答

無奈伤痛

2021-01-22 06:28
```
 preg_replace('/*(.*) +?\n*$/', $content)
```
Live Demo
0 讨论(0)
发布评论:

提交评论
- 加载中...

南笙

2021-01-22 06:31

Use a two step approach:

<?php

$text = " first  line... abc   
 second  is  here... def   
  <-- blank space here
 fourth  line... hi  there   

 sith  is  here....   ";

// get rid of spaces at the beginning and end of line
$regex = '~^\ +|\ +$~m';
$text = preg_replace($regex, '', $text);

 // get rid of more than two consecutive spaces
$regex = '~\ {2,}~';
$text = preg_replace($regex, ' ', $text);
echo $text;

?>

See a demo on ideone.com.

0 讨论(0)

粉色の甜心

2021-01-22 06:32
You need to /gm instead of just /m

The code should become: (this code won't work, the update one will)
```
$patterns[] = ['/ +$/mg', ''];
```
Working example here: https://regex101.com/r/z3pDre/1

Update:

The g identifier, don't work like this. We need to replace preg_match with preg_match_all

Use the regex without g, like this:
```
$patterns[] = ['/ +$/m', ''];
```
0 讨论(0)
发布评论:

提交评论
- 加载中...
北恋

2021-01-22 06:45
You may redefine where $ matches using the (*ANYCRLF) verb.

See the following PHP demo:
```
$s = " ffffd    \r\n  bbb     ";
$n = preg_replace('~(*ANYCRLF)\h+$~m', '', $s); // if the string can contain Unicode chars,
echo $n;                                        // also add "u" modifier ('~(*ANYCRLF)\h+$~um')
```
Details:
- (*ANYCRLF) - specifies a newline convention: (*CR), (*LF) or (*CRLF)
- \h+ - 1+ horizontal whitespace chars
- $ - end of line (now, before CR or LF)
- ~m - multiline mode on ($ matches at the end of a line).
If you want to allow $ to match at any Unicode line breaks, replace (*ANYCRLF) with (*ANY).

See Newline conventions in the PCRE reference:
```
(*CR)        carriage return
(*LF)        linefeed
(*CRLF)      carriage return, followed by linefeed
(*ANYCRLF)   any of the three above
(*ANY)       all Unicode newline sequences
```
Now, if you need to
- Trim the lines from both start and end
- Shrink whitespaces inside the lines into just a single space
use
```
$s = " Ł    ę  d    \r\n  Я      ёb     ";
$n = preg_replace('~(*ANYCRLF)^\h+|\h+$|(\h){2,}~um', '$1', $s);
echo $n;
```
See the PHP demo.
0 讨论(0)
发布评论:

提交评论
- 加载中...
予麋鹿

2021-01-22 06:48

preg_replace ( mixed $pattern , mixed $replacement , mixed $subject [, int $limit = -1 [, int &$count ]] )

so you want preg_replace('/[\s]+$/m', '', $string)

0 讨论(0)
发布评论:

提交评论
- 加载中...