发表新帖

发表新帖

Exclude everything after the second occurrence of a certain string

后端未结

关注

 2  1766

清酒与你 2021-01-18 07:39

I have the following string

string <- c(\'a - b - c - d\',
            \'z - c - b\',
            \'y\',
            \'u - z\')

I would

2条回答

醉梦人生 (楼主)

2021-01-18 08:00
Note that you cannot use a negated character class to negate a sequence of characters. [^ - ]*$ matches any 0+ chars other than a space (yes, it matches -, too, because the - created a range between a space and a space) followed by the end of the string marker ($).

You may use a sub function with the following regex:
```
^(.*? - .*?) - .*
```
to replace with \1. See the regex demo.

R code:
```
> string <- c('a - b - c - d', 'z - c - b', 'y', 'u - z')
> sub("^(.*? - .*?) - .*", "\\1", string)
[1] "a - b" "z - c" "y"     "u - z"
```
Details:
- ^ - start of a string
- (.*? - .*?) - Group 1 (referred to with the \1 backreference in the replacement pattern) capturing any 0+ chars lazily up to the first space, hyphen, space and then again any 0+ chars up to the next leftmost occurrence of space, hyphen, space
- - - a space, hyphen and a space
- .* - any zero or more chars up to the end of the string.
0 讨论(0)

查看其它2个回答
发布评论:

提交评论
- 加载中...

热议问题