发表新帖

发表新帖

Capturing parts of string using regular expression in R

前端未结

关注

 3  1848

盖世英雄少女心 2021-01-21 19:01

I have these strings:

myseq <- c(\"ALM_GSK_LN_06.ID\",\"AS04_LV_06.ID.png\",\"AS04_SP_06.IP.png\")

What I want to do is to capture parts of

3条回答

清酒与你 (楼主)

2021-01-21 19:30
Your regular expression incorrectly matches the prefix because [A-Z]+ only matches letters. To fix this simply change the first group to a greedy operator such as (.+), here is another solution.
```
library(gsubfn)
myseq <- c('ALM_GSK_LN_06.ID', 'AS04_LV_06.ID.png', 'AS04_SP_06.IP.png')
strapply(myseq, '(.+)_([A-Z]+)[^.]+\\.([A-Z]+)', c, simplify = rbind)

#      [,1]      [,2] [,3]
# [1,] "ALM_GSK" "LN" "ID"
# [2,] "AS04"    "LV" "ID"
# [3,] "AS04"    "SP" "IP"
```
0 讨论(0)

查看其它3个回答
发布评论:

提交评论
- 加载中...

热议问题