各位先進,
我想從字串中提取出數字的部分。我的字串有規律性,結構上,開頭是2到3位的數字,空白,接著不等位數的字母,或字母和數字。我想提取的是開頭的部分。我試了兩個方法,方法1只拿出數字的最末位,方法2拿出完整的數字。我不知道方法1的regex寫法有什麼錯。
以下是我的字串及code
# Strings to extract
strings <- c('130 UDINE', '162 BF02', '163 AS04', '164 AL08', '165 BR12', '166 S
A13', '167 MA14', '167 MA14', '168 OC15', '85 BERGAMO')
# Method 1 to extract the beginning part of the strings (not working)
gsub(pattern = '^(\\d){2,3}(\\s).*', replacement = '\\1', x=strings)
# [1] '0' '2' '3' '4' '5' '6' '7' '7' '8' '5'
# Method 2 to extract the beginning part of the strings (not working)
gsub(pattern = '^(\\d+)(\\s).*', replacement = '\\1', x=strings)
# [1] '130' '162' '163' '164' '165' '166' '167' '167' '168' '85'
謝謝
--