r语言 - 提取 URL "/"中特定字符串之间的文本



我正在尝试收集特定字符集之前的所有内容

即,我有一个URL,例如以下

url = "https://www.somewebsiteLink.com/someDirectory/Directory/ascensor/163235494/d"
url2 = "https://www.somewebsiteLink.com/someDirectory/Directory/aire-acondicionado-calefaccion-ascensor/45837493/d

我想从链接中提取两件事:

链路1:ascensor163235494链路2:aire-acondicionado-calefaccion-ascensor45837493

因此,倒数第一个/之间的数字,以及倒数第二个/之间的文本。

/上拆分字符串并提取倒数第三个和第二个元素:

url = "https://www.somewebsiteLink.com/someDirectory/Directory/ascensor/163235494/d"
url2 = "https://www.somewebsiteLink.com/someDirectory/Directory/aire-acondicionado-calefaccion-ascensor/45837493/d"
urls = c(url, url2)
pieces = strsplit(urls, split = "/")
result = lapply(pieces, (x) x[length(x) - 2:1])
## for older R verions:
# result = lapply(pieces, function(x) x[length(x) - 2:1])
result                
# [[1]]
# [1] "ascensor"  "163235494"
# 
# [[2]]
# [1] "aire-acondicionado-calefaccion-ascensor" "45837493"        

最新更新