
Break a string into words using the scan method + regexp; if a word has a `'` character, drop that character and everything after it

Updated: 2024-03-24

sample_string = "let's could've they'll you're won't"
sample_string.scan(/\w+/)

The above gives me:

["let", "s", "could", "ve", "they", "ll", "you", "re", "won", "t"]

What I want:

["let", "could", "they", "you", "won"]

I have been experimenting on https://rubular.com/ with assertions like \w+(?<='), but with no luck.

Given:

> sample_string = "let's could've they'll you're won't"

You can split and map:

> sample_string.split.map{|w| w.split(/'/)[0]}
=> ["let", "could", "they", "you", "won"]
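The split-and-map approach also handles words without an apostrophe and words with more than one. A minimal sketch, assuming a made-up input string (`mixed` is not from the original question):

```ruby
# Hypothetical input mixing plain words and words with one or more apostrophes.
mixed = "don't stop the rock'n'roll"

# Split on whitespace, then keep only the part before the first apostrophe.
words = mixed.split.map { |w| w.split("'").first }

p words
# => ["don", "stop", "the", "rock"]
```

Note that splitting on whitespace leaves any punctuation attached to a word, so this only suits input shaped like the sample.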

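You can use

sample_string.scan(/(?<![\w'])\w+/)
sample_string.scan(/\b(?<!')\w+/)

See the Rubular demo. The patterns (they are exact synonyms) match one or more word characters (\w+) at a position that is not immediately preceded by a word character or a ' character.

See the Ruby demo: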

sample_string = "let's could've they'll you're won't"
p sample_string.scan(/(?<![\w'])\w+/)
# => ["let", "could", "they", "you", "won"]
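To check that the two patterns behave the same, both can be run side by side. This is only a sketch: the constant names and the `extra` string are illustrative assumptions, not part of the original answer.

```ruby
sample_string = "let's could've they'll you're won't"

# Names chosen here for illustration only.
NOT_AFTER_WORD_OR_QUOTE = /(?<![\w'])\w+/  # no word char or ' right before the match
BOUNDARY_NOT_AFTER_QUOTE = /\b(?<!')\w+/   # word boundary that does not follow a '

first  = sample_string.scan(NOT_AFTER_WORD_OR_QUOTE)
second = sample_string.scan(BOUNDARY_NOT_AFTER_QUOTE)
p first == second
# => true

# Hypothetical extra input: plain words, punctuation, and two apostrophes in one word.
extra = "plain words, rock'n'roll"
p extra.scan(NOT_AFTER_WORD_OR_QUOTE)
# => ["plain", "words", "rock"]
```

Unlike the split-and-map version, scan with \w+ drops punctuation such as the comma, because only word characters are matched.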
