如何匹配任何空的或非字母符号(python re)

我想写python re-expression来匹配字符串，比如"苹果"苹果&"苹果"等等。然而，我不想匹配像"；苹果"；或"；苹果"；。

如何做到这一点？如果我写r"；apple[，！.-]*"；，它还匹配"；苹果"；。

基本上我想匹配"；苹果"字符串末尾或非字母符号"；

假设我们想要匹配：

"苹果"字符串末尾或非字母符号"；

虽然非字母符号表示所有字符，但不是字母，我们应该能够匹配像apple;_-_>:Z这样的字符串，但不能匹配像appleZ;_-_>:这样的字符串。也就是说，很可能等效于以下情况：任何字符串，以apple开头，后跟除abc...xyz和ABC...XYZ之外的任何其他字符

在您的情况下，即：接受["apple", "apple!", "apple."]，拒绝["apples", "appler"]。

我们有几种可再生能源的选择来实现这一目标。根据Python文档，一个选项可能是(?!...)，称为负前瞻断言。另一个可能是[^...]，称为互补。

下面的代码为您的案例提供了一个相对简单的实现。

import re

def test_regex(list_should_match, list_should_NOT_match):
regex = "^apple(?![a-zA-Z]).*$"       # lookahead.       recommended
# regex="^apple[^a-zA-Z]*$"           # complementing.      works but less flexibility
ptn = re.compile(regex)
for str in list_should_match:
if ptn.match(str) == None:
return False
for str in list_should_NOT_match:
if ptn.match(str) != None:
return False
return True

# All cases you provided
list_should_match = ["apple", "apple!", "apple."]
list_should_NOT_match = ["apples", "appler"]
res = test_regex(list_should_match, list_should_NOT_match)
print("tests result:    %s" % ("PASS" if res else "X"))
# Some additional cases
list_should_match = ["apple;_-_>:Z", "apple,", "apple-",
"apple;", "apple[", "apple>123456789",
"apple123456789", "apple123456789ZZZ"]
list_should_NOT_match = ["appleZ;_-_>:", "appleABC", "appleZZZ;;;;"]
res = test_regex(list_should_match, list_should_NOT_match)
print("(additional)tests result:    %s" % ("PASS" if res else "X"))

相关内容

最新更新

热门标签：