
How could I use regular expressions to split Chinese characters at the right position?

Keywords: regular expressions, splitting Chinese characters, python, python-3.x
Updated: 2023-09-12

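I want to split some lines at specific Chinese characters. The split itself works, but it happens at the wrong position. How can I split at the correct position?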

I am splitting the Chinese characters with a regular expression in Python 3.6:

import re

text = '我到家'
# The optional CJK character class sits inside the capturing group
pattern = r'([\u4e00-\u9fa5]?(?:到))'
compiled_pattern = re.compile(pattern)
new_str = compiled_pattern.split(text)
print(new_str)

The output of my code is:

['', '我到', '家']

But what I want is:

['我', '到', '家']

How can I fix this so that the characters are split at the right position?

Simply do this:

list('我到家')

I tried it, and the output is the same as what you want:

['我', '到', '家']
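Note that list() splits a string into single characters, so it matches the desired output here only because every piece happens to be one character long. A minimal sketch of the difference, using a hypothetical longer sentence (not from the question) and str.partition as a non-regex way to cut around one known character:

```python
text = '我今天到家'  # hypothetical longer sentence, not from the question

# list() yields one element per character, wherever you wanted to cut
chars = list(text)
print(chars)   # ['我', '今', '天', '到', '家']

# str.partition cuts once, at the first occurrence of the separator,
# and returns the separator as the middle element
before, sep, after = text.partition('到')
pieces = [before, sep, after]
print(pieces)  # ['我今天', '到', '家']
```

partition only handles the first occurrence of a single separator; for several separators or repeated occurrences, a regular expression is probably the better fit.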

Try the following:

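import re

text = '我到家'
# One capturing group per piece; findall returns a list of tuples of the groups
pattern = '(我)(到)(家)'
compiled_pattern = re.compile(pattern)
new_str = list(compiled_pattern.findall(text)[0])
print(new_str)  # ['我', '到', '家']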
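The pattern above hard-codes the whole string. A more general approach, sketched here under the assumption that the goal is to split around a known set of characters and keep them, is to capture only the delimiter. In the question's pattern, the optional character class [\u4e00-\u9fa5]? is inside the capturing group, so it consumes 我 together with 到. The helper name split_keep is hypothetical:

```python
import re

def split_keep(text, delimiters):
    """Split text at any of the given delimiter strings, keeping the delimiters."""
    # re.escape guards against delimiters that contain regex metacharacters
    pattern = '(' + '|'.join(re.escape(d) for d in delimiters) + ')'
    # A capturing group makes re.split return the delimiters as well;
    # drop the empty strings produced when a delimiter is at the start or end
    return [part for part in re.split(pattern, text) if part]

# The question's pattern: the optional character is captured along with 到
print(re.split(r'([\u4e00-\u9fa5]?(?:到))', '我到家'))  # ['', '我到', '家']

# Capturing only the delimiter splits at the intended position
print(split_keep('我到家', ['到']))                      # ['我', '到', '家']
```

Filtering out empty strings is a design choice; keep them if the position of each piece in the result matters.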
