Python: unable to replace special/foreign characters in a text file



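I have a text file named g.txt that contains a large number of German words. I want to replace the characters äõÖüÜß. I started with ü, which I want to replace with the HTML numeric character reference &#252;. The code below raises no error, but it does not work either. When I try it on ordinary letters the replacement works, but not with German umlauts.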

reading_file = open("g.txt", "r")
new_file_content = ""
for line in reading_file:
    stripped_line = line.strip()
    new_line = stripped_line.replace("ü", "&#252")
    new_file_content += new_line + "\n"
reading_file.close()
writing_file = open("g.txt", "w")
writing_file.write(new_file_content)
writing_file.close()

Any help would be appreciated!

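You should open the file with an explicit UTF-8 encoding. Without it, Python uses the platform's default encoding, which may decode ü as something else, so the replacement never matches. Try this code: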

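# Open the input explicitly as UTF-8 so that "ü" is decoded correctly.
reading_file = open("abc.data", "r", encoding="utf8")
new_file_content = ""
for line in reading_file.readlines():
    stripped_line = line.strip()
    # "&#252;" (with the trailing semicolon) is the HTML reference for "ü".
    new_line = stripped_line.replace("ü", "&#252;")
    new_file_content += new_line + "\n"
reading_file.close()
# Write the result with the same encoding.
writing_file = open("abc.data2", "w", encoding="utf8")
writing_file.write(new_file_content)
writing_file.close()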
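
To cover all of the characters from the question (and not only ü), one possible approach is to let Python's built-in "xmlcharrefreplace" error handler generate the numeric references. The following is a minimal sketch, assuming the input file really is UTF-8; the function names `to_char_refs` and `convert_file` are hypothetical and not part of the answer above. Unlike the loop above, it does not strip whitespace from each line.

```python
def to_char_refs(text: str) -> str:
    """Replace every non-ASCII character with an HTML numeric character reference."""
    # Characters that cannot be encoded as ASCII are emitted as &#NNN;,
    # where NNN is the decimal code point (for example 252 for "\u00fc").
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


def convert_file(src_path: str, dst_path: str, encoding: str = "utf-8") -> None:
    """Read src_path, convert non-ASCII characters, and write dst_path."""
    # The encoding is an assumption; adjust it if the file was saved differently.
    with open(src_path, "r", encoding=encoding) as src:
        content = src.read()
    with open(dst_path, "w", encoding=encoding) as dst:
        dst.write(to_char_refs(content))
```

For example, `convert_file("abc.data", "abc.data2")` would process the same files as the answer above. Note that this converts every non-ASCII character, which may be more than is wanted if only a few specific letters should be replaced.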

You may need to use the Unicode escape for the character you want to replace.

# "ü" is U+00FC (252 in decimal); "\u0252" would be a different character.
new_line = stripped_line.replace("\u00fc", "&#252;")
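
The escape and the literal character are the same string, so the escape only helps when the source file's own encoding is in doubt. A separate reason a replacement can silently fail is that the text stores ü in decomposed form (a plain "u" followed by the combining diaeresis U+0308). Whether that applies to the file in the question is an assumption; the sketch below, with the hypothetical helper `replace_u_umlaut`, normalizes to NFC before replacing so that both forms match.

```python
import unicodedata

# The escape "\u00fc" is code point 252 (0xFC), the precomposed u-umlaut.
assert "\u00fc" == chr(252)
assert ord("\u00fc") == 0xFC


def replace_u_umlaut(text: str) -> str:
    """Replace u-umlaut with &#252;, whether it is precomposed or decomposed."""
    # NFC composes "u" + U+0308 into the single code point U+00FC.
    composed = unicodedata.normalize("NFC", text)
    return composed.replace("\u00fc", "&#252;")


# Decomposed input: "u" followed by a combining diaeresis.
print(replace_u_umlaut("u\u0308ber"))
```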
