从rtf文件中提取超链接和文本?


  • rtf_to_text正在从文件读取文本并转义超文本。是否有任何python模块可从rtf文件读取文本和超文本?

with open(r"C:UsersDocumentsfile_name.rtf") as infile:
content = infile.read()
text = rtf_to_text(content)
print(text)

输入:-获取最新消息在ndtv.com

current_output:-获取最新消息

desired_output -获取最新消息在ndtv.com

使用pyth3库读取RTF文件。

代码:

from pyth.plugins.rtf15.reader import Rtf15Reader
from pyth.plugins.plaintext.writer import PlaintextWriter
with open(filename, "rb") as rtf_file:
doc = Rtf15Reader.read(rtf_file)
text = PlaintextWriter.write(doc).getvalue()

希望这对你有帮助!

最新更新