python - Convert octal to non-English text in a file



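I am trying to convert a file of non-English text that is encoded in octal into its native script and store the result in another file. The files involved are:

  • i_file: the original input file, containing the octal-encoded text
  • o_file: the output (target) file, which should contain the Kannada (non-English) text
  • octal_to_text.py: the Python program that should read the octal text and write the non-English text to the target file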

Sample i_file

340262270340263215-340262207+340262241340263215
340262205-340262246340263215+340262255340263215
340262205-340262252340263215+340262252340263215
340262250340263215-340262205+340262265340263215
340262205+340262246340263215
340262266340263215-340262206+340262270340263215
340262246340263215-340262205+340262252340263215
340262244340263215-340262205+340262237340263215 340262250340263215-340262205+340262265340263215
340262247340263215-340262205
340262247340263215-340262212
340262205-340262234340263215+340262206
340262263340263215-340262205
340262263340263215-340262207
340262263340263215-340262211
340262212+340262263340263215 340262247340263215-340262212
sp
340262256340263215-340262217+340262262340263215
340262254340263215-340262216+340262237340263215
340262260340263215-340262205+340262271340263215
340262252340263215+340262260340263215 340262205-340262252340263215+340262252340263215
340262265340263215-340262207+340262270340263215 340262270340263215-340262207+340262241340263215
340262217-340262225340263215+340262205
340262211+340262227340263215 340262263340263215-340262211
340262243340263215-340262205+340262227340263215
340262212-340262256340263215+340262254340263215
340262250340263215-340262216+340262263340263215
340262216+340262244340263215
340262205-340262260340263215+340262256340263215
340262260340263215+340262205
340262260340263215+340262206 340262260340263215+340262205
340262260340263215+340262207
340262260340263215+340262212 340262260340263215+340262207
340262260340263215+340262223 340262260340263215+340262207
340262255340263215-340262205+340262246340263215
340262205-340262247340263215+340262257340263215
340262211-340262237340263215+340262211
340262206+340262225340263215
340262205-340262260340263215 340262205-340262260340263215+340262256340263215
340262250340263215-340262250340263215+340262205
340262250340263215-340262250340263215+340262206 340262250340263215-340262250340263215+340262205
340262250340263215-340262250340263215+340262207 340262250340263215-340262250340263215+340262205
340262250340263215-340262250340263215+340262211 340262250340263215-340262250340263215+340262205
340262250340263215-340262250340263215+340262212 340262250340263215-340262250340263215+340262205
340262250340263215-340262250340263215+340262222 340262250340263215-340262250340263215+340262205
340262220-340262250340263215
340262206-340262252340263215+340262206 340262205-340262252340263215+340262252340263215
340262206-340262252340263215+340262223 340262205-340262252340263215+340262252340263215
340262222-340262246340263215+340262205
340262225340263215-340262222+340262237340263215
340262205-340262270340263215+340262205
340262205-340262270340263215+340262207
340262205-340262256340263215+340262270340263215 340262212-340262256340263215+340262254340263215
340262244340263215-340262205+340262260340263215
340262230340263215-340262205 340262263340263215-340262205
340262265340263215-340262206+340262227340263215 340262266340263215-340262206+340262270340263215
340262270340263215-340262205+340262270340263215 340262244340263215-340262205+340262260340263215
340262244340263215-340262211+340262265340263215
340262257340263215

The code I thought would work uses bytearray() to convert the text into a byte array, decodes it as utf-8, and writes the result to the target file.

octal_to_text.py

"""
Convert file contents from Octal to text
"""
with open('i_file','r') as tl, open('o_file','w+') as tk:
    for line in tl.readlines():
        line = (line.strip())
        br = bytearray(line)
        tk.write("{}\n".format(br.decode('utf-8')))

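With the code above, the generated output file is identical to the input; bytearray seems to do nothing. What exactly am I doing wrong? Could you provide a Python 2.7 solution?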

Note: the output file should contain characters like the following:

ಅ
ಆ
ಇ
ಈ
ಉ
ಊ
ಋ
ಎ
ಏ
ಐ
ಒ
ಓ
ಔ
ಕ್
ಖ್
ಗ್
ಘ್
ಚ್
ಛ್
ಜ್
ಝ್
ಟ್
ಠ್
ಡ್
ಢ್
ಣ್

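A print statement decodes an octal-encoded string into the non-English symbols only when the string is entered in the Python interpreter, where the octal escapes in the string literal are interpreted.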
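
The difference can be seen in a small sketch. It assumes the file stores each byte as a backslash escape such as \340\262\205 (the sample above shows only the digits, so the backslashes are an assumption); the variable names are illustrative only.

```python
# Escapes in a literal are interpreted by the Python parser: three bytes.
literal = b"\340\262\205"

# Text read from a file keeps the backslashes and digits as ordinary
# characters (assumed file content: \340\262\205), so it is twelve bytes.
from_file = b"\\340\\262\\205"

print(len(literal))      # 3
print(len(from_file))    # 12

# The literal decodes to the Kannada letter U+0C85.
print(literal.decode("utf-8") == u"\u0c85")    # True

# bytearray() only wraps the characters that are already there, so decoding
# the file text returns the same escape text unchanged.
round_trip = bytearray(from_file).decode("utf-8")
print(round_trip == u"\\340\\262\\205")        # True
```

This would explain why the output file in the question matches the input: the bytes handed to bytearray() are the ASCII characters of the escapes, not the bytes the escapes stand for.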

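So a simple workaround is to perform the following steps:

  1. Copy the contents of the file to be decoded.
  2. Open the Python interpreter and assign the file contents to a variable as a multi-line string.
  3. Print the variable.
  4. Copy the printed output into a new file.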
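
The manual steps above could also be scripted. The following is a minimal sketch, not a tested solution for the real data: it assumes every byte is written as exactly three octal digits, optionally preceded by a backslash, and that everything else on a line ('-', '+', spaces, words such as "sp") should be copied through unchanged. The names OCTAL_RUN, decode_octal_line and convert_file are hypothetical helpers, and the code is written to run on Python 2.7 as well as Python 3.

```python
import io
import re

# Assumption: a byte is three octal digits (000-377), optionally prefixed
# with a backslash. Consecutive bytes form one run that is decoded together.
OCTAL_RUN = re.compile(r"(?:\\?[0-3][0-7]{2})+")


def decode_octal_line(line):
    """Replace each run of octal byte codes in `line` with its UTF-8 text."""
    def _decode(match):
        digits = match.group(0).replace("\\", "")
        # Split the digits into groups of three and parse each as base 8.
        codes = [int(digits[i:i + 3], 8) for i in range(0, len(digits), 3)]
        # Raises UnicodeDecodeError if the bytes are not valid UTF-8.
        return bytearray(codes).decode("utf-8")
    return OCTAL_RUN.sub(_decode, line)


def convert_file(src="i_file", dst="o_file"):
    """Decode every line of `src` and write the result to `dst` as UTF-8."""
    with io.open(src, "r", encoding="ascii") as fin, \
         io.open(dst, "w", encoding="utf-8") as fout:
        for line in fin:
            fout.write(decode_octal_line(line.rstrip(u"\n")) + u"\n")
```

Calling convert_file() with the default names would read i_file and write o_file. Note that any ordinary three-digit number made of octal digits would also be treated as a byte under this assumption, so the pattern may need tightening for other inputs.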
