我有一个输入文本文件,如下所示,保存为 12.txt:
[(442, 165), (442, 184), (487, 165), (487, 184)],english
我的目标是从此文件中删除所有特殊字符并覆盖它:我正在使用下面的python脚本 :
import os
import numpy as np
import math
import cv2 as cv
#path = '/media/D/code/OCR/text-detection-ctpn/data/mlt_english+chinese/image'
gt_file = '12.txt'
with open(gt_file, 'r+') as f:
for line in f.readlines():
line = line.replace("[", "")
line = line.replace("(", "")
line = line.replace(")", "")
line = line.replace("]", "")
line = line.replace(" ", "")
f.write(line)
但是它给了我这个输出:
[(234, 162), (234, 183), (307, 162), (307, 183)],english 234,162,234,183,307,162,307,183,english
我不希望附加输出,如上所示,我希望输出覆盖 12.txt。文件即 12.txt运行 python 脚本后应如下所示:
234,162,234,183,307,162,307,183,english
我已经参考了 Python 替换和覆盖而不是附加,但我缺少一些东西
您需要打开临时文件并从文件中读取,删除旧文件并重命名为新名称
import os
import numpy as np
import math
import cv2 as cv
#path = '/media/D/code/OCR/text-detection-ctpn/data/mlt_english+chinese/image'
gt_file = '12.txt'
output = open("temp.txt","w")
with open(gt_file, 'r') as f:
for line in f:
line = line.replace("[", "")
line = line.replace("(", "")
line = line.replace(")", "")
line = line.replace("]", "")
line = line.replace(" ", "")
output.write(line)
output.close()
os.remove(gt_file) # remove old file
os.rename("temp.txt",gt_file) # rename as old file
将两个过程分开,它更干净。
#read and edit lines
to_write = [] #store edited lines
with open(gt_file, 'r') as f:
for line in f.readlines():
...
to_write.append(line)
#write back edited lines
with open(gt_file, 'w') as f:
for line in to_write:
f.write(line)
原因是,你的文件指针已经到达了末尾。给定文本文件中的一行,seek(( 和 truncate(( 函数可用于重现所需的结果。
import os
import numpy as np
import math
gt_file = '12.txt'
with open(gt_file, 'r+') as f:
for line in f.readlines():
line = line.replace("[", "")
line = line.replace("(", "")
line = line.replace(")", "")
line = line.replace("]", "")
line = line.replace(" ", "")
f.seek(0)
f.truncate()
f.write(line)
我用了这个
file = open('test_file', 'r')
data = file.read()
file.close()
data = data.replace("[", "")
data = data.replace("(", "")
data = data.replace(")", "")
data = data.replace("]", "")
data = data.replace(" ", "")
print(data, file=open('test', 'w'))
检查这是否适合您。