使用python faliure在文件中搜索字符串

我使用此代码在特定文件中搜索电子邮件，并将它们写入另一个文件。我使用了"in"运算符来确保电子邮件不重复。但是此代码不会在for line in f:行之后执行。有人能指出我在这里犯的错误吗？

tempPath = input("Please Enter the Path of the Filen")
temp_file = open(tempPath, "r")
fileContent = temp_file.read()
temp_file.close()
pattern_normal = re.compile("[-a-zA-Z0-9._]+@[-a-zA-Z0-9_]+.[a-zA-Z0-9_.]+")
pattern_normal_list = pattern_normal.findall(str(fileContent))
with open('emails_file.txt', 'a+') as f:            
    for item in pattern_normal_list:            
        for line in f:
            if line in item:
                print("duplicate")
            else:
                print("%s" %item)
                f.write("%s" %item)
                f.write('n')

新解决方案：

tempPath = input("Please Enter the Path of the Filen")
temp_file = open(tempPath, "r")
fileContent = temp_file.read()
temp_file.close()
pattern_normal = re.compile("[-a-zA-Z0-9._]+@[-a-zA-Z0-9_]+.[a-zA-Z0-9_.]+")
addresses = list(set(pattern_normal.findall(str(fileContent))))
with open('new_emails.txt', 'a+') as f:
    f.write('n'.join(addresses))

我认为你的逻辑是错误的，这是有效的：

addresses = ['test@wham.com', 'heffa@wham.com']
with open('emails_file.txt', 'a+') as f:
    fdata = f.read()
    for mail in addresses:
        if not mail in fdata:
            f.write(mail + 'n')

在不深入阅读代码的情况下，它看起来像是一行一行地循环，检查你循环通过的地址是否存在于行中，如果不存在，你就把你的电子邮件附加到行中？但在100行中，99%的地址不在行中，因此您会得到不需要的添加。

我的代码片段的输出：

[Torxed@faparch ~]$ cat emails_file.txt 
test@wham.com
Torxed@whoever.com
[Torxed@faparch ~]$ python test.py 
[Torxed@faparch ~]$ cat emails_file.txt 
test@wham.com
Torxed@whoever.com
heffa@wham.com
[Torxed@faparch ~]$

for line in f:

你不应该先调用f.readlines（）吗？

lines = f.readlines()
for line in lines:

检查一下。

新解决方案：

相关内容

最新更新

热门标签：