
How to filter only those records that contain a required string in a DataFrame, ignoring characters in between

Updated: 2023-09-22

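I have a huge Excel file that I have imported with pandas. I want to know how to filter the records that contain the string 'ABCD'.

The filter should ignore any characters that appear in between, so a value such as 'AB/CD' or 'AB_CD' still counts as 'ABCD'. Suppose I have a DataFrame like this, where col_2 is the column that contains 'ABCD':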

col_1 col_2  col_3  col_4
1    ABCD    123    890
2    AB/CD   345    789
3    XYZ     777    980
4    AB_CD   010    345

The new DataFrame should be:

col_1 col_2  col_3  col_4
1    ABCD    123    890
2    AB/CD   345    789
4    AB_CD   010    345

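Try using loc and str.replace. Pass regex=True explicitly, because pandas 2.0 and later treat the pattern as a literal string by default:

print(df.loc[df['col_2'].str.replace('[^ABCD]', '', regex=True).eq('ABCD')])

Output: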

col_1  col_2  col_3  col_4
0      1   ABCD    123    890
1      2  AB/CD    345    789
3      4  AB_CD     10    345
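
For anyone who wants to reproduce this without the original Excel file, here is a minimal self-contained sketch. The DataFrame is rebuilt by hand from the table in the question (col_3 is stored as integers, which is why 010 prints as 10), and the names `cleaned`, `mask`, `strict_mask` and `filtered` are only illustrative. The second mask is an assumed variation, not part of the original answer: it strips only non-alphanumeric characters, so a value like 'AXBCD' would be rejected instead of being reduced to 'ABCD'.

```python
import pandas as pd

# Sample data rebuilt from the table in the question.
df = pd.DataFrame({
    "col_1": [1, 2, 3, 4],
    "col_2": ["ABCD", "AB/CD", "XYZ", "AB_CD"],
    "col_3": [123, 345, 777, 10],
    "col_4": [890, 789, 980, 345],
})

# Remove every character that is not A, B, C or D, then compare with the target.
# regex=True is required on pandas >= 2.0, where the default is a literal match.
cleaned = df["col_2"].str.replace(r"[^ABCD]", "", regex=True)
mask = cleaned.eq("ABCD")

# Assumed stricter variant: drop only separators (non-alphanumeric characters),
# so extra letters or digits inside the value prevent a match.
strict_mask = df["col_2"].str.replace(r"[^A-Za-z0-9]", "", regex=True).eq("ABCD")

filtered = df.loc[mask]
print(filtered)
```

Which mask fits better depends on the data: the first accepts anything that reduces to 'ABCD' once all other characters are dropped, while the second tolerates only punctuation and whitespace between the letters.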
