I have a pandas DataFrame with the following structure:
df =
repl_str    normal_str
1_labelled  1_text
2_labelled  2_text
4_labelled  4_text
5_labelled  5_text
7_labelled  7_text
8_labelled  8_text
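If the values in the normal_str column are all unique, you can build a dictionary that maps the normal_str column to the repl_str column and look each string up in it, leaving unmatched strings unchanged: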
A = [['1_text', '3_text', '4_text'], ['5_text'], ['6_text', '8_text']]
# Map each normal_str value to its repl_str value
d = df.set_index('normal_str')['repl_str'].to_dict()
# Replace strings found in the mapping; keep the others as they are
B = [[d.get(text, text) for text in lst] for lst in A]
print(B)
# Output:
# [['1_labelled', '3_text', '4_labelled'], ['5_labelled'], ['6_text', '8_labelled']]
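For reference, here is a self-contained sketch of the same approach that can be run on its own. It assumes the DataFrame holds exactly the six rows shown above. The helper name replace_nested and the explicit uniqueness check are my own additions for illustration, not part of the original answer. The check is there because to_dict() silently keeps only the last repl_str for a repeated normal_str value.

```python
import pandas as pd

# Rebuild the example frame from the rows shown in the question (assumed data).
df = pd.DataFrame({
    "repl_str": ["1_labelled", "2_labelled", "4_labelled",
                 "5_labelled", "7_labelled", "8_labelled"],
    "normal_str": ["1_text", "2_text", "4_text",
                   "5_text", "7_text", "8_text"],
})


def replace_nested(nested, frame):
    """Hypothetical helper: replace strings in a list of lists via frame's mapping."""
    # Refuse to build the mapping if a key would be ambiguous.
    if not frame["normal_str"].is_unique:
        raise ValueError("normal_str contains duplicates; the mapping would be ambiguous")
    mapping = frame.set_index("normal_str")["repl_str"].to_dict()
    # dict.get(text, text) falls back to the original string when there is no match.
    return [[mapping.get(text, text) for text in lst] for lst in nested]


A = [["1_text", "3_text", "4_text"], ["5_text"], ["6_text", "8_text"]]
B = replace_nested(A, df)
print(B)
```

If normal_str is not unique, you would need to decide which replacement wins (for example with drop_duplicates) before building the dictionary.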