我在pandas数据帧中有一列,其中包含不完整的类似JSON数组的数据。因为该列的第一行类似于[{quot;名称":"A","类型":"食物","服务":"xyz_abc"},{quotquot;姓名"B","种类":"饮料","服务":"pqr_ghi"},":"C","类型">
我想提取数据中服务后给定的值xyz_abc和pqr_ghi。如何在python中使用regex实现这一点?
import re
df['service'] = df['col'].apply(lambda x: re.findall(r'(?<=service")s*:s*"(.+?)"', x))
print(df)
col service
0 [{"name":"A","type":"Food","service":"xyz_abc"... [xyz_abc, pqr_ghi]