我有一列值为:
伪数据:
df["temp_design"] = ['Are Premium Design tree (LKL#)',
'THE Premium Design tree (TKL+)',
'THZPremium Design tree (TKL+)',
'THG THEM Entry tree temporary align (MKP#)', nan,
'THZPremium Design tree (CHU#)',
'THZPremium Design tree (ZHU2+)',
'TRUE PREMIUM TEMPORARY DESIGN (ZHU+)',
'BASIC TEMPORARY DESIGN (ZHU+)']
我想创建一个新列,该列的值位于最后一个括号内。
有人能帮我把这根绳子脱光吗?
df["output_col"] =["LKL#","TKL+","TKL+","MKP#","CHU#","ZHU2+","ZHU+","ZHU+"]
将extract与匹配左括号和右括号的regex一起使用,并在单位中捕获内容
df['temp_design'].str.extract(r'((.*?))$')
0
0 LKL#
1 TKL+
2 TKL+
3 MKP#
4 NaN
5 CHU#
6 ZHU2+
7 ZHU+
8 ZHU+