Python: removing parts of a string based on fluid start/end points

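I am working with a pandas DataFrame and would prefer a solution that works on it directly, if possible.

I have string "A" and want to turn it into string "B" by removing certain parts, each identified by a start point and an end point. The data contains many rows, and the parts to remove can appear anywhere in the string.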

A-

Sunshine <long swing-people="11-22-33">#Drive</long>, A frog is any member of a diverse and largely   <long swing-people="11-22-33">#Thru</long> carnivorous group of short-bodied, tailless amphibians.

B-

Sunshine #Drive, A frog is any member of a diverse and largely #Thru carnivorous group of short-bodied, tailless amphibians.

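This should be a comment, but I can't comment yet...

Are the start and end points already defined?

In these examples, the start point seems to be # and the end point a single period (.). Is that always the case? If so, this should work (see the pandas documentation):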

df[column_name].str.replace(r"#.*(?=\.)", "", regex=True)

Edit: I (again) mixed up str.replace and pandas.Series.str.replace... fixed the code snippet.
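
If the start and end points are instead the opening `<long ...>` tag and the closing `</long>` tag, as strings A and B in the question suggest, a regex that removes only the tags may be closer to what is wanted. The following is a minimal sketch under that assumption. The column names `text` and `cleaned` are hypothetical, and the second replacement assumes that runs of whitespace left behind should be collapsed to a single space, as in string B.

```python
import pandas as pd

# Hypothetical column name "text"; the sample row is string A from the question.
df = pd.DataFrame({
    "text": [
        'Sunshine <long swing-people="11-22-33">#Drive</long>, A frog is any '
        'member of a diverse and largely   <long swing-people="11-22-33">#Thru</long> '
        'carnivorous group of short-bodied, tailless amphibians.'
    ]
})

# Matches an opening <long ...> tag (with any attributes) or a closing </long> tag.
TAG_PATTERN = r"</?long\b[^>]*>"

df["cleaned"] = (
    df["text"]
    .str.replace(TAG_PATTERN, "", regex=True)  # drop the tags, keep the text between them
    .str.replace(r"\s{2,}", " ", regex=True)   # collapse whitespace runs (assumption, see above)
    .str.strip()
)

print(df["cleaned"].iloc[0])
```

Because `[^>]*` accepts any attributes, the pattern does not depend on the particular `swing-people` value. If other tag names can occur in the data, the pattern would need to be widened accordingly.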
