import pandas as pd
name = r"C:Userssaarif2201DesktopClassificationVidio_IndonesiaVidioDataset-Apr'22.xlsx"
df = pd.read_excel(name)
consist = ["Episode","Ep"]
def cat_marking(x):
    if consist in (x):
        return "Series"
    else:
        return ""
df['Content_Category'] = df['vod_episode_name'].apply(cat_marking)
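I wrote this to add a column named Content_Category and to mark some rows as "Series" based on a condition on the vod_episode_name column (the value contains "Episode" or "Ep" as a substring).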
I can't access the .xlsx file you posted, but if you want to apply the function to the vod_episode_name column, I think the following code should work for you.
import pandas as pd
name = r"C:Userssaarif2201DesktopClassificationVidio_IndonesiaVidioDataset-Apr'22.xlsx"
df = pd.read_excel(name)
consist = ["Episode","Ep"]
def cat_marking(x):
    if any(ext in x for ext in consist):
        return "Series"
    else:
        return ""
df['Content_Category'] = df['vod_episode_name'].apply(lambda x: cat_marking(x))
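
For context, `consist in x` asks whether the whole list is a substring of the string, which Python rejects with a TypeError; `any(...)` tests each keyword in turn. Since I can't open the spreadsheet, here is a minimal sketch on a small made-up DataFrame (the sample rows and the `Content_Category_vec` column are my own assumptions, not from your data). It also adds an `isinstance` guard, on the assumption that empty Excel cells may come through as NaN, and shows a vectorized alternative with `Series.str.contains`.

```python
import pandas as pd

# Hypothetical sample rows standing in for the real spreadsheet
df = pd.DataFrame(
    {"vod_episode_name": ["Show A Episode 1", "Show B Ep 2", "Some Movie", None]}
)

consist = ["Episode", "Ep"]

def cat_marking(x):
    # Only strings can contain a keyword; missing cells are left unmarked
    if isinstance(x, str) and any(ext in x for ext in consist):
        return "Series"
    return ""

df["Content_Category"] = df["vod_episode_name"].apply(cat_marking)

# Vectorized alternative: build one pattern ("Episode|Ep") and let pandas
# test every row; na=False treats missing values as "no match"
pattern = "|".join(consist)
mask = df["vod_episode_name"].str.contains(pattern, na=False)
df["Content_Category_vec"] = mask.map({True: "Series", False: ""})

print(df)
```

Note that this is a plain substring match, so "Ep" will also match inside other words (for example "Epic"). If that matters for your titles, a stricter pattern would be needed.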