过滤掉Beautifulsoup提供的链接



我一直在寻找一些关于从以下结果中筛选出特定链接(每个链接按季节(的帮助。我试图尽可能多地使用这个代码,以避免在应用我不确定的新代码时出现进一步的问题:

def get_playable_podcast0(soup0):
subjects = []
for content in soup0.find_all('item'):
try:
link = content.find('enclosure')
link = link.get('url')
print("nnLink: ", link)
title = content.find('title')
title = title.get_text()
except AttributeError:
continue
item = {
'url': link,
'title': title,
'thumbnail': "(imagelinkhere)",
}
subjects.append(item)
return subjects
def compile_playable_podcast0(playable_podcast0):
items = []
for podcast in playable_podcast0:
items.append({
'label': podcast['title'],
'thumbnail': podcast['thumbnail'],
'path': podcast['url'],
'is_playable': True,
})
return items

我试图过滤掉每一季的链接,最好的方法可能是使用我一直在抓取的.mp3链接中的标签,要解析的链接和这里的例子:

<enclosure url="https://cbc.mc.tritondigital.com/..0916.mp3?ttag=season:5" length="34742214" type="audio/mpeg"/>

简言之,我如何告诉它只返回与预先指定的季节的链接?

def filter_season(season_id):
urls = []
for content in soup.find_all('item'):
try:
link = content.find('enclosure')
link = link.get('url')
if link.endswith("ttag=season:{}".format(season_id)):
urls.append(link) 
except AttributeError:
continue
return urls

最新更新