Python-过滤BS4内容



当前代码:

import bs4
import requests
url = 'hidden'
res = requests.get(url)
soup = bs4.BeautifulSoup(res.text, 'html.parser')
bs4_content = soup.find_all(class_='user-post-count')
print(bs4_content)

我唯一能得到的内容是

[<p class="user-post-count">This user has made <strong>5 posts</strong>
</p>] 

我试图只获取强标签之间的内容。

谢谢大家的帮助,非常感谢

您可以使用内部.find_all

import bs4
import requests
url = 'hidden'
res = requests.get(url)
soup = bs4.BeautifulSoup(res.text, 'html.parser')
bs4_content = soup.find_all(class_='user-post-count')
for strong in bs4_content.find_all('strong'):
print(strong.text)

尝试使用CSS选择器.user-post-count strong,它选择user-post-count类下的<strong>标记。

from bs4 import BeautifulSoup
html = '''<p class="user-post-count">This user has made <strong>5 posts</strong>
</p>
'''
soup = BeautifulSoup(html, "html.parser")
for tag in soup.select('.user-post-count strong'):
print(tag.text)

输出:

5 posts

最新更新