小贝子编程

如何在Python中使用Beautifulsoup获取嵌套标签的文本

本文关键字：嵌套获取标签文本 Beautifulsoup Python python html web-scraping
更新时间 : 2023-09-21
英文 : How to get the text of nested tag with Beautifulsoup in Python?

运行此代码后

section = soup.find_all('section', class_='b-branches')

我得到

<div class="b-branches__item"><i class="icon fa"><b>Firm</b> </i>RJT Roadlines</div>

现在，我只想提取RJIT路线，而不是。。。公司

所以我尝试了

for i in section: firm = i.find('div', class_='b-branches__item') print(firm)

它将返回公司和RJIT路线

那么，如何只提取div标记的文本呢？？

您可以使用tag.contents[1]来获得预期的输出。

示例：

from bs4 import BeautifulSoup
html = """
<div class="b-branches__item"><i class="icon fa"><b>Firm</b> </i>RJT Roadlines</div>
"""
soup=BeautifulSoup(html,'html.parser')
tag = soup.find('div', class_='b-branches__item')
print(tag.contents[1])

输出：

RJT Roadlines

如何在Python中使用Beautifulsoup获取嵌套标签的文本

相关内容

最新更新

热门标签：