I can extract tag information, but finding the text inside a tag is proving tricky.
I want to get the address from this:
<item itemprop="streetAddress">Some Road, London SW1 1AA</item>
I tried:
address = soup.find("item", "streetAddress").text
print address
But I get:
address = soup.find("item", "streetAddress").text
AttributeError: 'NoneType' object has no attribute 'text'
Any help with this simple problem would be much appreciated. I followed the tutorials, and they all seem to suggest this should work :S
I think you want:
address = soup.find('item', {'itemprop': 'streetAddress'}).text
Or you can pass the attribute as a keyword argument:
address = soup.find('item', itemprop='streetAddress').text
Example:
from bs4 import BeautifulSoup as BS
html = "<item itemprop='streetAddress'>Some Road, London SW1 1AA</item>"
soup = BS(html, 'html.parser')
print(soup.find('item', {'itemprop':'streetAddress'}).text)
Result:
Some Road, London SW1 1AA
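As for why the original call failed: when the second positional argument to find() is a plain string, Beautiful Soup treats it as a CSS class filter, so the question's call was looking for an item tag with class "streetAddress" rather than one with that itemprop value. Here is a minimal sketch of the difference, reusing the HTML from the question. The variable names and the None check are my own additions for illustration, not part of the original code.

```python
from bs4 import BeautifulSoup

html = '<item itemprop="streetAddress">Some Road, London SW1 1AA</item>'
soup = BeautifulSoup(html, "html.parser")

# A string passed as the second positional argument is matched against the
# CSS class, so this searches for <item class="streetAddress"> and finds nothing.
by_class = soup.find("item", "streetAddress")
print(by_class)

# Filtering on the itemprop attribute matches the tag from the question.
tag = soup.find("item", attrs={"itemprop": "streetAddress"})

# find() returns None when nothing matches, so check before reading the text
# to avoid the AttributeError shown in the question.
address = tag.get_text(strip=True) if tag is not None else None
print(address)
```

The first print shows None, which is where the AttributeError came from once .text was accessed on it.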