如何在没有类的 span 标签中提取 href 和标签


from urllib.request import urlopen
from bs4 import BeautifulSoup as soup
import pandas as pd
amazon_url = "https://www.amazon.in/s?k=earbuds"
amazon_data = urlopen(amazon_url)
print (type(amazon_url)) 
amazon_html = amazon_data.read()
#amazon_html
amazon_soup = soup(amazon_html,'html.parser')
page= amazon_soup.findAll('span',{'class':'s-pagination-item s-pagination-disabled'})['a']

有很多方法可以访问元素。

<span id="logo-ext" />

例如每个id:

CSS选择器->span[id="logo-ext"]
XPATH->//span[@id='logo-ext']

如果绝对没有属性可识别,则可以按路线进行识别。

例如。CSS选择器->div[class="div-including-element"] > span

你需要Wich元素吗?

最新更新