from urllib.request import urlopen
from bs4 import BeautifulSoup as soup
import pandas as pd
amazon_url = "https://www.amazon.in/s?k=earbuds"
amazon_data = urlopen(amazon_url)
print (type(amazon_url))
amazon_html = amazon_data.read()
#amazon_html
amazon_soup = soup(amazon_html,'html.parser')
page= amazon_soup.findAll('span',{'class':'s-pagination-item s-pagination-disabled'})['a']
有很多方法可以访问元素。
<span id="logo-ext" />
例如每个id:
CSS选择器->span[id="logo-ext"]
XPATH->//span[@id='logo-ext']
如果绝对没有属性可识别,则可以按路线进行识别。
例如。CSS选择器->div[class="div-including-element"] > span
你需要Wich元素吗?