小贝子编程

如何有一些链接，而不是所有的链接与BeautifulSoup

本文关键字：链接 BeautifulSoup python python-3.x web-scraping beautifulsoup
更新时间 : 2023-09-22
英文 : How to have some link sand not all the links with BeautifulSoup

我想有这个网站的链接:https://www.bilansgratuits.fr/secteurs/finance-assurance,k.html

但不是所有的链接，只有:links

不幸的是，我的脚本在这里给我所有的链接。

import requests
from requests import get
from bs4 import BeautifulSoup
import pandas as pd


url = 'https://www.bilansgratuits.fr/secteurs/finance-assurance,k.html'
links = []
results = requests.get(url)
soup = BeautifulSoup(results.text, "html.parser")

links = [a['href'] for a in soup.find_all('a', href=True)]
print(links)

你知道怎么做吗?

你想要的所有链接都包含在一个类名为listeEntreprises的div中，所以你可以做

links = [a['href'] for a in soup.find("div", {"class": "listeEntreprises"}).find_all('a', href=True)]

如何有一些链接，而不是所有的链接与BeautifulSoup

相关内容

最新更新

热门标签：