Scraping the text between two strong tags



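I am trying to scrape the "167" (top right) from the following website: https://www.goodfirms.co/billing-invoicing-software/.

I can get all of the text, but I only want the number, and I am not sure how to isolate it. Can anyone help?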

Code:

from bs4 import BeautifulSoup as bs
import requests
import requests_random_user_agent
s = requests.Session()
user_agent = s.headers['User-Agent']
headers = {
'accept': '*/*',
'accept-language': 'en-GB,en-US;q=0.9,en;q=0.8,es;q=0.7,ru;q=0.6',
'referer': 'https://www.google.com/',
'connection': 'Keep-alive',
'user-agent': user_agent,
}
response = requests.get('https://www.goodfirms.co/billing-invoicing-software/', headers=headers)
soup = bs(response.content, 'lxml')
test = soup.find("section", class_="section-breadcrumb blog-breadcrumb overflow").text
print(test)

Output:

Home >
Billing and Invoicing Software
167 Softwares  |  Last updated: Jul 31, 2022

Expected output:

167

The number 167 is inside a <strong> tag, under the element with class="last_update":

import requests
from bs4 import BeautifulSoup

url = "https://www.goodfirms.co/billing-invoicing-software/"
soup = BeautifulSoup(requests.get(url).content, "html.parser")
num = soup.select_one(".last_update strong")
print(num.text)

Prints:

167
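
If the page markup changes and the selector stops matching, another option is to pull the number out of the text the question's code already prints. The following is only a minimal sketch: the sample string is copied from the output shown in the question, `extract_count` is a hypothetical helper name, and the pattern assumes the count is a plain run of digits directly followed by the word "Softwares".

```python
import re

# Sample text copied from the output in the question (not fetched live).
breadcrumb_text = """Home >
Billing and Invoicing Software
167 Softwares  |  Last updated: Jul 31, 2022
"""


def extract_count(text):
    """Return the integer that precedes the word 'Softwares', or None if absent."""
    # Assumption: the count has no thousands separators and sits right before "Softwares".
    match = re.search(r"(\d+)\s+Softwares", text)
    return int(match.group(1)) if match else None


print(extract_count(breadcrumb_text))  # prints 167
```

The CSS selector is the more direct route when the <strong> tag is present; the regular expression may be useful as a fallback because it only depends on the visible text.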
