是否有任何方法可以从API获取其值的网页废弃值



我正在做一个项目,我必须从url= "https://www.screener.in/company/ITC/consolidated/"的源代码中获取'6596626'。该值在网页上不可见,因此难以使用xpath提取。下面的代码是page源代码的一部分,其中包含我想要提取的值。

<div
data-company-id="1552"
data-warehouse-id="6596626"
data-user-is-registered="true"
data-consolidated="true"
id="company-info">
</div>

这是我尝试的代码,我希望直接从源代码中提取值,但没有结果。

from urllib import request
from bs4 import BeautifulSoup
from lxml import etree
symbol=input("Enter symbol of the companyn")
response = request.urlopen("https://www.screener.in/company/"+symbol+"/consolidated/")
page_source = response.read().decode('utf-8')
soup=BeautifulSoup(page_source,'html.parser')
id=soup.get_text('data-warehouse-id')
print(id)
from bs4 import BeautifulSoup
import requests

def main(url):
r = requests.get(url)
soup = BeautifulSoup(r.text, 'lxml')
print(soup.select_one('#company-info')['data-warehouse-id'])

main('https://www.screener.in/company/ITC/consolidated/')

输出:

6596626

最新更新