使用BeautifulSoup从web服务器中抓取一个变量



我只想从每隔几秒钟更新一次的web服务器中提取一个摄氏度温度变量。到目前为止,我的代码是:

from bs4 import BeautifulSoup
from urllib.request import urlopen
url = "http://192.168.251.184"
page = urlopen(url)
html = page.read().decode("utf-8")
soup = BeautifulSoup(html, "html.parser")
print(soup.get_text())

但这也打印原始html格式的数据:

我曾尝试使用find((函数只打印变量,即19.44,但没有成功。

该变量的页面来源摘录为(最后第三行的目标变量为19.44(:

<!DOCTYPE HTML><html>
<head>
<meta name="viewport" content="width=device-width, initial-scale=1">
<link rel="stylesheet" href="https://use.fontawesome.com/releases/v5.7.2/css/all.css" integrity="sha384-fnmOCqbTlWIlj8LyTjo7mOUStjsKC4pOpQbqyi7RrhN7udi9RwhKkMHpvLbHG9Sr" crossorigin="anonymous">
<style>
html {
font-family: Arial;
display: inline-block;
margin: 0px auto;
text-align: center;
}
h2 { font-size: 3.0rem; }
p { font-size: 3.0rem; }
.units { font-size: 1.2rem; }
.ds-labels{
font-size: 1.5rem;
vertical-align:middle;
padding-bottom: 15px;
}
</style>
</head>
<body>
<h2>ESP DS18B20 Server</h2>
<p>
<i class="fas fa-thermometer-half" style="color:#059e8a;"></i> 
<span class="ds-labels">Temperature Celsius</span> 
<span id="temperaturec">19.44</span>
<sup class="units">&deg;C</sup>
</p>

你能帮我刮一下摄氏度变量吗?

float(soup.find("span", {"id": "temperaturec"}).text)

是。以下是您的操作方法:

celsius = soup.find("span", {"id": "temperaturec"}).text

由于温度在第二个跨度标签中,即<span id="temperaturec">19.44</span>,我们将使用soup.find_all访问该标签,并使用.string打印其中的内容

from bs4 import BeautifulSoup as bs
html_doc = """
<!DOCTYPE HTML><html>
<head>
<meta name="viewport" content="width=device-width, initial-scale=1">
<link rel="stylesheet" href="https://use.fontawesome.com/releases/v5.7.2/css/all.css" integrity="sha384-fnmOCqbTlWIlj8LyTjo7mOUStjsKC4pOpQbqyi7RrhN7udi9RwhKkMHpvLbHG9Sr" crossorigin="anonymous">
<style>
html {
font-family: Arial;
display: inline-block;
margin: 0px auto;
text-align: center;
}
h2 { font-size: 3.0rem; }
p { font-size: 3.0rem; }
.units { font-size: 1.2rem; }
.ds-labels{
font-size: 1.5rem;
vertical-align:middle;
padding-bottom: 15px;
}
</style>
</head>
<body>
<h2>ESP DS18B20 Server</h2>
<p>
<i class="fas fa-thermometer-half" style="color:#059e8a;"></i> 
<span class="ds-labels">Temperature Celsius</span> 
<span id="temperaturec">19.44</span>
<sup class="units">&deg;C</sup>
</p>
</div>"""

soup = bs(html_doc,'html.parser')
list_of_spans = soup.find_all('span')
print(list_of_spans[1].string)

最新更新