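Python raises an AttributeError about stripped_strings when parsing HTML content. The HTML content has been added inside the class. Here is the code snippet; the data I need to extract is part of a list.
Updated code: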
from typing import List, Any, Optional
import requests
from bs4 import BeautifulSoup

class RESTApp:
    def __init__(self, url):
        self.url = url

    def getAllUsers(self):
        # the list is added here
        data = '''<ul class="users-list clearfix">
<li>
<img src="dist/img/user1-128x128.jpg" alt="User Image">
<a class="users-list-name" href="#">Alexander Pierce</a>
<span class="users-list-date">Today</span>
</li>
<li>
<img src="dist/img/user8-128x128.jpg" alt="User Image">
<a class="users-list-name" href="#">Norman</a>
<span class="users-list-date">Yesterday</span>
</li>
<li>
<img src="dist/img/user7-128x128.jpg" alt="User Image">
<a class="users-list-name" href="#">Jane</a>
<span class="users-list-date">12 Jan</span>
</li>
<li>
<img src="dist/img/user6-128x128.jpg" alt="User Image">
<a class="users-list-name" href="#">John</a>
<span class="users-list-date">12 Jan</span>
</li>
<li>
<img src="dist/img/user2-160x160.jpg" alt="User Image">
<a class="users-list-name" href="#">Alexander</a>
<span class="users-list-date">13 Jan</span>
</li>
<li>
<img src="dist/img/user5-128x128.jpg" alt="User Image">
<a class="users-list-name" href="#">Sarah</a>
<span class="users-list-date">14 Jan</span>
</li>
<li>
<img src="dist/img/user4-128x128.jpg" alt="User Image">
<a class="users-list-name" href="#">Nora</a>
<span class="users-list-date">15 Jan</span>
</li>
<li>
<img src="dist/img/user3-128x128.jpg" alt="User Image">
<a class="users-list-name" href="#">Nadia</a>
<span class="users-list-date">15 Jan</span>
</li>
</ul>'''
        soup = BeautifulSoup(data, "html.parser")
        user_names = soup.find('ul', class_='users-list clearfix')
        split_details = list(user_names.stripped_strings)
        print(split_details)

test = RESTApp("https://adminlte.io/themes/AdminLTE/pages/examples/profile.html")
test.getAllUsers()
Actual result: AttributeError: 'NoneType' object has no attribute 'stripped_strings'
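You are constructing BeautifulSoup from the string literal '''data''' rather than from the variable data. The literal contains only the text "data", so find() matches no <ul> and returns None, which is why accessing stripped_strings fails.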
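To see the difference side by side, here is a minimal sketch. The HTML is a shortened, hypothetical stand-in for the list in the question, and the names wrong and right are only for illustration.

```python
from bs4 import BeautifulSoup

# Shortened, hypothetical stand-in for the HTML in the question.
data = '''<ul class="users-list clearfix">
<li><a class="users-list-name" href="#">Norman</a>
<span class="users-list-date">Yesterday</span></li>
</ul>'''

# '''data''' is just the four-character string "data", so there is no <ul> to find.
wrong = BeautifulSoup('''data''', "html.parser").find('ul', class_='users-list clearfix')

# Passing the variable parses the real markup.
right = BeautifulSoup(data, "html.parser").find('ul', class_='users-list clearfix')

print(wrong)                         # None
print(list(right.stripped_strings))  # ['Norman', 'Yesterday']
```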
import requests
from bs4 import BeautifulSoup

class RESTApp:
    def __init__(self, url):
        self.url = url

    def getAllUsers(self):
        # the list is added here
        data = '''<ul class="users-list clearfix">
<li>
...............
</li>
<li>
......
</li>
<li>
.....
</li>
</ul>'''
        # parsing the data to get text from list.
        soup = BeautifulSoup(data, "html.parser")
        user_names = soup.find('ul', class_='users-list clearfix')
        split_details = list(user_names.stripped_strings)
        print(split_details)

test = RESTApp("https://adminlte.io/themes/AdminLTE/pages/examples/profile.html")
test.getAllUsers()
Change soup = BeautifulSoup('''data''', "html.parser")
to soup = BeautifulSoup(data, "html.parser")
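If you also want the code to survive markup that does not contain the list at all, one option is to check the result of find() before touching stripped_strings. This is only a sketch: extract_strings is a hypothetical helper, not part of the original class, and returning an empty list on a miss is an assumption about what you want.

```python
from bs4 import BeautifulSoup

def extract_strings(html, class_name='users-list clearfix'):
    """Return the stripped text of the first matching <ul>, or [] if none matches."""
    soup = BeautifulSoup(html, "html.parser")
    user_list = soup.find('ul', class_=class_name)
    if user_list is None:
        # find() returns None when nothing matches; avoid the AttributeError.
        return []
    return list(user_list.stripped_strings)
```

Calling extract_strings("data") then gives an empty list instead of raising, which makes the mistake easier to spot.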