Python returns an AttributeError when using stripped_strings while parsing HTML content that has been added to a class



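Python returns an attribute error when I use the stripped_strings attribute while parsing HTML content. The HTML content has been added to the class. Here is the code snippet; the data I need to extract is part of a list.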

Updated code:

from typing import List, Any, Optional

import requests
from bs4 import BeautifulSoup

class RESTApp:
    def __init__(self, url):
        self.url = url

    def getAllUsers(self):
        # the list is added here
        data = '''<ul class="users-list clearfix">
                    <li>
                      <img src="dist/img/user1-128x128.jpg" alt="User Image">
                      <a class="users-list-name" href="#">Alexander Pierce</a>
                      <span class="users-list-date">Today</span>
                    </li>
                    <li>
                      <img src="dist/img/user8-128x128.jpg" alt="User Image">
                      <a class="users-list-name" href="#">Norman</a>
                      <span class="users-list-date">Yesterday</span>
                    </li>
                    <li>
                      <img src="dist/img/user7-128x128.jpg" alt="User Image">
                      <a class="users-list-name" href="#">Jane</a>
                      <span class="users-list-date">12 Jan</span>
                    </li>
                    <li>
                      <img src="dist/img/user6-128x128.jpg" alt="User Image">
                      <a class="users-list-name" href="#">John</a>
                      <span class="users-list-date">12 Jan</span>
                    </li>
                    <li>
                      <img src="dist/img/user2-160x160.jpg" alt="User Image">
                      <a class="users-list-name" href="#">Alexander</a>
                      <span class="users-list-date">13 Jan</span>
                    </li>
                    <li>
                      <img src="dist/img/user5-128x128.jpg" alt="User Image">
                      <a class="users-list-name" href="#">Sarah</a>
                      <span class="users-list-date">14 Jan</span>
                    </li>
                    <li>
                      <img src="dist/img/user4-128x128.jpg" alt="User Image">
                      <a class="users-list-name" href="#">Nora</a>
                      <span class="users-list-date">15 Jan</span>
                    </li>
                    <li>
                      <img src="dist/img/user3-128x128.jpg" alt="User Image">
                      <a class="users-list-name" href="#">Nadia</a>
                      <span class="users-list-date">15 Jan</span>
                    </li>
                  </ul>'''
        soup = BeautifulSoup(data, "html.parser")
        user_names = soup.find('ul', class_='users-list clearfix')
        split_details = list(user_names.stripped_strings)
        print(split_details)

test = RESTApp("https://adminlte.io/themes/AdminLTE/pages/examples/profile.html")
test.getAllUsers()

Actual result = AttributeError: 'NoneType' object has no attribute 'stripped_strings'

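You are constructing BeautifulSoup with the string literal '''data''' rather than the variable data. The parsed document then contains only the text "data", so soup.find() returns None, and accessing stripped_strings on None raises the AttributeError.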
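To make the difference concrete, here is a minimal sketch that parses the literal and the variable side by side. The shortened one-item `data` string and the variable names `literal_soup`, `variable_soup`, `literal_result`, and `variable_result` are assumptions made for illustration; they are not part of the original code.

```python
from bs4 import BeautifulSoup

# Shortened sample markup (an assumption for brevity, not the full list).
data = ('<ul class="users-list clearfix">'
        '<li><a class="users-list-name" href="#">Norman</a></li>'
        '</ul>')

# Parsing the literal: the "document" is just the four characters d-a-t-a.
literal_soup = BeautifulSoup('''data''', "html.parser")
literal_result = literal_soup.find('ul', class_='users-list clearfix')

# Parsing the variable: the document is the actual markup.
variable_soup = BeautifulSoup(data, "html.parser")
variable_result = variable_soup.find('ul', class_='users-list clearfix')

print(literal_result)                          # None
print(list(variable_result.stripped_strings))  # ['Norman']
```

Calling `literal_result.stripped_strings` here would raise the same AttributeError as in the question, because `literal_result` is None.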

import requests
from bs4 import BeautifulSoup
class RESTApp:
    def __init__(self, url):
        self.url = url
    def getAllUsers(self):
        # the list is added here
        data = '''<ul class="users-list clearfix">
                <li>
                ...............
                </li>
                <li>
                ......
                </li>
                <li>
                .....
                </li>
            </ul>'''

        # parsing the data to get text from list.
        soup = BeautifulSoup(data, "html.parser")
        user_names = soup.find('ul', class_='users-list clearfix')
        split_details = list(user_names.stripped_strings)
        print(split_details)
test = RESTApp("https://adminlte.io/themes/AdminLTE/pages/examples/profile.html")
test.getAllUsers()

Change soup = BeautifulSoup('''data''', "html.parser") to soup = BeautifulSoup(data, "html.parser").
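Since find() returns None whenever nothing matches, it may also be worth guarding against that case before reading attributes from the result. The following is only a sketch under stated assumptions: the helper name `extract_users` and the idea of returning (name, date) pairs are hypothetical and not taken from the original code.

```python
from bs4 import BeautifulSoup

def extract_users(html):
    """Return (name, date) pairs from the users list, or [] if it is absent.

    Hypothetical helper for illustration; not part of the original class.
    """
    soup = BeautifulSoup(html, "html.parser")
    user_list = soup.find("ul", class_="users-list")
    if user_list is None:
        # Nothing matched, so avoid touching attributes on None.
        return []

    users = []
    for item in user_list.find_all("li"):
        name = item.find("a", class_="users-list-name")
        date = item.find("span", class_="users-list-date")
        if name is not None and date is not None:
            users.append((name.get_text(strip=True), date.get_text(strip=True)))
    return users

sample = ('<ul class="users-list clearfix"><li>'
          '<a class="users-list-name" href="#">Norman</a>'
          '<span class="users-list-date">Yesterday</span>'
          '</li></ul>')
print(extract_users(sample))  # [('Norman', 'Yesterday')]
print(extract_users("data"))  # []
```

Pairing each name with its date per li element keeps the two values together, whereas stripped_strings yields one flat list of strings.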
