在1cak.com/shuffle下载图片时,请使用禁用网关



我正在尝试制作一个从这个网站下载图像的机器人

from bs4 import BeautifulSoup as bs
from requests import get
wan = get('http://1cak.com/shuffle')
soup = bs(get(wan.url).content, 'html.parser')
url = soup.img.get('src')
wan = get(url)
print(wan.content)

但它给了我这个

<html>rn<head><title>403 Forbidden</title></head>rn<body bgcolor="white">rn<center><h1>403 Forbidden</h1></center>rn<hr><center>nginx/1.12.1</center>rn</body>rn</html>rn

要获取图像,请指定RefererHTTP标头:

from bs4 import BeautifulSoup as bs
from requests import get
soup = bs(get('http://1cak.com/shuffle').content, 'html.parser')
url = soup.img.get('src')
wan = get(url, headers={'Referer': url}, verify=False)
print(wan.content)

打印警告(因为您已关闭SSL验证(和图像内容:

b'xffxd8xffxe0x00x10JFIFx00x01x01x01x00x01x00x01x00x00xffxdbx00Cx00nx07x07x08x07x06nx08x08x08x0bnnx0bx0ex18x10x0errx0ex1dx15x16x11x18#x1f%$"x1f"!&+7/&)4)!"0A149;>>>%.DIC<H7=>;xffxdbx00Cx01nx0bx0bx0erx0ex1cx10x10x1c;("(;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;;xffxc0x00x11x08x08#x01xbcx03x01"x00x02x11x01x03x11x01xffxc4x00x1fx00x00x01x05x01x01x01x01x01x01x00x00x00x00x00x00x00x00x01x02x03x04x05x06x07x08tnx0bxffxc4x00xb5x10x00x02x01x03x03x02x04x03x05x05x04x04x00x00x01}x01x02x03x00x04x11x05x12!1Ax06x13Qax07"qx142x81x91xa1x08#Bxb1xc1x15Rxd1xf0$3brx82tnx16x17x18x19x1a%&'()*456789:CDEFGHIJSTUVWXYZcdefghijstuvwxyzx83x84x85x86x87x88x89x8ax
... 

最新更新