When does a URL to a .csv file open the file, and when does it download it?



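I'm learning some pandas in Python, and the course uses https://gist.githubusercontent.com/sh7ata/e075ff35b51ebb0d2d577fbe1d19ebc9/raw/b966d02c7c26bcca60703acb1390e938a65a35cb/drinks.csv

Clicking this link opens the actual .csv file contents in the browser, and I can read the data straight into pandas.

However, this doesn't work for the URL https://www.spss-tutorials.com/downloads/browsers.csv (if I click this link, Google Chrome downloads the file instead of displaying its contents).

Why does this happen, and what should I do about it? I mean, they're both URLs to .csv files, right?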

You can download it using the requests module with a custom HTTP header. For example:

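import requests
import pandas as pd
from io import StringIO

url = "https://www.spss-tutorials.com/downloads/browsers.csv"
# Send a browser-like User-Agent instead of the library's default one.
headers = {
    "User-Agent": "Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:99.0) Gecko/20100101 Firefox/99.0"
}
response = requests.get(url, headers=headers)
response.raise_for_status()  # stop on an HTTP error instead of parsing an error page
# Parse the downloaded text as CSV.
df = pd.read_csv(StringIO(response.text))
print(df.to_markdown())  # to_markdown() needs the optional "tabulate" package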

Prints:

[Markdown table of the DataFrame: 22 rows (index 0 to 21), one per screen resolution, starting with 1366x768; the columns include screen_resolution, avg_session_duration, goal_conversion_rate, goal_completions, and a goal value column. Cell values omitted.]
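As for why the two links behave differently: a browser generally decides whether to render or download a response from the HTTP response headers, not from the .csv extension in the URL. Raw gist URLs are served as text/plain, which browsers render in the tab. A response labelled with a type such as text/csv, or one carrying "Content-Disposition: attachment", is usually saved instead. I have not checked which headers spss-tutorials.com actually sends, so the following is only a rough sketch of that decision. The function name likely_browser_behavior and the list of "displayable" types are my own illustration, not the logic of any particular browser.

```python
from typing import Mapping


def likely_browser_behavior(headers: Mapping[str, str]) -> str:
    """Guess whether a browser would display or download a response.

    Simplified heuristic for illustration only (hypothetical helper).
    """
    # Header names are case-insensitive, so normalize them first.
    normalized = {name.lower(): value.lower() for name, value in headers.items()}

    # "Content-Disposition: attachment" explicitly asks for a download.
    disposition = normalized.get("content-disposition", "")
    if disposition.strip().startswith("attachment"):
        return "download"

    # Strip parameters such as "; charset=utf-8" from the media type.
    media_type = normalized.get("content-type", "").split(";")[0].strip()
    if not media_type:
        return "unknown"  # real browsers would sniff the content here

    # Types that browsers commonly render in a tab (assumed, incomplete list).
    if media_type in {"text/plain", "text/html", "application/json"}:
        return "display"

    # Other types (e.g. text/csv, application/octet-stream) are usually saved.
    return "download"


print(likely_browser_behavior({"Content-Type": "text/plain; charset=utf-8"}))  # display
print(likely_browser_behavior({"Content-Type": "text/csv"}))  # download
```

To try it on a real URL, you could pass in the headers of a response, for example requests.head(url, headers=headers).headers. Either way, whether Chrome displays or downloads the file has no bearing on whether Python can read it.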
