UnicodeDecodeError:"utf-8"编解码器无法解码字节



在为我的商业智能类加载这个数据集时遇到问题。我尝试了不同的csv文件,这是有效的。我试着在谷歌上搜索一些解决方案,但没有找到。任何帮助将非常感激!

# load data
col_names = ['age', 'gender', 'coffee_bags_bought', 'spent_last_week', 'spent_last_month', 'income', 'online', 'new_product']
# load dataset
coffeeStore = pd.read_csv("/content/CoffeeStore.xlsx", header=None, names=col_names)
coffeeStore.head(2)

这是我遇到的错误:

---------------------------------------------------------------------------
UnicodeDecodeError                        Traceback (most recent call last)
<ipython-input-35-e3969313ee59> in <module>()
3 col_names = ['age', 'gender', 'coffee_bags_bought', 'spent_last_week', 'spent_last_month', 'income', 'online', 'new_product']
4 # load dataset
----> 5 coffeeStore = pd.read_csv("/content/CoffeeStore.xlsx", header=None, names=col_names)
6 coffeeStore.head(2)
9 frames
/usr/local/lib/python3.7/dist-packages/pandas/_libs/parsers.pyx in pandas._libs.parsers.raise_parser_error()
UnicodeDecodeError: 'utf-8' codec can't decode bytes in position 15-16: invalid continuation byte

您也可以将引擎参数更改为'python'

coffeeStore = pd.read_csv("/content/CoffeeStore.xlsx", header=None, names=col_names,engine='python')

有关unicode, utf-8等的更详细的解释,请阅读这篇传奇的博客文章

您在excel文件上使用read_csv。使用read_excel代替

coffeeStore = pd.read_excel("/content/CoffeeStore.xlsx", header=None, names=col_names)

最新更新