'charmap' codec can't decode byte 0x9d in position 2273: character maps to <undefined>



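I'm trying to do sentiment analysis, and I'm adding a word library of embedding vectors to help my model convert words to numbers. I'm getting an error that I can't get past. Could you take a look and suggest a next step?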

import pandas as pd
d213_data = pd.read_csv('D213_Combined_Cleaned.csv')
d213_data
d213_data['Rating'].value_counts()
!pip install wget
import wget
url = 'http://downloads.cs.stanford.edu/nlp/data/glove.6B.zip'
filename = wget.download(url)
print(filename)
import sys
from zipfile import PyZipFile
for zip_file in sys.argv[1:]:
    pzf = PyZipFile('glove.6B.zip')
    pzf.extractall()
import numpy as np
words = dict()
def add_to_dict(d, filename):
    with open(filename, 'r') as f:
        for line in f.readlines():
            line = line.split(' ')
            print(line)
            break

            try:
                d[line[0]] = np.array(line[1:], dtype=float)
            except:
                continue
add_to_dict(words, 'glove.6B.50d.txt')


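The error suggests the file is being decoded with the platform's default encoding (the 'charmap' codec, typically cp1252 on Windows) rather than UTF-8. Consider passing the encoding explicitly when you open or read the file: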

with open('filename.extension', 'r', encoding='utf-8') as f:
    data = f.read()  # or iterate over f line by line
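
Applied to the loader from the question, the same idea might look like the sketch below. It assumes the GloVe text file is UTF-8 encoded and that each line is a word followed by space-separated numbers. The name `load_glove` is hypothetical, and plain lists of floats are used only to keep the sketch dependency-free; wrapping each list in `np.array(..., dtype=float)` as in the question should work the same way.

```python
def load_glove(path):
    """Read a GloVe-style text file into a dict mapping word -> list of floats.

    Hypothetical helper for illustration; assumes the file is UTF-8 encoded.
    """
    embeddings = {}
    # An explicit encoding avoids falling back to the platform default
    # (often cp1252 on Windows), where byte 0x9d has no mapping.
    with open(path, "r", encoding="utf-8") as f:
        for line in f:  # iterate lazily instead of f.readlines()
            parts = line.rstrip("\n").split(" ")
            try:
                embeddings[parts[0]] = [float(x) for x in parts[1:]]
            except ValueError:
                # Skip lines whose values cannot be parsed as numbers.
                continue
    return embeddings
```

It would be called as `words = load_glove('glove.6B.50d.txt')`. Catching only `ValueError` instead of using a bare `except:` keeps unrelated problems, such as a decoding error, visible rather than silently skipped.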
