invalid literal for int() with base 10: b'\x1f\x8b\x08\x08\x80\xff\xa8R\x02\



I am using Python 3.6 on Windows 10.

Please help me solve this problem. I am a beginner, so a brief explanation would be appreciated.

Error:

data loaded!
number of sentences: 2467
vocab size: 30417
max sentence length: 2721
loading word2vec vectors...
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-13-2ecf35f5c4fb> in <module>
8 print ('max sentence length: ' + str(max_l))
9 print ('loading word2vec vectors...',)
---> 10 w2v = load_bin_vec(w2v_file, vocab)
11 print ('word2vec loaded!')
12 print ('num words already in word2vec: ' + str(len(w2v)))
<ipython-input-12-824546589dfe> in load_bin_vec(fname, vocab)
49     with open(fname, "rb") as f:
50         header = f.readline()
---> 51         vocab_size, layer1_size = map(int, header.split())
52         binary_len =  np.dtype('float32').itemsize * layer1_size
53         for line in range(vocab_size):
ValueError: invalid literal for int() with base 10: b'\x1f\x8b\x08\x08\x80\xff\xa8R\x02\x03GoogleNews-vectors-negative300.bin\x00L\x9d\x07\xbc\xce\xf5\xfb\xff\xad\x902#\xbb\x8cP\x19\x99\xc5\xb9\xefsTF\xd1\x92R\xd1DB\x11E\xa9\xbe'

How do I get rid of this error?

On line 49, you open the file in 'rb' mode.
'rb' mode reads the data in binary format.
Replace 'rb' with 'r'.
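
One more thing worth checking: the first two bytes of the value in the error message, 1f 8b, are the gzip magic number, and the name GoogleNews-vectors-negative300.bin appears inside the header itself. That suggests the file passed as w2v_file may still be gzip-compressed, in which case changing the open mode alone would probably not help. Below is a minimal sketch under that assumption. The helper names open_maybe_gzip and read_w2v_header are hypothetical and not part of the original code; the header parsing mirrors line 51 of load_bin_vec.

```python
import gzip

GZIP_MAGIC = b"\x1f\x8b"  # every gzip stream starts with these two bytes


def open_maybe_gzip(fname):
    """Open fname for binary reading, decompressing on the fly if it is gzip.

    Hypothetical helper: it peeks at the first two bytes to decide.
    """
    with open(fname, "rb") as f:
        is_gzip = f.read(2) == GZIP_MAGIC
    return gzip.open(fname, "rb") if is_gzip else open(fname, "rb")


def read_w2v_header(fname):
    """Return (vocab_size, layer1_size) from the first line of the file."""
    with open_maybe_gzip(fname) as f:
        header = f.readline()
    # Same parsing as in load_bin_vec: two integers separated by whitespace.
    vocab_size, layer1_size = map(int, header.split())
    return vocab_size, layer1_size
```

If the file does turn out to be compressed, replacing open(fname, "rb") with open_maybe_gzip(fname) inside load_bin_vec is one option. Another is to decompress the .gz archive once and point w2v_file at the extracted .bin file.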
