Argument 'string' has incorrect type (expected str, got spacy.tokens.doc.Doc)



I have a DataFrame:

train_review = train['review']
train_review

It looks like this:

0      With all this stuff going down at the moment w...
1      The Classic War of the Worlds" by Timothy Hi...
2      The film starts with a manager (Nicholas Bell)...
3      It must be assumed that those who praised this...
4      Superbly trashy and wondrously unpretentious 8...

I add the tokens to a single string:

train_review = train['review']
train_token = ''
for i in train['review']:
    train_token += i

What I want is to tokenize the reviews with spaCy. Below is what I tried, but I get the following error:

Argument 'string' has incorrect type (expected str, got spacy.tokens.doc.Doc)

How can I fix this? Thanks in advance!

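In the for loop, you are taking spaCy Doc objects (spacy.tokens.doc.Doc) from the DataFrame and appending them to a string, so you should cast each one to str. Like this: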

train_review = train['review']
train_token = ''
for i in train['review']:
    train_token += str(i)
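
As a variation on the loop above, here is a minimal sketch that builds the same string with str.join instead of repeated +=. It is only an illustration: FakeDoc is a hypothetical stand-in for a non-str object such as spacy.tokens.doc.Doc, and the reviews list stands in for train['review']. Neither comes from the original post. Joining with a space is an assumption on my part; it keeps the last word of one review from running into the first word of the next.

```python
class FakeDoc:
    """Hypothetical stand-in for a non-str object such as spacy.tokens.doc.Doc."""

    def __init__(self, text):
        self.text = text

    def __str__(self):
        # str(obj) returns the plain text, which is what the cast relies on.
        return self.text


# Stand-in for train['review']: one plain str and one Doc-like object.
reviews = ["The film starts with a manager.", FakeDoc("Superbly trashy.")]

# Cast every item to str, then join with a space so that words at
# review boundaries do not run together.
train_token = " ".join(str(review) for review in reviews)

print(train_token)
```

The result is an ordinary str, which is the type the error message says was expected.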
