Gensim逐行数据帧摘要



我使用"Gensim"生成不同行的摘要。以下是原始数据帧的样子:

df.head()
Example Content
0   Not happy they have just reduced rates for Und...
1   One of the worst banks. I had a very bad exper...
2   Some one in lloyds has signed a form in My nam...
3   Card blocked due to ordering a takeaway from m...
4   There are plenty of better banks than Lloyds....

我能够使用gensim将摘要应用于每一行。问题是,我希望每一行摘要都与原始摘要相对应,但这并没有发生。以下是我的代码:

a = []
for i in df['Example Content']:
i= i + str(". This is second sentence. This is third")             # this is to add two more sentences so that gensim summarizes it. These sentence add no value to summary.
a = summarize(i, ratio=0.4, split = True)
df['Summary'] = a

这是上面代码的输出:

Example Content                                 Summary
0   Not happy they have just reduced rates for Und...       Today I got a new phone and switched my sim an...
1   One of the worst banks. I had a very bad exper...       Today I got a new phone and switched my sim an...
2   Some one in lloyds has signed a form in My nam...       Today I got a new phone and switched my sim an...
3   Card blocked due to ordering a takeaway from m...       Today I got a new phone and switched my sim an...
4   There are plenty of better banks than Lloyds....       Today I got a new phone and switched my sim an...

下面显示的是由gensim生成的每行的所有单独摘要:

The 2nd address was a shopping centre and they didnt even give me the name of the business.
I wasn't to know as I through Gallarias Novas was the shop name but that was just the place.
They said that they had issued a new card that I hadn't received and even though they new I was abroad using my card they stopped it anyway.
When my new card did arrive after getting home I now know the reason was that they were making me have a con tactless card whcih I did nto request.
Today I got a new phone and switched my sim and set up my banking apps inc Halifax and LloydÕs.
Halifax worked fine, usual 4 digit code and confirmation call came through and all set up in mins.

我应该如何获取与原始内容相对应的单个摘要并将其放置在数据帧中?

您一直在覆盖您的列表。更换

a = summarize(i, ratio=0.4, split = True)

带有

a.append(summarize(i, ratio=0.4, split = True))

最新更新