我是Pycharm Professional Ide,Python和NLTK的新手。我想将斯坦福纳特巨人用于我的项目工作。当我在Pycharm Professional
中使用Python文件中的以下代码from nltk.tag import StanfordNERTagger
from nltk.tokenize import word_tokenize
stanford_classifier = '/home/PycharmProjects/Stanford-NER-Python/stanford/stanford-ner-2015-12-09/classifiersenglish.all.3class.distsim.crf.ser.gz'
stanford_ner_path = '/home/PycharmProjects/Stanford-NER-Python/stanford/stanford-ner-2015-12-09/stanford-ner.jar'
# Creating Tagger Object
st = StanfordNERTagger(stanford_classifier, stanford_ner_path, encoding='utf-8')
text = 'While in France, Christine Lagarde discussed short-term stimulus efforts in a recent interview with the Wall Street Journal.'
tokenized_text = word_tokenize(text)
classified_text = st.tag(tokenized_text)
print classified_text
显示的错误是
Traceback (most recent call last):
File "/home/premchikkus/PycharmProjects/Stanford-NER-Python/main.py", line 9, in <module>
st = StanfordNERTagger(stanford_classifier, stanford_ner_path, encoding='utf-8')
File "/home/premchikkus/PycharmProjects/Stanford-NER-Python/venv/local/lib/python2.7/site-packages/nltk/tag/stanford.py", line 168, in __init__
super(StanfordNERTagger, self).__init__(*args, **kwargs)
File "/home/premchikkus/PycharmProjects/Stanford-NER-Python/venv/local/lib/python2.7/site-packages/nltk/tag/stanford.py", line 53, in __init__
verbose=verbose)
File "/home/premchikkus/PycharmProjects/Stanford-NER-Python/venv/local/lib/python2.7/site-packages/nltk/internals.py", line 719, in find_jar
searchpath, url, verbose, is_regex))
File "/home/premchikkus/PycharmProjects/Stanford-NER-Python/venv/local/lib/python2.7/site-packages/nltk/internals.py", line 635, in find_jar_iter
(name_pattern, path_to_jar))
LookupError: Could not find stanford-ner.jar jar file at /home/PycharmProjects/Stanford-NER-Python/stanford/stanford-ner-2015-12-09/stanford-ner.jar
我已经将pycharm IDE中的环境变量设置为Python/Stanford/Stanford-ner-2015-12-09/Stanford-ner.jar
我正在使用NLTK版本3.2.1
Python 2.7
Ubuntu 16.04
预先感谢。
没关系。我得到了它。stanford_classifier和stanford_ner_path的位置是错误的,需要在Standfordnertagger((中通过第一个参数((不需要ASCII。