Python Wikipedia library



我正在使用python库维基百科来解析数据。当它进入代码的第二部分时,我会得到页面错误。页面错误

import wikipedia

print ("1: Searching Wikipedia for 'List of Lexus vehicles'")
try:
    print (wikipedia.page('List of Lexus'))
    print ('-' * 60)
except wikipedia.exceptions.DisambiguationError as e:
    print (str(e))
    print ('+' * 60)
    print ('DisambiguationError: The page name is ambiguous')
print

print ("2: Searching Wikipedia for 'List of Lexus (vehicles)'")
print (wikipedia.page('List of Lexus_(vehicles)'))
print

result = wikipedia.page('List of Lexus_(vehicles)').content.encode('UTF8')
print ("3: Result of searching Wikipedia for 'List of Lexus_(vehicles)':")
print (result)
print
lexus_count = result.count('ct','lfa','rx')
print

print ("The Wikipedia page for 'Lexus_(company)' has " + 
    "{} occurrences of the word 'Lexus'".format(lexus_count))
print

更新了我可以解析页面数据,但是在计数上获取类型错误

23 print
24
25 lexus_count = result.count('ct','lfa','rx')
26 print
TypError: slice indices must be integers or None or have an __index__ method

您的程序有多个问题。这是一个更新的程序,其错误已固定和标记。

import wikipedia

print ("1: Searching Wikipedia for 'Lexus'")
try:
    print (wikipedia.page('Lexus'))
    print ('-' * 60)
except wikipedia.exceptions.DisambiguationError as e:
    print (str(e))
    print ('+' * 60)
    print ('DisambiguationError: The page name is ambiguous')
print

print ("2: Searching Wikipedia for 'Lexus (company)'")
result = wikipedia.page('Lexus (company)') 
# ERR; PAGE NAME SEPARATED BY SPACE NOT WITH AN UNDERSCORE
# <> PAGE ERROR AS PAGE WILL NOT BE FOUND.  
print (result)
print

result = result.content
print ("3: Result of searching Wikipedia for 'Lexus_(company)':")
print (result)
print
lexus_count = result.count('Lexus')
# changed variable name from orange_count -> lexus_count, as referenced in the print function below. 
# you were counting for 'lexus' you will not find any occurrences as this function is case sensitive.
print

print ("The Wikipedia page for 'Lexus_(company)' has " + 
    "{} occurrences of the word 'Lexus'".format(lexus_count))
print

希望这会有所帮助。

您完全遇到了哪个页面错误?

根据Wikipedia文档:https://wikipedia.readthedocs.io/en/latest/quickstart.html#quickstart

但是请注意-Wikipedia.summary如果页面是歧义页面,则如果该页面不存在,则会提出一个DisambiguationError(尽管该页面不存在(尽管默认情况下,但它试图找到您的含义和建议和搜索的页面。):

最新更新