我想获得使用谷歌特殊功能的搜索结果,如"define: [phrase]",但我似乎找不到相关信息。
有没有人知道我在哪里可以得到JSON格式的数据(像谷歌的api的其余部分)没有抓取结果页面手动?
谢谢,伊莱
我希望我没有为此设置赏金,因为几天后我偶然发现了答案,它真的很简单。
如果你想定义love:
http://www.google.com/dictionary/json?callback=a& sl = en& tl = en& q =爱
您将得到一个包含JSONP的响应流,其中包含以下内容(出于学习目的而"美化",API返回所有内容在一行中):
a(
{
"query":"love",
"sourceLanguage":"en",
"targetLanguage":"en",
"primaries":[
{
"type":"headword",
"terms":[
{
"type":"text",
"text":"love",
"language":"en",
"labels":[
{
"text":"Noun",
"title":"Part-of-speech"
}
]
},
{
"type":"phonetic",
"text":"/ləv/",
"language":"und"
},
{
"type":"sound",
"text":"http://www.gstatic.com/dictionary/static/sounds/de/0/love.mp3",
"language":"und"
}
],
"entries":[
{
"type":"related",
"terms":[
{
"type":"text",
"text":"loves",
"language":"und",
"labels":[
{
"text":"plural"
}
]
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"An intense feeling of deep affection",
"language":"en"
}
],
"entries":[
{
"type":"example",
"terms":[
{
"type":"text",
"text":"babies fill parents with intense feelings of x3cemx3elovex3c/emx3e",
"language":"en"
}
]
},
{
"type":"example",
"terms":[
{
"type":"text",
"text":"their x3cbx3ex3cemx3elovex3c/emx3e forx3c/bx3e their country",
"language":"en"
}
]
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"A deep romantic or sexual attachment to someone",
"language":"en"
}
],
"entries":[
{
"type":"example",
"terms":[
{
"type":"text",
"text":"it was x3cemx3elovex3c/emx3e at first sight",
"language":"en"
}
]
},
{
"type":"example",
"terms":[
{
"type":"text",
"text":"they were both x3cbx3ein x3cemx3elovex3c/emx3e withx3c/bx3e her",
"language":"en"
}
]
},
{
"type":"example",
"terms":[
{
"type":"text",
"text":"we were slowly x3cbx3efalling in x3cemx3elovex3c/emx3ex3c/bx3e",
"language":"en"
}
]
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"A personified figure of x3cemx3elovex3c/emx3e, often represented as Cupid",
"language":"en"
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"A great interest and pleasure in something",
"language":"en"
}
],
"entries":[
{
"type":"example",
"terms":[
{
"type":"text",
"text":"his x3cbx3ex3cemx3elovex3c/emx3e forx3c/bx3e football",
"language":"en"
}
]
},
{
"type":"example",
"terms":[
{
"type":"text",
"text":"we share a x3cbx3ex3cemx3elovex3c/emx3e ofx3c/bx3e music",
"language":"en"
}
]
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"Affectionate greetings conveyed to someone on onex27s behalf",
"language":"en"
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"A formula for ending an affectionate letter",
"language":"en"
}
],
"entries":[
{
"type":"example",
"terms":[
{
"type":"text",
"text":"take care, lots of x3cemx3elovex3c/emx3e, Judy",
"language":"en"
}
]
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"A person or thing that one x3cemx3elovesx3c/emx3e",
"language":"en"
}
],
"entries":[
{
"type":"example",
"terms":[
{
"type":"text",
"text":"she was x3cbx3ethe x3cemx3elovex3c/emx3e of his lifex3c/bx3e",
"language":"en"
}
]
},
{
"type":"example",
"terms":[
{
"type":"text",
"text":"their two great x3cemx3elovesx3c/emx3e are tobacco and whiskey",
"language":"en"
}
]
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"A friendly form of address",
"language":"en"
}
],
"entries":[
{
"type":"example",
"terms":[
{
"type":"text",
"text":"itx27s all right, x3cemx3elovex3c/emx3e",
"language":"en"
}
]
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"Used to express affectionate approval for someone",
"language":"en"
}
],
"entries":[
{
"type":"example",
"terms":[
{
"type":"text",
"text":"donx27t fret, therex27s a x3cemx3elovex3c/emx3e",
"language":"en"
}
]
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"(in tennis, squash, and some other sports) A score of zero; nil",
"language":"en"
}
],
"entries":[
{
"type":"example",
"terms":[
{
"type":"text",
"text":"x3cemx3elovex3c/emx3e fifteen",
"language":"en"
}
]
},
{
"type":"example",
"terms":[
{
"type":"text",
"text":"he was down two sets to x3cemx3elovex3c/emx3e",
"language":"en"
}
]
}
]
}
]
},
{
"type":"headword",
"terms":[
{
"type":"text",
"text":"love",
"language":"en",
"labels":[
{
"text":"Verb",
"title":"Part-of-speech"
}
]
},
{
"type":"phonetic",
"text":"/ləv/",
"language":"und"
},
{
"type":"sound",
"text":"http://www.gstatic.com/dictionary/static/sounds/de/0/love.mp3",
"language":"und"
}
],
"entries":[
{
"type":"related",
"terms":[
{
"type":"text",
"text":"loved",
"language":"und",
"labels":[
{
"text":"past participle"
}
]
},
{
"type":"text",
"text":"loves",
"language":"und",
"labels":[
{
"text":"3rd person singular present"
}
]
},
{
"type":"text",
"text":"loving",
"language":"und",
"labels":[
{
"text":"present participle"
}
]
},
{
"type":"text",
"text":"loved",
"language":"und",
"labels":[
{
"text":"past tense"
}
]
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"Feel a deep romantic or sexual attachment to (someone)",
"language":"en"
}
],
"entries":[
{
"type":"example",
"terms":[
{
"type":"text",
"text":"do you x3cemx3elovex3c/emx3e me?",
"language":"en"
}
]
}
]
},
{
"type":"meaning",
"terms":[
{
"type":"text",
"text":"Like very much; find pleasure in",
"language":"en"
}
],
"entries":[
{
"type":"example",
"terms":[
{
"type":"text",
"text":"Ix27d x3cemx3elovex3c/emx3e a cup of tea, thanks",
"language":"en"
}
]
},
{
"type":"example",
"terms":[
{
"type":"text",
"text":"I just x3cemx3elovex3c/emx3e dancing",
"language":"en"
}
]
},
{
"type":"example",
"terms":[
{
"type":"text",
"text":"a fun-x3cemx3elovingx3c/emx3e girl",
"language":"en"
}
]
}
]
}
]
}
]
}
,200,null)
Google Dictionary API以回调函数(JSONP)的形式返回。在可以在函数调用中处理JSON之前,必须剥离回调函数数据。
这意味着删除第一个{
之前的所有内容和最后一个}
之后的所有内容。
这之后你就完成了!这就是你需要的所有信息,当你在谷歌上搜索define love
时,它基本上是相同的(实际上更多一点!)信息。
from bs4 import BeautifulSoup
import requests
word = "democracy"
url = 'https://www.google.co.in/search?q=define%20' + word + '#cns=1'
response = requests.get(url, headers={"user-agent":"Mozilla/5.0(Macintosh; Intel Mac OS X 10.12; rv:49.0) Gecko/20100101 Firefox/49.0"})
html = response.content
final_soup = BeautifulSoup(html,"html5lib")
everyThing = final_soup.select("div._Jig")
for line in everyThing:
print("-",line.text)
我写了这个脚本来从google获取一个单词的定义。
对于任何回到这个问题的人来说,之前的所有答案似乎都被打破了。以下是我截至2022年4月的工作内容:
import bs4
import requests
def definition(word):
URL = "https://www.google.com/search?q=define+" + word
page = requests.get(URL)
soup = bs4.BeautifulSoup(page.text, 'html.parser')
first_find = soup.find('ol', class_='yRG22b v7pIac')
if (first_find is not None):
return first_find.find('div', class_='BNeawe s3v9rd AP7Wnd').getText()
return None
definition(word)返回一个字符串,该字符串是键入define (word)"到谷歌。如果谷歌没有该词的定义,它返回None。如果我违反了python的标准实践,我很抱歉,我是这门语言的初学者。
我假设这将打破在一个月当谷歌稍微修改他们的html。