
Python Goose not able to extract mashable / usatoday / politicalwire articles

Updated: 2023-09-05

I am using the python-goose extractor, and it fails on every article from mashable.com and usatoday.com. Can anyone fix this problem?

For a usatoday.com article:

from goose import Goose

g = Goose()
article = g.extract(url='http://www.usatoday.com/story/tech/columnist/talkingtech/2014/01/25/namm-2014---ik-multimedias-rings-to-make-music/4863193/')
# Reproduces the failure: the extracted text comes back empty.
assert article.cleaned_text == ''

For a Mashable article:

g = Goose()
article = g.extract(url='http://mashable.com/2014/01/26/square-cofounder-jim-mckelvey/')
# Same failure: no text is extracted.
assert article.cleaned_text == ''

For a politicalwire.com article:

g = Goose()
article = g.extract(url='http://politicalwire.com/archives/2014/01/27/some_republicans_go_off_script_in_sotu_response.html')
# Same failure: no text is extracted.
assert article.cleaned_text == ''

I think these are very important sites for text extraction. Can anyone suggest a fix? Thanks.

The latest version of Goose can extract articles from usatoday.com and mashable.com.
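
After upgrading, one way to recheck the URLs from the question is a small helper along these lines. This is only a sketch: find_empty_extractions is a hypothetical name, not part of Goose, and it assumes nothing beyond the extract(url=...) call and the cleaned_text attribute already used in the question.

```python
def find_empty_extractions(extractor, urls):
    """Return the URLs for which the extractor yields no cleaned text.

    `extractor` is assumed to behave like the Goose object in the
    question: it has extract(url=...) returning an object with a
    cleaned_text attribute. This helper is hypothetical, not a Goose API.
    """
    failed = []
    for url in urls:
        article = extractor.extract(url=url)
        # Treat None, empty, or whitespace-only text as a failed extraction.
        if not (article.cleaned_text or "").strip():
            failed.append(url)
    return failed
```

Calling find_empty_extractions(Goose(), urls) with the three URLs above would list any that still come back empty. That call needs network access, and the answer only mentions usatoday.com and mashable.com, so the politicalwire.com URL may still appear in the result.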
