我正在使用Nokogiri解析HTML,然后获取这些类型元素。
<li data-item="{"title":"where is title","slug":"about some",
"has_many_images":false,"show_image":"abbxb","created_at":1373737401,
"show_attr":{"value":"150"},
"location":"Alabama",
"category":"Table",
"is_business":false}">
//here other many more
</li>
现在我想得到这个data-item
,我正在使用:
page.css("li[data-item]")[0]
我得到了这样的东西:
#<Nokogiri::XML::Element:0x14fc250 name="li" attributes=[#<Nokogiri::XML::Attr:0x14fc178 name="class" value="item">,
等。。。
但我想要这样:
"{"title":"where is title","slug":"about some",
"has_many_images":false,"show_image":"abbxb","created_at":1373737401,
"show_attr":{"value":"150"},
"location":"Alabama",
"category":"Table",
"is_business":false}"
有什么建议吗?
您可以通过以下选择获得该属性:
page.at_xpath("//li[1]/@data-item").content
编辑
应@Priti的要求,进行更完整的演示:
body = %Q{
<body>
<li data-item='{"title":"where is title","slug":"about some",
"has_many_images":false,"show_image":"abbxb","created_at":1373737401,
"show_attr":{"value":"150"},
"location":"Alabama",
"category":"Table",
"is_business":false}'>
</li>
</body>
}
page = Nokogiri::XML(body)
result = page.at_xpath("//li[1]/@data-item").content
# "{"title":"where is title","slug":"about some", "has_many_images":false,"show_image":"abbxb","created_at":1373737401, "show_attr":{"value":"150"}, "location":"Alabama", "category":"Table", "is_business":false}"