
Spaces between HTML elements in Nokogiri::HTML#content

Updated: 2023-08-30

When I run this:

Nokogiri::HTML('<div class="content"><p>Hello</p><p>Good Sir</p></div>').content

I get this:

"HelloGood Sir"

Is there a way to get the following through Nokogiri's API?

"Hello Good Sir"

require 'nokogiri'

doc = Nokogiri::HTML('<div class="content"><p>Hello</p><p>Good Sir</p></div>')

# Select every text node in the document, regardless of the enclosing tag.
doc.xpath("//text()").map(&:text)
# => ["Hello", "Good Sir"]
doc.xpath("//text()").map(&:text).join(" ")
# => "Hello Good Sir"

# Select every <p> element in the document and take the text of each one.
doc.xpath("//p").map(&:text)
# => ["Hello", "Good Sir"]
doc.xpath("//p").map(&:text).join(" ")
# => "Hello Good Sir"

Just as Arup pointed out:

doc = Nokogiri::HTML('<div class="content"><p>Hello</p><p>Good Sir</p></div>')
doc.css('p').map(&:text).join(" ") #=> "Hello Good Sir"
