完整的新手,所以感谢建议。
我知道如何得到一个标签的内容,如果它有一个唯一的id,但许多网站都像下面,我想一个标签的内容跟随另一个标签与特定的字符串作为内容。
是否有一种方法可以尝试做一个条件表达式或提取整个部分,并运行正则表达式或更好的方式?
谢谢!
在例1中,我想要"May 11, 2021"它跟在以下标签后面:
<div class="a-section a-spacing-small a-text-center rpi-attribute-label">
<span>Publication date</span>
</div>
<div class="a-section a-spacing-small a-text-center">
<span class="rpi-icon book_details-publication_date"></span>
</div>
在例2中,我想要"September 7, 2021">
示例1:
<li class="a-carousel-card rpi-carousel-attribute-card" role="listitem" aria posinset="3">
<div class="a-section rpi-attribute-content">
<div class="a-section a-spacing-small a-text-center rpi-attribute-label">
<span>Publication date</span>
</div>
<div class="a-section a-spacing-small a-text-center">
<span class="rpi-icon book_details-publication_date"></span>
</div>
<div class="a-section a-spacing-none a-text-center rpi-attribute-value">
<span>May 11, 2021</span>
</div>
</div>
</li>
示例2:
<li><span class="a-list-item">
<span class="a-text-bold">Publication date
‏
:
‎
</span>
<span>September 7, 2021</span>
</span></li>
例#1,试试这个xpath
"//div[@class='a-section a-spacing-none a-text-center rpi-attribute-value']/span"
例2
"//span[@class='a-list-item']/span[2]"