我正在尝试在xml文件中搜索这些实例:
<xml>
<corpus>
<body>
<nonterminals>
<graph>
<s>
<nt id="s9_509" cat="fcl"></nt>
<nt id="s9_501" cat="pp">
<edge label="H" idref="s9_1"/>
<edge label="DP" idref="s9_502"/>
<edge label="STA" idref="s9_509"/>
<edge label="P" idref="s9_19"/>
<edge label="S" idref="s9_510"/>
<edge label="PU" idref="s9_25"/>
</nt>
</nonterminals>
</graph>
</s>
</body>
</corpus>
</xml>
我的代码是:
XPath xpath = XPathFactory.newInstance().newXPath();
String path = "//xml//corpus//build//s//graph//nonterminals//nt";
XPathExpression expr = xpath.compile(path);
System.out.println("Query1: "+path);
Object result = expr.evaluate(document, XPathConstants.NODESET);
NodeList nodes = (NodeList) result;
System.out.println("Number of nodes: "+nodes.getLength());
for (int i = 0; i < nodes.getLength(); i++) {
System.out.println(nodes.item(i).getAttributes().getNamedItem("id").getNodeValue());
这个查询正确吗?存在另一种尝试方式吗?如何删除节点结果?
假设这是 XML 的整个结构,并且<xml>
是根节点,我会说您的 XPath 应该是:
xml/corpus/build/s/graph/nonterminals/nt
如果要匹配每个nt
节点,而不管它出现在文档中的位置,则可以使用:
/nt
有一些有用的免费工具可以查看 XPath 查询的结果,而无需预先编写 java:
http://xml-copy-editor.sourceforge.net/
http://www.xpathtester.com/