Parsing XML with filter

Question

I parse an XML document in Java with:

doc = DocumentBuilderFactory
           .newInstance()
           .newDocumentBuilder()
           .parse(new URL(url).openStream());

This works, but is it possible to parse with a filter? For example, each entry in my XML file has a priority value; is it possible to parse only the entries with priority > 8?

The resulting doc should then contain only the elements with priority > 8.

Example xml:


<url>
http
2015-02-26
<titolo>Hello</titolo>
<priority>1.0</priority>
</url>
...

Thanks

Ravi K Thapliyal · Accepted Answer

Assume a sample input file named urls.xml.

First, build the full Document tree as usual:

Document document = DocumentBuilderFactory
           .newInstance()
           .newDocumentBuilder()
           .parse(new File("urls.xml"));

Then run an XPath query that selects all the url nodes above a certain priority (5 here; use 8 to match the question):

XPathExpression expr = XPathFactory.newInstance()
                      .newXPath().compile("//url[priority > 5]");
NodeList urls = (NodeList) expr.evaluate(document, XPathConstants.NODESET);
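
To make the query step concrete, here is a minimal, self-contained sketch. The class name PriorityFilterSketch, the method titlesAbove, and the inline sample XML are made up for illustration; only the url, titolo and priority element names come from the question. It passes the threshold in through an XPath variable ($min) rather than hard-coding it in the expression, which is one possible design and not something the answer above requires.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

class PriorityFilterSketch {

    // Hypothetical sample data; the <urls> wrapper and the values are assumptions.
    static final String SAMPLE =
          "<urls>"
        + "<url><titolo>Low</titolo><priority>1.0</priority></url>"
        + "<url><titolo>Mid</titolo><priority>6.5</priority></url>"
        + "<url><titolo>High</titolo><priority>9.0</priority></url>"
        + "</urls>";

    // Returns the <titolo> text of every <url> whose <priority> is greater
    // than minPriority. Assumes each matching <url> has a <titolo> child.
    static List<String> titlesAbove(String xml, double minPriority) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));

            XPath xpath = XPathFactory.newInstance().newXPath();
            // Resolve every XPath variable (here only $min) to the threshold.
            xpath.setXPathVariableResolver(name -> Double.valueOf(minPriority));
            NodeList urls = (NodeList) xpath.evaluate(
                    "//url[priority > $min]", doc, XPathConstants.NODESET);

            List<String> titles = new ArrayList<>();
            for (int i = 0; i < urls.getLength(); i++) {
                Element url = (Element) urls.item(i);
                titles.add(url.getElementsByTagName("titolo").item(0).getTextContent());
            }
            return titles;
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse or query the XML", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(titlesAbove(SAMPLE, 5));
        System.out.println(titlesAbove(SAMPLE, 8));
    }
}
```

Note that the whole document is still parsed into memory first; the XPath expression only selects from the finished tree.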

If you want to serialize the results to another XML file, create a new Document first:

Document result = DocumentBuilderFactory.newInstance()
        .newDocumentBuilder().newDocument();
Node root = result.createElement("results");
result.appendChild(root);

Then import the filtered url nodes into it and append them:

for (int i = 0; i < urls.getLength(); i++) {
    Node copy = result.importNode(urls.item(i), true);
    root.appendChild(result.createTextNode("\n\t"));
    root.appendChild(copy);
}
root.appendChild(result.createTextNode("\n"));
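
If you don't need a separate result document, one alternative (not what the code above does) is to remove the non-matching url elements from the original Document in place, so that the parsed doc itself only holds the entries you asked for. A rough sketch, with hypothetical names (PruneInPlaceSketch, parse, pruneNotAbove) and made-up sample data:

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

class PruneInPlaceSketch {

    // Parses an XML string into a DOM Document.
    static Document parse(String xml) {
        try {
            return DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse the XML", e);
        }
    }

    // Removes every <url> whose <priority> is NOT greater than minPriority
    // (this includes <url> elements with no <priority> child at all).
    // Returns the number of removed elements.
    static int pruneNotAbove(Document doc, double minPriority) {
        try {
            XPath xpath = XPathFactory.newInstance().newXPath();
            xpath.setXPathVariableResolver(name -> Double.valueOf(minPriority));
            NodeList toRemove = (NodeList) xpath.evaluate(
                    "//url[not(priority > $min)]", doc, XPathConstants.NODESET);
            int count = toRemove.getLength();
            for (int i = 0; i < count; i++) {
                Node url = toRemove.item(i);
                url.getParentNode().removeChild(url);
            }
            return count;
        } catch (Exception e) {
            throw new IllegalStateException("Could not prune the document", e);
        }
    }

    public static void main(String[] args) {
        // Hypothetical input; the <urls> wrapper is an assumption.
        Document doc = parse("<urls>"
                + "<url><priority>1.0</priority></url>"
                + "<url><priority>9.0</priority></url>"
                + "</urls>");
        int removed = pruneNotAbove(doc, 8);
        System.out.println("Removed " + removed + ", kept "
                + doc.getElementsByTagName("url").getLength());
    }
}
```

Whitespace text nodes that sat between the removed elements stay behind, so the serialized result may contain blank lines.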

Now all you need to do is serialize the new Document to a String and write that out to a file. Here I'm just printing it to the console:

System.out.println(
        ((DOMImplementationLS) result.getImplementation())
        .createLSSerializer().writeToString(result));
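
For actually writing the file, here is one possible sketch that uses a javax.xml.transform Transformer instead of LSSerializer.writeToString. The reason is that writeToString returns a string that is typically declared as UTF-16 in its XML declaration, which is easy to get wrong when the bytes are then saved as UTF-8. The class, method, and file names below (WriteResultSketch, sampleResults, writeTo, filtered-urls.xml) are hypothetical.

```java
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;

class WriteResultSketch {

    // Builds a tiny stand-in for the filtered "results" document.
    static Document sampleResults() {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder().newDocument();
            doc.appendChild(doc.createElement("results"));
            return doc;
        } catch (Exception e) {
            throw new IllegalStateException("Could not create the document", e);
        }
    }

    // Serializes doc to target as UTF-8, replacing any existing file.
    static void writeTo(Document doc, Path target) {
        try (OutputStream out = Files.newOutputStream(target)) {
            Transformer transformer = TransformerFactory.newInstance().newTransformer();
            transformer.setOutputProperty(OutputKeys.ENCODING, "UTF-8");
            transformer.setOutputProperty(OutputKeys.INDENT, "yes");
            transformer.transform(new DOMSource(doc), new StreamResult(out));
        } catch (Exception e) {
            throw new IllegalStateException("Could not write " + target, e);
        }
    }

    public static void main(String[] args) {
        Path target = Paths.get("filtered-urls.xml"); // hypothetical output file
        writeTo(sampleResults(), target);
        System.out.println("Wrote " + target.toAbsolutePath());
    }
}
```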

Output:
