Reputation: 21
I have little problem with my code.. I need to parse XML file into Java
but I need to parse certain things...I need to parse books that their price is over 10 and their publish date is over 2005..
I wrote a code for that using XPath, but it didn't work.
Here is the XML file:
<pre>
<code>
<catalog>
<book id="bk101">
<author>Gambardella, Matthew</author>
<title>XML Developer's Guide</title>
<genre>Computer</genre>
<price>41.95</price>
<publish_date>2000-10-01</publish_date>
<description>An in-depth look at creating applications
with XML.</description>
</book>
<book id="bk102">
<author>Ralls, Kim</author>
<title>Midnight Rain</title>
<genre>Fantasy</genre>
<price>5.95</price>
<publish_date>2002-12-16</publish_date>
<description>A former architect battles corporate zombies,
an evil sorceress, and her own childhood to become queen
of the world.</description>
</book>
<book id="bk103">
<author>Corets, Eva</author>
<title>Maeve Ascendant</title>
<genre>Fantasy</genre>
<price>5.95</price>
<publish_date>2000-11-17</publish_date>
<description>After the collapse of a nanotechnology
society in England, the young survivors lay the
foundation for a new society.</description>
</book>
<book id="bk104">
<author>Corets, Eva</author>
<title>Oberon's Legacy</title>
<genre>Fantasy</genre>
<price>5.95</price>
<publish_date>2001-03-10</publish_date>
<description>In post-apocalypse England, the mysterious
agent known only as Oberon helps to create a new life
for the inhabitants of London. Sequel to Maeve
Ascendant.</description>
</book>
<book id="bk105">
<author>Corets, Eva</author>
<title>The Sundered Grail</title>
<genre>Fantasy</genre>
<price>5.95</price>
<publish_date>2001-09-10</publish_date>
<description>The two daughters of Maeve, half-sisters,
battle one another for control of England. Sequel to
Oberon's Legacy.</description>
</book>
<book id="bk106">
<author>Randall, Cynthia</author>
<title>Lover Birds</title>
<genre>Romance</genre>
<price>4.95</price>
<publish_date>2003-09-02</publish_date>
<description>When Carla meets Paul at an ornithology
conference, tempers fly as feathers get ruffled.</description>
</book>
<book id="bk107">
<author>Thurman, Paula</author>
<title>Splish Splash</title>
<genre>Romance</genre>
<price>4.95</price>
<publish_date>2004-11-02</publish_date>
<description>A deep sea diver finds true love twenty
thousand leagues beneath the sea.</description>
</book>
<book id="bk108">
<author>Knorr, Stefan</author>
<title>Creepy Crawlies</title>
<genre>Horror</genre>
<price>4.95</price>
<publish_date>2005-12-06</publish_date>
<description>An anthology of horror stories about roaches,
centipedes, scorpions and other insects.</description>
</book>
<book id="bk109">
<author>Kress, Peter</author>
<title>Paradox Lost</title>
<genre>Science Fiction</genre>
<price>6.95</price>
<publish_date>2006-11-02</publish_date>
<description>After an inadvertant trip through a Heisenberg
Uncertainty Device, James Salway discovers the problems
of being quantum.</description>
</book>
<book id="bk110">
<author>O'Brien, Tim</author>
<title>Microsoft .NET: The Programming Bible</title>
<genre>Computer</genre>
<price>36.95</price>
<publish_date>2006-12-09</publish_date>
<description>Microsoft's .NET initiative is explored in
detail in this deep programmer's reference.</description>
</book>
<book id="bk111">
<author>O'Brien, Tim</author>
<title>MSXML3: A Comprehensive Guide</title>
<genre>Computer</genre>
<price>36.95</price>
<publish_date>2007-12-01</publish_date>
<description>The Microsoft MSXML3 parser is covered in
detail, with attention to XML DOM interfaces, XSLT processing,
SAX and more.</description>
</book>
<book id="bk112">
<author>Galos, Mike</author>
<title>Visual Studio 7: A Comprehensive Guide</title>
<genre>Computer</genre>
<price>49.95</price>
<publish_date>2008-04-16</publish_date>
<description>Microsoft Visual Studio 7 is explored in depth,
looking at how Visual Basic, Visual C++, C#, and ASP+ are
integrated into a comprehensive development
environment.</description>
</book>
</catalog>
</code>
</pre>
This is my Java code:
XPathFactory factory = XPathFactory.newInstance();
XPath path = factory.newXPath();
XPathExpression xpe = path.compile("//book[price>10]/author | //book[publish_date>2005]/publish_date");
File xmlDocument = new File("boo.xml");
InputSource inputSource = new InputSource(new FileInputStream(xmlDocument));
Object result = xpe.evaluate(inputSource, XPathConstants.NODESET);
NodeList nodeList = (NodeList)result;
for (int i = 0; i < nodeList.getLength(); i++) {
Node nNode = nodeList.item(i);
System.out.println("\nCurrent Element :" + nNode.getNodeName());
if (nNode.getNodeType() == Node.ELEMENT_NODE) {
Element eElement = (Element) nNode;
System.out.println("Author : "
+ eElement
.getElementsByTagName("author")
.item(0)
.getTextContent());
System.out.println("Title : "
+ eElement
.getElementsByTagName("title")
.item(0)
.getTextContent());
System.out.println("Genre : "
+ eElement
.getElementsByTagName("genre")
.item(0)
.getTextContent());
System.out.println("Price : "
+ eElement
.getElementsByTagName("price")
.item(0)
.getTextContent());
System.out.println("Publish date : "
+ eElement
.getElementsByTagName("publish_date")
.item(0)
.getTextContent());
System.out.println("Description : "
+ eElement
.getElementsByTagName("description")
.item(0)
.getTextContent());
}
}
So if you could help me I would appreciate it
Upvotes: 1
Views: 691
Reputation: 153
try to use the following xpath condition:
//book[price>10 and publish_date>'2005']
This only select: bk110, bk111, bk112.
Tested on https://www.freeformatter.com/xpath-tester.html#ad-output
Upvotes: 0
Reputation: 1188
The problem is with your XPath . The "|" operator basically allows you to select several paths. So in your code this will just return all the author and publish_date nodes. That's not what you want. You seem to want a list of books that match a certain criteria. In that case you just want to use the following XPath:
XPathExpression xpe = path.compile("//book[price>10 and number(translate(publish_date,'-','')) >20050101]");
Note the date comparison. I just use Jan 1, 2005 for dates greater than 2005. Adjust accordingly if that's not what you want.
Upvotes: 2