get Element by ID using DOM parser in JAVA

Question

I have an XML file which is structured like that:



  
     
        jeune
      
      
         
           
              un jeune homme
           
          
      
  
  
    
        petits
            mpl

I need to parse it using JAVA to obtain each quote value contained in a cit element with the attribute type="translation" :

I just need to have the text content of the quote element but I don't need to have the text content of the immediate node such as petits mpl
I don't need to have the text content of the quote element contained in an re element

Finally I need to obtain this result:

entry ==> young_1
  translations ==> [jeune;petits]

For now my JAVA code is:

    //load xml document for DOM parsing
    Document doc = loadXMLFromString(xmlContent);

    //now try to parse it
    NodeList nList = doc.getElementsByTagName("sense");
    for (int i = 0; i < nList.getLength(); i++) {
        Node nNode = nList.item(i);
            if (nNode.getNodeType() == Node.ELEMENT_NODE) {
                Element eElement = (Element) nNode;
                NodeList fieldNodes = eElement.getElementsByTagName("cit");
                for(int j = 0; j < fieldNodes.getLength(); j++) {
                    Node fieldNode = fieldNodes.item(j);
                    NamedNodeMap attributes = fieldNode.getAttributes();
                    Node attr = attributes.getNamedItem("type");
                    if(attr != null) {
                        if(attr.getTextContent().equals("translation")) {
                            //how can I access  element ???
                        }
                    }
                }
            }
        }

But I don't know how can I access the ...

M A · Accepted Answer

You can access the element exactly the same way you're accessing the elements: by using the Element#getElementsByTagName(String name) method:

Node attr = attributes.getNamedItem("type");
if (attr != null) {
    if (attr.getTextContent().equals("translation")) {
        Element citElement = (Element) fieldNode;
        NodeList quoteNodeList = citElement.getElementsByTagName("quote");
        if(quoteNodeList.getLength() > 0) {
            Node quoteNode = quoteNodeList.item(0);
            String quote = quoteNode.getTextContent();
            ...
        }
    }
}

In order to exclude nodes contained in a node, you can check the parent of the node using nNode.getParentNode().getNodeName(), e.g.:

 if (!nNode.getParentNode().getNodeName().equals("re")) {
       ....
 }

get Element by ID using DOM parser in JAVA

Answers (1)

Related Questions