Reputation: 514
I'm trying to get an attribute id (fileID
) from my XML document to use as the filename for my XML split. The split works I just need to extract the fileID
to use as the name.
I could use this as help on this.
This is my xml document
<root>
<envelope fileID="000152OP.XML">
<record id="850">
</record>
</envelope>
<envelope fileID="000153OP.XML">
<record id="850">
</record>
</envelope>
<envelope fileID="000154OP.XML">
<record id="850">
</record>
</envelope>
</root>
And here's my Java code [EDITED] I can read the attribute now but it doesn't create the last xml file. So in my example it create the first 2 files with the correct name but last fileID "000154OP.XML" isn't created.
public static void splitXMLFile (String file) throws Exception {
String[] temp;
String[] temp2;
String[] temp3;
String[] temp4;
String[] temp5;
String[] temp6;
File input = new File(file);
DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
Document doc = dbf.newDocumentBuilder().parse(input);
XPath xpath = XPathFactory.newInstance().newXPath();
NodeList nodes = (NodeList) xpath.evaluate("//root/envelope", doc, XPathConstants.NODESET);
int itemsPerFile = 1;
Node staff = doc.getElementsByTagName("envelope").item(0);
NamedNodeMap attr = staff.getAttributes();
Node nodeAttr = attr.getNamedItem("fileID");
String node = nodeAttr.toString();
temp = node.split("=");
temp2 = temp[1].split("^\"");
temp3 = temp2[1].split("\\.");
Document currentDoc = dbf.newDocumentBuilder().newDocument();
Node rootNode = currentDoc.createElement("root");
File currentFile = new File("C:\\XMLFiles\\" + temp3[0]+ ".xml");
for (int i=1; i <= nodes.getLength(); i++) {
Node imported = currentDoc.importNode(nodes.item(i-1), true);
rootNode.appendChild(imported);
Node staff2 = doc.getElementsByTagName("envelope").item(i);
NamedNodeMap attr2 = staff2.getAttributes();
Node nodeAttr2 = attr2.getNamedItem("fileID");
String node2 = nodeAttr2.toString();
temp4 = node2.split("=");
temp5 = temp4[1].split("^\"");
temp6 = temp5[1].split("\\.");
if (i % itemsPerFile == 0) {
writeToFile(rootNode, currentFile);
rootNode = currentDoc.createElement("root");
currentFile = new File("C:\\XMLFiles\\" + temp6[0]+".xml");
}
}
writeToFile(rootNode, currentFile);
}
private static void writeToFile(Node node, File file) throws Exception {
Transformer transformer = TransformerFactory.newInstance().newTransformer();
transformer.transform(new DOMSource(node), new StreamResult(new FileWriter(file)));
}
Upvotes: 3
Views: 13420
Reputation: 6867
ArrayList<String> files = new ArrayList<String>();
SAXBuilder builder = new SAXBuilder();
Document Doc;
try {
Doc = builder.build(new File(myxmlfile.xml));
Element root = Doc.getRootElement();
List<Element> category = root.getChildren();
for(int i=0 ; i < category.size(); i++) {
Element elem = category.get(i);
String file = elem.getAttributeValue("fileID");
files.add(file);
}
} catch (Exception e) {
}
This will give you an array list of the fileIds in the XML file. I've used the SAX reader to parse the XML.
Upvotes: 1
Reputation: 49177
Perhaps you can try the following xpath:
//root/envelope/record/@id
If the XPath library you're using does not support the entire XPath set you can try the excellent library jaxen
Upvotes: 2