user156073
user156073

Reputation: 2001

parsing a XML file in java

I have wrote a SAX parser. It works fine when the attibute values are nested in double quotes. But if i dont use quote it throws a exception. I want my parser to parse the XML file whose attributes values are not inside quotes. I want to parse following type of file:

<root>
    <tag1 attribute1=value1 > my data  </tag1>
</root>

Note that value1 is not inside quotes

Can i make my parser to parse the above file? If yes how?

Upvotes: 0

Views: 663

Answers (2)

Sam
Sam

Reputation: 6250

Try NekoHTML ( http://nekohtml.sourceforge.net/usage.html )

e.g.

package sample;

import org.apache.xerces.parsers.AbstractSAXParser;
import org.cyberneko.html.HTMLConfiguration;

public class HTMLSAXParser extends AbstractSAXParser {
    public HTMLSAXParser() {
        super(new HTMLConfiguration());
    }
}

Upvotes: 0

cletus
cletus

Reputation: 625007

The SAX parser won't read that because it's not well-formed XML. All attribute values need to be enclosed in either single or double character quotes.

To make your parser read it, you'd find have to tidy/purify/fix it with a relevant library.

Upvotes: 6

Related Questions