Implementing an xml parser in c

Question

i am looking at building a simple xml parser with c99, i want to implement every single detail to it just for learning purposes, from my understanding the best way is implementing a tree structure and tokenizing the xml string into a tree structure so it will look something like enter image description here

and i will have 2 simple structs one that represents a node and one that represents an attribute, how bad is the above design?

any suggestions for improvement?

M Oehm · Accepted Answer

The complexity of your chosen task aside, your data structure looks good at first sight, but in my opinion there are two or three things wrong:

You'll have to account not only for child nodes, but also for sibling nodes that share the same parent
There's no need to make the sttribute tree a binary tree. For simplicity, I'd just use a singly-linked list.
You need to account for the contents of the nodes between the opening and closing brackets (unless your node structure already accounts fot it.)

So you really need a binary tree for the xml structure itself and a linked list of attributes for each node. For example, consider this simple xml-style data:


    
        Consomme
        Tomato soup
    
    
        Green salad
    
    
        Steak and kidney pie
        Spinach lasagna
    
    
        Fruit
        Ice cream
        Coffee

The food items are the children of the courses, but are siblings of each other if they have the same course as parent. The tree structure looks like the indentation: Items on the same level are siblings, indented items are children.

You need only keep a pointer to the oldest child, other children are reachable via the sibling relationship, which is also a pointer. (In binary-tree nomenclature, children are the left links and siblings are the right links.) For easy traversing you should also keep a pointer to the parent.

The textual content and the attributes are just data attached to the nodes.

(Of course, looking at the source of existing XML parsers might give you better ideas.)

Implementing an xml parser in c

Answers (2)

Related Questions