C# parse XML File

Question

I've a problem to parse my XML File (RSS Feed) in C#. I just want to read out the "entry" entries (the root parent - "feed" - is not relevant). All "entry" entries are almost even, except the part "state". Some entries doesn't have that entry.

So i just want to read out the following: "entry" nodes:

updated
expires
title
summary
state (if exists)

Any suggestions? Thank you very much.



   2011-01-01T00:00:00+0100
   
   
      Mr X
      Mr_X@domain.com
   
   Some infos....
   domain.com

   2011-01-01T00:00:00Z
   2011-01-02T00:00:00Z
   My first Title
   First ID
  
   My first important summary
   domain.com
   
      
         
            
         
      
   


  2011-01-01T00:00:00Z
  2011-01-02T00:00:00Z
  My second Title
  active
  Second ID
  
  My second important summary
  domain.com
  
    
      
        
      
    
  
  
{

   2011-01-01T00:00:00+0100
   
   
      Mr X
      Mr_X@domain.com
   
   Some infos....
   domain.com

   2011-01-01T00:00:00Z
   2011-01-02T00:00:00Z
   My first Title
   First ID
  
   My first important summary
   domain.com
   
      
         
            
         
      
   


  2011-01-01T00:00:00Z
  2011-01-02T00:00:00Z
  My second Title
  active
  Second ID
  
  My second important summary
  domain.com

My current C# code:

public void ParseXML(XmlDocument xmlFile)
    {
        ArrayList updated = new ArrayList();
        ArrayList expires = new ArrayList();
        ArrayList title = new ArrayList();
        ArrayList summary = new ArrayList();
        ArrayList state = new ArrayList();

        ObservableCollection trafInfo = new ObservableCollection();
        myCollection = trafInfo;
        XmlNodeReader reader = new XmlNodeReader(xmlFile);

        StringBuilder output = new StringBuilder();

        while (reader.Read())
        {
            switch (reader.NodeType)
            {
                case XmlNodeType.Element:
                    if(reader.Name == "updated")
                    {
                        updated.Add(reader.ReadString());
                    }

                    if (reader.Name == "expires")
                    {
                        expires.Add(reader.ReadString());
                    }

                    if (reader.Name == "title")
                    {
                        title.Add(reader.ReadString());
                    }

                    if (reader.Name == "summary")
                    {
                        summary.Add(reader.ReadString());
                    }

                    if (reader.Name == "state")
                    {
                        state.Add(reader.ReadString());
                    }

                    break;
            }
        }
    }

In that case, I don't have a relationship between the data (if state doesn't exists).

mj82 · Accepted Answer

You could use XPath expression for that. Below is complete example on console-appliactaion - as you use xlmns namespace, it requries a little modification of ParseXML method.

using System;
using System.Xml;

namespace ConsoleApplication1
{
    class Program
    {
        static void Main(string[] args)
        {
            XmlDocument xmlDocument = new XmlDocument();
            xmlDocument.Load("XMLFile1.xml");
            XmlNamespaceManager xmlnm = new XmlNamespaceManager(xmlDocument.NameTable);
            xmlnm.AddNamespace("ns", "http://www.w3.org/2005/Atom");

            ParseXML(xmlDocument, xmlnm);

            Console.WriteLine("
---XML parsed---");
            Console.ReadKey();
        }

        public static void ParseXML(XmlDocument xmlFile, XmlNamespaceManager xmlnm)
        {
            XmlNodeList nodes = xmlFile.SelectNodes("//ns:updated | //ns:expires | //ns:title | //ns:summary | //ns:state", xmlnm);

            foreach (XmlNode node in nodes)
            {
                Console.WriteLine(node.Name + " = " + node.InnerXml);
            }
        }
    }
}

// in XPath expression means, you want to select all nodes with specific name, no matter where they are.

If you want to search only elements, you can use following:
"//ns:entry/ns:updated | //ns:entry/ns:expires | //ns:entry/ns:title | //ns:entry/ns:summary | //ns:entry/ns:state"

C# parse XML File

Answers (2)

Related Questions