HtmlAgilityPack multiple element

Question

I have a html document that contains multiple divs

Example:



I don't know how to adapt my code to extract the href and the title element 
at the same time.

Each div should be an object with the included a tags as properties.

public class CheckBoxListItem
{
    public string Text { get; set; }
    public string Href { get; set; }
}

Erwin · Accepted Answer

You can use the following xpath query to retrieve only a tags with a title and href :

//a[@title and @href]

The you can use your code like this:

List items = new List();
var nodes = Web.DocumentNode.SelectNodes("//a[@title and @href]");
if (nodes != null)
{
   foreach (var node in nodes)
   {
      items.Add(new CheckBoxListItem()
      {
        Text = node.Attributes["title"].Value,
        Href = node.Attributes["href"].Value
      });
   }
}

HtmlAgilityPack multiple element

Answers (2)

Related Questions