How to get namespaced nodes from scala.xml?

Question

Looking at RSS, something like Craigslist's(http://chambana.craigslist.org/cta/index.rss) gives both nodes that are namespaced and not.

something like:



<![CDATA[ 1965 Pontiac Tempest GTO tribute ]]>

...tl;dr...

something like:

(item \ "title").text

gives the title twice. How do you access a namespaced node?

Travis Brown · Accepted Answer

You'll need to filter the resulting NodeSeq:

val unprefixedTitle = (item \ "title").filter(_.prefix == null)
val dublinCoreTitle = (item \ "title").filter(_.prefix == "dc")

Each of these filtered sequences will contain a single element.

If you have the entire document (or at least the part with the namespace declarations) you can filter by namespace instead of prefix, which is more robust:

val dublinCoreTitle = (item \ "title").filter(
  _.namespace == "http://purl.org/dc/elements/1.1/"
)

Now you'll get the desired element even if you're working with a document that happens to map this namespace to a different prefix.

How to get namespaced nodes from scala.xml?

Answers (1)

Related Questions