Xquery: Counting the number of occurrences of a term in each record within a set of records

Question

Given a set of xml records and a set of terms $terms . The terms in $terms sequence are extracted from the set of records. I want to count the number of occurrences of each term in each paragraph record. I used the following code to do so:

for $record in /rec:Record
for $term in $terms
return   xdmp:unquote(concat('',string(count(lower-case($record/rec:paragraph )[. = lower-case($term)])), ''))

For each term in each record i got 0 count:

Example: $term:='Mathematics', $record/rec:paragraph:='Mathematics is the study of topics such as quantity'

I want the number of occurances of the term Mathematics in $record/rec:paragraph

Any idea of what caused this result? Is there any other way to count the number of occurrences of each of the terms in each paragraph.

Chondrops · Accepted Answer

Use tokenize() to split up the input string into word tokens. Then the counting itself is trivial. For example:

let $text := 'Mathematics is the study of topics such as quantity'
let $myterms := 'mathematics'
let $wds := tokenize($text, '\s+')

for $t in $myterms
return {count($wds[lower-case(.)=lower-case($t)])}

Returns this:

Xquery: Counting the number of occurrences of a term in each record within a set of records

Answers (1)

Related Questions