Solr Snowball stemmer is inconsistent with Spanish

Question

I have this stemmed field:

The expected result of the search query alquileres (rents) would be a match of alquiler (rent). But when I go to "Field Analysis" in the Solr Admin site, and check an index value of alquiler and a query value of alquileres, the following happens:

When indexing alquiler, it gets stemmed into alquil.
When querying alquileres, it gets stemmed into alquiler.

So the simple case of searching the plural form of a word (alquileres) would not match its singular form (alquiler).

Shouldn't both the index and the query be stemmed into the same stem (either alquiler or alquil)? Is this a limitation of the algorithm or a misunderstanding/misconfiguration from my part?

Romain Meresse · Accepted Answer

Snowball stemming is very limited... You'd get better result by using a dictionary (Hunspell stemmer) : http://wiki.apache.org/solr/Hunspell

Solr Snowball stemmer is inconsistent with Spanish

Answers (2)

Related Questions