Revelants queries suggestion for autocomplete with Solr

Question

I use Solr 6.4 with Haystack 2.6.1, pySolr 3.6:

I'm looking for a google like suggestions autocomplete. Actually use EdgeNGram works good but it returns my documents titles only what is not what I want:

example:

typing: 'new y'
return:

New york, fabulous city that never sleep
A trip to new york by night
...

This give the user only the choice to select a document in particular in the suggestion list and the search will return only document with search based on suggested title.

What I want is a suggestion of revelants words like:

typing: 'new y'
return:

new york
new york by night
new york city
trip to new york

There is an article that suggest to use indexed queries by users that return results and then to use these queries as suggestions: https://lucidworks.com/2009/09/08/auto-suggest-from-popular-queries-using-edgengrams/

This mean parsing solr log or use a Data import (DIH) from a bunch of saved user's queries in DB.

Actually this article is pretty old (2009) and since then Solr have bring to us the Suggester (https://cwiki.apache.org/confluence/display/solr/Suggester)

Anyway I wonder if there is actually a good tutorial on how to use Suggester with revelant queries instead of returning my documents titles without the need to save the user's queries in DB, import them via scheduled process, reindexing, etc.

My search_indexes.py

class ArticleIndex(indexes.SearchIndex, indexes.Indexable): 

    text = indexes.CharField(document=True, use_template=True)
    created = indexes.DateTimeField(model_attr='created')
    rating = indexes.IntegerField(model_attr='rating')
    title = indexes.CharField(model_attr='title', boost=1.125)
    term = indexes.EdgeNgramField(model_attr='title')


    def get_model(self):
            return Article

My article_text.txt

{{ object.title }}
{{ object.created }}
{{ object.rating }}

My schema.xml

My solrconfig.xml


    
        true
        infixSuggester
        true
        10
        true
    
    
        suggest
    


    
        infixSuggester
        AnalyzingInfixLookupFactory
        infix_suggestions
        false
        DocumentDictionaryFactory
        term
        weight
        suggestType
        false
        false

I use pysolr to query Solr as Haystack doesn't have the suggest method implemented yet:

from pysolr import Solr

solr = Solr(settings.HAYSTACK_CONNECTIONS['default']['URL'], search_handler='/suggest', use_qt_param=False)
raw_results = solr.search('', **{'suggest.q': query_string})

kollo · Accepted Answer

After struggling hours I finally get something. Not perfect but good enough.

According to this article : http://alexbenedetti.blogspot.fr/2015/07/solr-you-complete-me.html

I used the FreeTextLookupFactory

My search_indexes.py

class ArticleIndex(indexes.SearchIndex, indexes.Indexable): 

    text = indexes.CharField(document=True, use_template=True)
    created = indexes.DateTimeField(model_attr='created')
    rating = indexes.IntegerField(model_attr='rating')
    title = indexes.CharField(model_attr='title', boost=1.125)

    def get_model(self):
            return Article

My schema.xml

My Solrconfig.xml


  
    suggest
    FreeTextLookupFactory 
    DocumentDictionaryFactory
    title
    3
    0.004
    false
    false
     
    text_general
  



  
    suggest
    true
    10
  
  
    suggest

As I use Solr 6.4, it is by default on managed schema mode (which did not take my changes in schema.xml in consideration), I had to switch to manual edit mode by adding in solrconfig.xml :

See here: https://cwiki.apache.org/confluence/display/solr/Schema+Factory+Definition+in+SolrConfig#SchemaFactoryDefinitioninSolrConfig-Switchingfromschema.xmltoManagedSchema

Then restart Solr, Rebuild index using Haystack with rebuild_index

And of course build the suggester with curl: curl http://127.0.0.1:8983/solr/collection1/suggest?suggest.build=true

And finally the results:

curl http://127.0.0.1:8983/solr/collection1/suggest?suggest.q=new%20y

I will try to digg more into the FreeTextLookupFactory to see if I can make it more accurate but it is already satisfying. Hope this help.

PS: always keep an eye on the logs at: http://127.0.0.1:8983/solr/#/~logging I would strongly suggest to have it always open on a tab. It saved my hours of pain...

Revelants queries suggestion for autocomplete with Solr

Answers (2)

Related Questions