see613
see613

Reputation: 494

Indexing html with Sphinx without complex scripts

As far as I know the sphinx search engine can index html, but it doesn't have any in-built drivers like it does for sql-data. That means we have to parse and prepare html content ourselves.

Does anyone know of any drivers or third party add-ons make sphinx index html automatically?

Can anyone help? Thanks in advance.

Upvotes: 1

Views: 277

Answers (1)

barryhunter
barryhunter

Reputation: 21091

Well if you have a database of the .html filenames, can use

http://sphinxsearch.com/docs/current.html#conf-sql-file-field

to index them, sphinx will load each individiaul file in turn and index the contents.

Combine with http://sphinxsearch.com/docs/current.html#conf-html-strip

Upvotes: 2

Related Questions