houbysoft

Reputation: 33430

Zoom Search Engine-like search engine, but for Linux/UNIX

I recently found the Zoom Search Engine, which struck me as quite interesting, since its software allows for easy decoupling of the indexing process and the searching process.

In other words, you run the indexer on your local machine, then upload the resulting index, plus the PHP files that search it, to your webserver.

So your webserver doesn't have to do the indexing. My host is a shared environment where it's best to use as few resources as possible, so this would be great for me. Moreover, I have a mostly unused small server at home (not the same machine as the webserver) that I could use for indexing.

However, it runs Linux and is accessible over SSH only, so the Zoom Search Engine is not an option.

Is there something that has the same principle as the Zoom Search Engine (index locally, upload index + PHP to website), but available for a command line Linux environment?
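To make the principle concrete, here is a minimal sketch of the split using only the Python standard library. It is an illustration of the idea, not how Zoom works internally: the function names, the sample pages and the JSON index format are all assumptions made for this example. JSON is used because a PHP script on the web host could read the same file with `json_decode`.

```python
import json
import re
from collections import defaultdict

TOKEN = re.compile(r"[a-z0-9]+")


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return TOKEN.findall(text.lower())


def build_index(pages):
    """Offline step (home server): map each term to the URLs containing it.

    `pages` is a hypothetical dict of {url: page_text}.
    """
    postings = defaultdict(set)
    for url, text in pages.items():
        for term in tokenize(text):
            postings[term].add(url)
    # Sets are not JSON-serializable, so store sorted lists instead.
    return {term: sorted(urls) for term, urls in postings.items()}


def save_index(index, path):
    """Write the index to a single file that can be uploaded as-is."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(index, fh)


def search(path, query):
    """Online step (web host): load the prebuilt file, no indexing needed.

    Returns the URLs that contain every term of the query (AND semantics).
    """
    with open(path, encoding="utf-8") as fh:
        index = json.load(fh)
    terms = tokenize(query)
    if not terms:
        return []
    hits = set(index.get(terms[0], []))
    for term in terms[1:]:
        hits &= set(index.get(term, []))
    return sorted(hits)
```

The only thing the web host ever touches is the uploaded file, which is the property I'm after; ranking, stemming and incremental updates are left out of the sketch.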

Upvotes: 2

Views: 1843

Answers (3)

Nikita K

Reputation: 1

The closest replacement for the Zoom Search Engine is the Whoosh module for Python. It also creates index files and does not need a database or any running service. It’s also very fast.

Its results are much more relevant, and it is environment-independent, since it is written entirely in Python. Zoom works very badly with big documents: they break the relevancy of its results and you get a mess in the output. Whoosh handles large inputs well too.

The difference is that you’ll have to prepare the data for Whoosh yourself. Zoom can extract text from PDF and DOC files, and it even recognizes TIFF, JPEG and some other formats, but the relevance of its results is very poor compared to Whoosh.

The basic setup with Whoosh is quite easy; it can take 10-15 minutes if you have a dataset prepared and some knowledge of Python.

https://pypi.org/project/Whoosh/

This is quite an old question, but I ran into the difficulty of finding an easy, functional and fast solution that doesn’t require launching extra services such as a database or a search engine service. I hope this helps someone decide on a solution.

Upvotes: 0

Yavar

Reputation: 11931

SPHINX SEARCH SERVER: http://sphinxsearch.com/

It fulfills all your needs and is used by some popular sites like Craigslist, MySQL etc.

Sphinx works well with PHP: it ships with a PHP API client, while the actual engine is written in C++. It's blazing fast.

I myself use Solr/Lucene, but I give Sphinx +1 for your task.

Upvotes: 2

Naveen A.N

Reputation: 31

My recommendation is to have a look at OpenSearchServer, a Lucene-based search engine. It's easy to set up, mature and stable.

For your requirements:

OpenSearchServer supports Linux and Windows.

SSH is enough for running OpenSearchServer remotely.

You can crawl the website locally and push the index (the data directory of OpenSearchServer) to your remote machine through replication or FTP. For larger indexes, replication is the best option.

It has a PHP client library, so you can easily enable search in your existing or new application.

Upvotes: 2
