Reputation: 3669
I'm logging millions of small log documents weekly to do:
My first thought was to put all the docs in HBase/HDFS and run Hadoop jobs to generate the stats.
The problem is that some of the results must be near real-time.
So, after some research, I discovered ElasticSearch, and now I'm thinking about transferring all the millions of documents into it and using DSL queries to generate the stats.
Is this a good idea? ElasticSearch seems to handle millions/billions of documents quite easily.
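To make the idea concrete, here is a minimal sketch of the kind of DSL query body this would involve: a time-bucketed count of recent log documents, split by one field. The field names `@timestamp` and `level`, the function name `build_stats_query`, and the default window and interval are all assumptions for illustration, not part of the actual log schema. It also assumes a recent Elasticsearch version where `date_histogram` accepts `fixed_interval`.

```python
import json


def build_stats_query(since="now-1h", interval="1m"):
    """Build a search body that buckets recent log docs by time, then by level.

    Field names (@timestamp, level) are hypothetical placeholders.
    """
    return {
        # Only the aggregation results are needed, not the matching documents.
        "size": 0,
        # Restrict to recent documents using Elasticsearch date math.
        "query": {"range": {"@timestamp": {"gte": since}}},
        "aggs": {
            "per_interval": {
                "date_histogram": {
                    "field": "@timestamp",
                    "fixed_interval": interval,
                },
                # Sub-aggregation: document count per level within each bucket.
                "aggs": {"by_level": {"terms": {"field": "level"}}},
            }
        },
    }


if __name__ == "__main__":
    print(json.dumps(build_stats_query(), indent=2))
```

The resulting JSON would be sent as the body of a `_search` request against whichever index holds the logs. Whether this is fast enough for the near real-time part depends on index size, mappings, and hardware, so it would need to be measured.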
Upvotes: 10
Views: 9214
Reputation: 8705
Upvotes: 11