Reputation: 131
I am looking for a good way to store up to 20 terabytes of data (social media postings, Twitter data, etc.) in the cloud and gradually feed it into Elasticsearch (to enable faceted searching) so that it can be quickly searched. I was going to break this into two steps: saving the data to storage, then indexing it (the next day or the next month). I have seen mention of Redis. Would it be appropriate here? Would it be better to use AWS and S3, or Google, for this? Is there a better way to do this than using Redis? Once the data is indexed, I don't need the original data anymore.
Upvotes: 1
Views: 900
Reputation: 165
AWS is a natural fit: data transfer into S3 is free, so uploading your 20 TB costs nothing in bandwidth (you pay only for storage and requests). AWS offers hosted Elasticsearch and hosted Redis (ElastiCache), or you can host your own on EC2. Redis is an in-memory key-value store and is not well suited for faceted search, whereas Elasticsearch is a persisted document store built precisely for search and aggregation.
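For the first step (getting the raw data into S3), a minimal upload sketch with boto3 might look like the following; the bucket name, key prefix, and file name are all hypothetical:

```python
# Minimal sketch: upload one batch of raw postings to S3 as
# newline-delimited JSON. Assumes boto3 is installed and AWS
# credentials are configured; names below are hypothetical.
import boto3

s3 = boto3.client("s3")

# upload_file handles multipart uploads automatically for large files.
s3.upload_file(
    Filename="postings-2016-01-01.ndjson",   # local batch file (hypothetical)
    Bucket="my-social-media-archive",        # hypothetical bucket
    Key="raw/2016/01/01/postings.ndjson",    # date-based prefix keeps batches easy to find
)
```

A date-based key prefix like this makes it simple to index (and later clean up) one day's worth of data at a time.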
If you enable S3 event notifications, a file-creation event can trigger an AWS Lambda function, written in Python or another language, that automatically reads your data whenever a new file appears and inserts it using the Elasticsearch HTTP API. The first 1 million Lambda executions per month are free. The Elasticsearch index mapping lets you choose which fields will automatically be indexed for search.
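A minimal sketch of such a handler, assuming the elasticsearch Python client is packaged with the function and the domain's access policy accepts requests from the Lambda's role without extra signing (otherwise requests must be SigV4-signed); the endpoint and index names are hypothetical:

```python
# Sketch of an S3-triggered Lambda handler that bulk-indexes
# newline-delimited JSON into Elasticsearch. Endpoint and index
# names are hypothetical.
import json
import urllib.parse

import boto3
from elasticsearch import Elasticsearch, helpers

ES_HOST = "https://search-mydomain.us-east-1.es.amazonaws.com"  # hypothetical endpoint
es = Elasticsearch(hosts=[ES_HOST])
s3 = boto3.client("s3")

def handler(event, context):
    # S3 invokes Lambda with one or more records per event.
    for record in event["Records"]:
        bucket = record["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in the event payload.
        key = urllib.parse.unquote_plus(record["s3"]["object"]["key"])

        # Stream the newly created file and index one document per line.
        # (Older Elasticsearch versions also require a "_type" field.)
        body = s3.get_object(Bucket=bucket, Key=key)["Body"]
        actions = (
            {"_index": "postings", "_source": json.loads(line)}
            for line in body.iter_lines()
            if line
        )
        helpers.bulk(es, actions)
```

Bulk indexing rather than one HTTP request per document matters at this volume; it is the difference between the backfill taking days and taking weeks.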
After you are finished with the data, delete it or change its storage class to Infrequent Access or Reduced Redundancy to save on your bill. I use http://www.insight4storage.com/ to lower my S3 costs by tracking my storage usage trends.
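Rather than changing storage classes or deleting objects by hand, an S3 lifecycle rule can do the tiering and eventual deletion automatically. A sketch against the same hypothetical bucket, with assumed 30/90-day thresholds you would tune to your indexing lag:

```python
# Sketch: lifecycle rule that moves raw data to Infrequent Access
# after 30 days and deletes it after 90, once indexing is done.
# Bucket name and day counts are hypothetical.
import boto3

s3 = boto3.client("s3")
s3.put_bucket_lifecycle_configuration(
    Bucket="my-social-media-archive",  # hypothetical bucket
    LifecycleConfiguration={
        "Rules": [
            {
                "ID": "tier-then-expire-raw-data",
                "Filter": {"Prefix": "raw/"},   # only the raw dumps, not other keys
                "Status": "Enabled",
                "Transitions": [
                    {"Days": 30, "StorageClass": "STANDARD_IA"}
                ],
                "Expiration": {"Days": 90},
            }
        ]
    },
)
```

Since you said you don't need the originals once they are indexed, the expiration rule keeps the bucket from silently accumulating 20 TB of storage charges.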
Upvotes: 1