Can co-occurance of word be calculated using R/ python/ Map reducer?

Question

I have a huge database of 180 columns and 200,000 rows. To illustrate in a better way, I have a matrix of 180 x 200000. Each matrix is a single digit number. I need to find their co-occurrence count. For example I have a data of 5 columns having values 1,2,3,4,5. I need to find the number of times (1,2),(1,3),(1,4),(1,5),(2,3),(2,4),(2,5),(3,4),(3,5),(4,5) have occurred in the database. Can you please suggest me an approach to this problem? I have an exposure to R and python. So any suggestion using those will really help. Can this also be done using AWS map reducer? Any help or pointers on those lines would also be helpful.

Can co-occurance of word be calculated using R/ python/ Map reducer?

Answers (1)

Related Questions