Reputation: 31
I've been trying to find an answer to my question for a week, and I would appreciate it if anyone can help. I've got a list of strings (originally a list of sequences, which can be viewed as a list of strings) and I'd like to find a pattern (which is itself a string) within the strings of this list. Is there any Java library I can use, or any tool (like Weka, which doesn't do this!) that can help me?
Upvotes: 0
Views: 1323
Reputation: 3520
If you want to find patterns that occur frequently in a set of sequences, then you could try "sequential pattern mining" or "sequential rule mining" algorithms.
There are several implementations of these algorithms in my SPMF Java open-source data mining library.
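To give a rough idea of what "frequent pattern" means here, below is a minimal sketch in plain Java. It is not SPMF's API and not a real sequential pattern mining algorithm: it only counts contiguous substrings of one fixed length, whereas sequential pattern mining generally also allows gaps between the items of a pattern. The class name `FrequentSubstrings`, the method `frequent`, and the parameters `length` and `minSupport` are all made up for this illustration.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

// Hypothetical helper for illustration only; not part of SPMF.
final class FrequentSubstrings {
    private FrequentSubstrings() {}

    // Returns every contiguous substring of the given length that occurs in
    // at least minSupport of the input strings, mapped to the number of
    // strings that contain it (its "support").
    static Map<String, Integer> frequent(List<String> sequences, int length, int minSupport) {
        Map<String, Integer> support = new TreeMap<>();
        for (String s : sequences) {
            // Collect each distinct substring once per string, so that a
            // substring repeated inside one string still counts only once.
            Set<String> seen = new HashSet<>();
            for (int i = 0; i + length <= s.length(); i++) {
                seen.add(s.substring(i, i + length));
            }
            for (String p : seen) {
                support.merge(p, 1, Integer::sum);
            }
        }
        // Drop candidates that appear in too few strings.
        support.values().removeIf(count -> count < minSupport);
        return support;
    }
}
```

For example, calling `FrequentSubstrings.frequent(List.of("abcde", "xbcdy", "bcd"), 3, 3)` keeps only the substring `bcd`, because it is the only length-3 substring present in all three strings. A dedicated library is the better choice once you need gapped patterns or large inputs.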
Upvotes: 0
Reputation: 6376
It sounds like you want to find the longest common subsequence of those strings. This is a well-known algorithmic problem that is commonly solved using dynamic programming. See here for various implementations in multiple languages.
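As a rough illustration of the dynamic programming approach, here is a minimal Java sketch for the two-string case. The class name `LcsSketch` and the method `lcs` are just names chosen for this example. Note that it handles only two strings: an exact longest common subsequence over a whole list of strings is a much harder problem in general, so treat this as a starting point rather than a full solution to the question.

```java
// Hypothetical helper for illustration only.
final class LcsSketch {
    private LcsSketch() {}

    // Returns one longest common subsequence of a and b.
    // If several exist, which one is returned depends on the tie-breaking below.
    static String lcs(String a, String b) {
        int n = a.length();
        int m = b.length();
        // len[i][j] = LCS length of the first i chars of a and the first j chars of b.
        int[][] len = new int[n + 1][m + 1];
        for (int i = 1; i <= n; i++) {
            for (int j = 1; j <= m; j++) {
                if (a.charAt(i - 1) == b.charAt(j - 1)) {
                    len[i][j] = len[i - 1][j - 1] + 1;
                } else {
                    len[i][j] = Math.max(len[i - 1][j], len[i][j - 1]);
                }
            }
        }
        // Walk back from the bottom-right corner to recover the subsequence.
        StringBuilder out = new StringBuilder();
        int i = n;
        int j = m;
        while (i > 0 && j > 0) {
            if (a.charAt(i - 1) == b.charAt(j - 1)) {
                out.append(a.charAt(i - 1));
                i--;
                j--;
            } else if (len[i - 1][j] >= len[i][j - 1]) {
                i--;
            } else {
                j--;
            }
        }
        // Characters were collected from the end, so reverse them.
        return out.reverse().toString();
    }
}
```

Calling `LcsSketch.lcs("ABCBDAB", "BDCABA")` returns a common subsequence of length 4. The table uses O(n*m) time and memory, which is fine for short strings but may need a more memory-efficient variant for long sequences.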
Upvotes: 2