Reputation: 135
We have a set of sequences with taxi positions. We want to cluster the data by considering the sequential patterns in the data lines. For example: T1, T2, T3, T4 be the travels and a,b,c,d,e be set of places. The data we have is like,
But the problem is the length of the data are not variable. How can we cluster these type of data using EM. Since it does not accept variable length data is there way we can customize it.
Upvotes: -1
Views: 399
Reputation: 77454
EM is a general principle. You can use it with very different models.
Probably the most popular model for EM is Gaussian Mixture Modeling, GMM.
Naturally, if you use covariances, GMM requires a fixed dimensionality.
But if you use other models, there is no reason it cannot work with variable length vectors. For example, there are EM variants that process text data, and text usually does have different length.
Upvotes: 1