How to Optimally Group Vectors into Easily Described Groups?

Question

I have a set of vectors of length 4 (represented as a Nx4 matrix) where each element in the vector can take on the value -1, 0 or 1. I want to group the vectors into the smallest number of groups (and therefor the largest groups) such that each group satisfies the following constraint: There must be a vector in the group for each combination of unique elements represented in each column of the vectors in the group. For example, a group containing only the vectors [-1,1,0,1] and [-1,1,1,1] would satisfy the constraint because there is one unique value in each column of the group except the 3rd, which has 2, thus there are 2 possible combinations of the unique values in each column, both of which are represented in the group. However, grouping [1,0,-1,0] and [1,0,0,1] does not satisfy the constraint because there are 2 unique values in each of the 3rd and 4th columns, creating 4 possible combinations, only 2 of which are represented in the group. Adding [1,0,0,0] and [1,0,-1,1] to this group would satisfy the constraint. (Note that any single vector as a group on its own will always satisfy the constraint.)

These groups are "easily described" because you can just list the unique values for each column and that will fully describe the group, to the exclusion of all other vectors.

My first approach was to take the set as a whole, and first check if it already satisfies the constraint. If not, try leaving out one vector at a time and check if the remaining vectors satisfy the constraint. If none of those work then try leaving out all combinations of 2 vectors, then 3, and so on. Each time a particular subset satisfies the constraint, set those vectors aside and repeat the process on the remaining ones until there are none left. While this guarantees an optimal grouping (as far as I can tell), the run time is way too long for any set with more than ~25-30 vectors because you have to potentially check N choose k possible ways of leaving out a subset of vectors for all values of k from 1 to N-1.

I recently realized that you can think of this as more of a geometry problem if you imagine the space of possible vectors as a 3 by 3 by 3 by 3 hypercube where each unit hypercube represents a single vector. Groups that satisfy the constraint are hyper-rectangles (including wrapping around from -1 to 1) in this space which is potentially easier to think about than the original phrasing of the constraint. In this framing of the problem, I am looking for the minimum number of hyper-rectangles such that all vectors are contained in the hyper-rectangles, and no empty spaces exist in any hyper-rectangle. This approach has the promise of not exploding the run time combinatorially, but I haven't been able to come up with a good way of searching through the possible hyper-rectangles.

Does anyone have ideas for a faster algorithm to solve this problem?

How to Optimally Group Vectors into Easily Described Groups?

Answers (1)

Related Questions