How to topologically sort this data struct

Question

I'm playing around with figuring out some dependency stuff, and I got far, but now I'm stuck.

Let's say I have a data struct like this:

map<string, vector<string>> deps;

where each key in the map is a dependee node, and the value at that key is a list of nodes on which the dependee depends.

Further, let's say the map has 4 keys (A, B, C, and D) with the following dependency entries:

I'm looking for an algorithm (some topological sort?) which will yield a vector of strings such that the strings appear in this order:

F, B, C, D, A
0  1  2  3  4

This list represents the order in which the dependencies should be evaluated.

Galik · Accepted Answer

I recently came up with a solution for this based on this algorithm:

This is a slightly modified version for your data structure:

#include <algorithm>
#include <iostream>
#include <map>
#include <string>
#include <vector>

/**
 * Performs dependency resolution using
 * a topological sort
 */
template<typename ValueType>
class Resolver
{
public:
    using value_type = ValueType;
    using value_vec = std::vector<value_type>;
    using value_map = std::map<value_type, value_vec>;

private:
    value_vec seen;
    value_map deps;

    void resolve(value_type const& d, value_vec& sorted)
    {
        seen.push_back(d);
        for(auto const& nd: deps[d])
        {
            if(std::find(sorted.begin(), sorted.end(), nd) != sorted.end())
                continue;
            else if(std::find(seen.begin(), seen.end(), nd) == seen.end())
                resolve(nd, sorted);
            else
            {
                std::cerr << "Circular from " << d << " to " << nd << '\n';
                continue;
            }
        }
        sorted.push_back(d);
    }

public:

    /**
     * Clear the resolver ready for new
     * set of dependencies.
     */
    void clear()
    {
        seen.clear();
        deps.clear();
    }

    /**
     * Items that don't depend on anything
     */
    void add(value_type const& a)
    {
        deps[a];
    }

    /**
     * Item a depends on item b
     */
    void add(value_type const& a, value_type const& b)
    {
        deps[a].push_back(b);
    }

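    value_vec resolve()
    {
        // Reset the visited list so resolve() can be called more than once
        // without reporting false circular dependencies.
        seen.clear();

        value_vec sorted;
        for(auto const& d: deps)
            if(std::find(sorted.begin(), sorted.end(), d.first) == sorted.end())
                resolve(d.first, sorted);
        return sorted;
    }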
};

int main()
{
    Resolver<std::string> resolver;

    resolver.add("A", "B");
    resolver.add("A", "C");
    resolver.add("A", "D");

    resolver.add("B", "F");

    resolver.add("C", "B");
    resolver.add("C", "F");

    resolver.add("D", "C");

    resolver.add("F");

    for(auto const& d: resolver.resolve())
        std::cout << d << '\n';
}

Output:

F
B
C
D
A

Please let me know if you find any bugs (not very well tested yet).
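
One way to test a result, as a rough sketch: check that every node comes after everything it depends on. The helper name is_valid_order is hypothetical and not part of the Resolver above, and it assumes the dependencies are held in a std::map from std::string to std::vector of std::string.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical helper (not part of the Resolver class): returns true if
// every key in `deps` appears in `order` after all of its dependencies.
bool is_valid_order(std::map<std::string, std::vector<std::string>> const& deps,
                    std::vector<std::string> const& order)
{
    // Record the position of each node in the proposed order.
    std::map<std::string, std::size_t> pos;
    for(std::size_t i = 0; i < order.size(); ++i)
        if(!pos.emplace(order[i], i).second)
            return false; // a node appears twice

    for(auto const& entry: deps)
    {
        auto node = pos.find(entry.first);
        if(node == pos.end())
            return false; // a key is missing from the order

        for(auto const& dep: entry.second)
        {
            auto d = pos.find(dep);
            if(d == pos.end() || d->second >= node->second)
                return false; // dependency missing or placed too late
        }
    }
    return true;
}
```

With the dependencies from main() above, the order F, B, C, D, A should pass this check, while any order that puts A before F should fail it.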

Added from the comments:

For efficiency in production code: if the node type (string, in this example) can carry a flag that marks the node as seen/sorted, the calls to std::find can be replaced by setting and checking that flag. In this example Galik couldn't do that, which is why std::find is used instead. - @Dess
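
When the node type can't carry a flag itself, a side table can play the same role. The following is a minimal sketch of that idea, not Galik's code: the names DepMap, Mark, visit and topo_sort are made up for illustration, and the choice to return an empty vector on a cycle is an assumption (it can't be told apart from an empty input).

```cpp
#include <cassert>
#include <map>
#include <string>
#include <unordered_map>
#include <vector>

// Assumed shape of the dependency data: node -> nodes it depends on.
using DepMap = std::map<std::string, std::vector<std::string>>;

// Per-node state kept in a side table, since std::string has no room for a flag.
enum class Mark { in_progress, done };

// Depth-first visit; the `marks` lookups replace the linear std::find calls.
// Returns false if a cycle is reachable from `node`.
bool visit(std::string const& node, DepMap const& deps,
           std::unordered_map<std::string, Mark>& marks,
           std::vector<std::string>& sorted)
{
    auto found = marks.find(node);
    if(found != marks.end())
        return found->second == Mark::done; // in_progress here means a cycle

    marks[node] = Mark::in_progress;

    auto entry = deps.find(node);
    if(entry != deps.end())
        for(auto const& dep: entry->second)
            if(!visit(dep, deps, marks, sorted))
                return false;

    marks[node] = Mark::done;
    sorted.push_back(node); // all dependencies are already in `sorted`
    return true;
}

// Returns an evaluation order, or an empty vector if a cycle is found.
std::vector<std::string> topo_sort(DepMap const& deps)
{
    std::unordered_map<std::string, Mark> marks;
    std::vector<std::string> sorted;
    for(auto const& entry: deps)
        if(!visit(entry.first, deps, marks, sorted))
            return {};
    return sorted;
}
```

Each node is looked up in the hash table a constant number of times per edge, so the work grows with the number of nodes plus dependencies rather than with repeated scans of the seen and sorted vectors.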
