Limiting nodes per label

Question

I have a graph that currently has several thousand nodes, each with between two and ten relationships. A single node and its connections look something like this: [image: a node and its related nodes]

The nodes marked with letters are category nodes. All other nodes are content nodes that have an "associated with" relationship to these category nodes. For simplicity, every node has a single label and is connected to only one other node. The colour of a node denotes its label:

Blue: Categories
Green: Scientific Publications
Orange: General Articles
Purple: Blog Posts

Now, the simplest thing I'm trying to do is get a certain number of content nodes related to a given node. The following returns all twenty related nodes:

START n = node(1)
MATCH (n)-->(category)<--(m)
RETURN m

However, I would like to limit this to 2 nodes per label per category (and afterwards experiment with ordering by nodes that share multiple categories with the starting node).

Currently I'm doing this by getting the results from the above query, and then manually looping through the results, but this feels like redundant work to me.
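For reference, the manual looping described above might look roughly like the following. This is only a sketch: the function name `limit_per_label_per_category`, the `(category, label, node)` row shape, and the sample data are assumptions made for illustration, not part of the actual application or of any Neo4j driver API.

```python
from collections import defaultdict


def limit_per_label_per_category(rows, limit=2):
    """Keep at most `limit` nodes for each (category, label) pair.

    `rows` is an iterable of (category, label, node) tuples, a
    hypothetical flattened form of the query result.
    """
    seen = defaultdict(int)  # (category, label) -> nodes kept so far
    kept = []
    for category, label, node in rows:
        if seen[(category, label)] < limit:
            seen[(category, label)] += 1
            kept.append((category, label, node))
    return kept


# Made-up sample data, not taken from the real graph.
rows = [
    ("A", "BlogPost", 11),
    ("A", "BlogPost", 12),
    ("A", "BlogPost", 13),  # third blog post in category A: dropped
    ("A", "GeneralArticle", 14),
    ("B", "BlogPost", 15),
]
result = limit_per_label_per_category(rows)
```

This is the per-category, per-label cap I would like the query itself to apply.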

Is there a way to do this via Neo4j's Cypher query language?

cybersam · Accepted Answer

This answer extends @Stefan's original answer to return the result for all the categories, not just one of them.

START p = node(1)
MATCH (p)-->(category)<--(m)
WITH category, labels(m) as label, collect(m)[0..2] as nodes 
UNWIND label as lbl
UNWIND nodes AS n
RETURN category, lbl, n

To make the results easier to verify by hand, you can also append the line below to sort them. This sorting should probably not be in your final code unless you really need sorted results and are willing to spend the extra computing time:

ORDER BY id(category), lbl
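To see why the query yields at most two nodes per label per category, it may help to emulate its steps outside the database. The Python sketch below is only an illustration: `emulate_query`, the input shape, and the sample data are made up, and it assumes that `WITH` groups rows by its non-aggregated expressions (here `category` and the whole `labels(m)` list) before `collect(m)[0..2]` keeps the first two nodes of each group.

```python
def emulate_query(matches, per_group=2):
    """Emulate WITH category, labels(m), collect(m)[0..2] plus the two UNWINDs.

    `matches` is a list of (category, labels, node) tuples, where `labels`
    is the node's full label list (a hypothetical input shape).
    """
    groups = {}  # (category, labels) -> collected nodes, in match order
    for category, labels, node in matches:
        groups.setdefault((category, tuple(labels)), []).append(node)

    out = []
    for (category, labels), nodes in groups.items():
        for lbl in labels:                # UNWIND label AS lbl
            for n in nodes[:per_group]:   # UNWIND nodes AS n, after [0..2]
                out.append((category, lbl, n))
    return out


# Made-up sample: three blog posts and one article under category "A".
matches = [
    ("A", ["BlogPost"], 1),
    ("A", ["BlogPost"], 2),
    ("A", ["BlogPost"], 3),
    ("A", ["GeneralArticle"], 4),
]
rows = emulate_query(matches)
```

With single-label nodes, as in the question, each group corresponds to exactly one label. If a node carried several labels, the first `UNWIND` would emit one row per label for it, so it would appear more than once in the output.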
