Neo4j Query Optimization for Cartesian Product

Question

I am trying to implement a user-journey analytics solution. Simply analyze on which screens, which users leave the application. For this , I have modeled the data like this: I modeled single activity since I want to index some attributes. Relation attributes can not be indexed in Neo4j.

With this model, I am trying to write a query that follows three successive event types with below query:

MATCH (eventType1:EventType {eventName:'viewStart-home'})<--(event:EventNode)
<--(eventType2:EventType{eventName:'viewStart-payment'})  

WITH distinct event.deviceId as eUsers, event.clientCreationDate as eDate

MATCH((eventType2)<--(event2:EventNode)
<--(eventType3:EventType{eventName:'viewStart-screen1'}))

WITH distinct event2.deviceId as e2Users, event2.clientCreationDate as e2Date
RETURN e2Users limit 200000

And the execution plan is below:

I could not figure the reason of this process out. Can you help me?

Neo4j Query Optimization for Cartesian Product

Answers (1)

Related Questions