Neo4j Load CSV Import Stalling

Question

After playing around with toy datasets, this was my first attempt to use data that is relevant for a project at work. In short, after limping to get nearly all of my data into Neo4j, my last query simply stalled. See the screenshot.


Note: I was prototyping my queries by pasting them into the browser tool, but my longer-term plan is to keep all of the commands in a .cql file that I can script on my workstation to run nightly analyses.

To add context to my problem, I am prototyping on my MacBook:

8 GB RAM
2.2 GHz Intel Core i7
OS X 10.9.5
Neo4j 2.2.0 Community

The files I am processing are listed below as rows/columns. I am not importing every column; it was just easier to keep my current datasets in check.

Ability.csv = 3/1
brm.csv = 276992/34
cont.sv = 80093/17
email chain.csv = 199143/34 (this is the only data I can't get in)
email first last.csv = 77849/20
recs.csv = 77962/20
templates_topics.csv = 29/3
templates.csv = 49/4
topics.csv = 13/1
vendors = 5/1

The only config options that I set manually for neo4j were in neo4j-wrapper.conf where I set wrapper.java.initmemory and wrapper.java.maxmemory to 4096. I did this after poking around to find similar problems.

I made these changes out of the gate because within the browser, I was getting error messages that the database was disconnected while processing my queries.

Lastly, because my data are work-related, I can't provide test data. I can, however, link to my cypher queries.

Constraint and LOAD CSV .cql file

Any help and advice would be greatly appreciated. I am pretty confident this is user error on my end, but I have definitely hit a wall with respect to what my next steps should be.

ErnestoE · Accepted Answer

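Avoid queries that force eager loading in LOAD CSV. When the query plan contains an Eager operator, the whole file is pulled in before anything is written, so PERIODIC COMMIT is effectively ignored. See this article by Mark Needham for a thorough explanation.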
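
As a rough sketch of what that can look like in practice (I haven't seen your data, so the file path, the labels Person and EMAILED, and the columns sender and recipient are all made up for illustration): first check the plan with EXPLAIN, and if an Eager operator shows up, split the import into passes that each do one simple thing. Run each statement on its own.

```cypher
// ASSUMPTION: path, labels, and column names below are hypothetical placeholders.

// A unique constraint also creates an index, so the MERGE/MATCH lookups are fast.
CREATE CONSTRAINT ON (p:Person) ASSERT p.name IS UNIQUE;

// EXPLAIN shows the plan without running the query.
// If the plan contains an "Eager" operator, this query shape will read
// every row before writing anything.
EXPLAIN
LOAD CSV WITH HEADERS FROM "file:///path/to/email_chain.csv" AS row
MERGE (s:Person {name: row.sender})
MERGE (r:Person {name: row.recipient})
MERGE (s)-[:EMAILED]->(r);

// Pass 1: create sender nodes only.
USING PERIODIC COMMIT 1000
LOAD CSV WITH HEADERS FROM "file:///path/to/email_chain.csv" AS row
MERGE (:Person {name: row.sender});

// Pass 2: create recipient nodes only.
USING PERIODIC COMMIT 1000
LOAD CSV WITH HEADERS FROM "file:///path/to/email_chain.csv" AS row
MERGE (:Person {name: row.recipient});

// Pass 3: the nodes already exist, so MATCH them and MERGE only the relationship.
USING PERIODIC COMMIT 1000
LOAD CSV WITH HEADERS FROM "file:///path/to/email_chain.csv" AS row
MATCH (s:Person {name: row.sender})
MATCH (r:Person {name: row.recipient})
MERGE (s)-[:EMAILED]->(r);
```

Reading the file three times is slower per pass, but each pass should be able to commit in batches instead of holding all rows in memory. It is still worth running EXPLAIN on each pass to confirm that Eager is gone.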
