stackmagic

Reputation: 164

Loading (parts of) a Freebase dump into ArangoDb

My web searches didn't turn up anything useful, and maybe no one has done this yet. While I have done some processing of Freebase dumps and have worked with RDF and ArangoDB, my experience is still very limited, and I'd like to hear opinions/suggestions/experiences on the topic.

A few things I'm wondering about:

Some of the challenges I'd be expecting are:

Update

Currently, I go through the dump several times. The steps are roughly as follows:

It works, but it's slow, and it strikes me as inefficient to go through the dump this many times. And there will be more passes through the dump: during/after (a) we discover many more entities that are related to the core entities I'm interested in.

And making millions of requests to the Freebase API won't be much better either.

So that's a bit of background on why I'm interested in this topic; a pre-made solution for this would be nice.

Upvotes: 5

Views: 226

Answers (1)

moonglum

Reputation: 786

A similar thing has been done with data from Wikipedia in this project. I'm not familiar with the Freebase dump, but it should be very similar to a Wikipedia dump, shouldn't it? The steps you would need to do are the following:

  1. Convert the data from Freebase into JSON, in the form in which you would like to store it in your ArangoDB instance.
  2. Use arangoimp to do the import.
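A minimal sketch of step 1 in Python, under some assumptions that are not stated in the thread: the dump is read as tab-separated triples (subject, predicate, object, then a trailing "."), and all triples for one subject appear on consecutive lines. The function names and the document layout (`subject` plus a `properties` map) are made up for illustration; pick whatever shape suits your collections.

```python
import json


def parse_triple_line(line):
    """Split one tab-separated line into (subject, predicate, object).

    Returns None for lines that do not have at least three fields.
    """
    parts = line.rstrip("\n").split("\t")
    if len(parts) < 3:
        return None
    return parts[0], parts[1], parts[2]


def triples_to_documents(lines):
    """Group consecutive triples that share a subject into one dict each.

    Assumption: triples for a subject are adjacent in the input, so only
    one document is held in memory at a time.
    """
    current_subject = None
    doc = None
    for line in lines:
        parsed = parse_triple_line(line)
        if parsed is None:
            continue  # skip malformed lines
        subject, predicate, obj = parsed
        if subject != current_subject:
            if doc is not None:
                yield doc
            current_subject = subject
            doc = {"subject": subject, "properties": {}}
        # A predicate may occur several times, so collect values in a list.
        doc["properties"].setdefault(predicate, []).append(obj)
    if doc is not None:
        yield doc


def write_jsonl(lines, out):
    """Write one JSON document per line to `out`; return the document count."""
    count = 0
    for doc in triples_to_documents(lines):
        out.write(json.dumps(doc) + "\n")
        count += 1
    return count
```

To run it over a gzipped dump, you could open the input with `gzip.open(path, "rt", encoding="utf-8")` and pass the file object as `lines`. For step 2, the resulting file can then be handed to arangoimp, along the lines of `arangoimp --file out.json --type json --collection entities --create-collection true` (the file and collection names here are placeholders; check the arangoimp documentation of your ArangoDB version for the exact options).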

Upvotes: 3
