staticdev
staticdev

Reputation: 3060

Error loading DBPedia Turtle in Virtuoso with Docker

I downloaded DBPedia turtle-files with their extraction framework (https://github.com/dbpedia/extraction-framework).

Then I used the docker image tenforce/virtuoso:1.3.2-virtuoso7.2.2 after extracting all ttl.bz2 to a toLoad folder.

After the full load, I checked isql with select * from DB.DBA.LOAD_LIST; and got an error from one of the files:

2 2019.2.2 11:31.43 69876000 2019.2.2 11:31.44 521623000 0 NULL 37000 [Vectorized Turtle loader] SP029: TURTLE RDF loader, line 172395: syntax error

The virtuoso endpoint (http://localhost:8890/sparql) is online, but the queries don't return correct results.

Do you know what I can do to correct this error?

UPDATE: newer tenforce/virtuoso:1.3.2-virtuoso7.2.5.1 and openlink/virtuoso-opensource-7 were also tested giving the same result.

UPDATE 2: Line 172395 of that file seems ok: <http://pt.dbpedia.org/resource/Reino_de_Roma> <http://dbpedia.org/ontology/governmentType> <http://pt.dbpedia.org/resource/Monarquia> <http://pt.wikipedia.org/wiki/Reino_de_Roma?oldid=53119289&ns=0> .

Upvotes: 0

Views: 251

Answers (1)

TallTed
TallTed

Reputation: 9444

As I said in your duplicate post on github --

I would start by using tenforce/virtuoso:latest or tenforce/virtuoso:virtuoso7.2.5 or tenforce/virtuoso:1.3.2-virtuoso7.2.5, all of which should be based on Virtuoso 7.2.5.1 (from August 2018), rather than the rather elderly tenforce/virtuoso:1.3.2-virtuoso7.2.2 which is based on the similarly elderly Virtuoso 7.2.2 (from December 2015).

Upvotes: 1

Related Questions