-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove xsd:duration datatype from the mappings #145
Comments
Hi @Lars-H!
You're right. The problem here is that real data (the actual GTFS feed) that comes from Madrid Metro provides correctly the URL but the data generator does not support this kind of data. That should be already fixed by VIG generator, with our configuration. We will see what is happening.
Yes! I thought I removed all the datatypes duration, as again, VIG generator does not support them. I'll clean and fix the mappings. Please use the ones from this official GitHub repo (not the ones from kgc-eval which could be not up to date)
This is again a problem with the generator that we rely on. In any case, I'll try to take a look at their code to see if it can be solved (my suspicion here is that they may have the random generator not working very randomly). In any case, there would be nice work to be done on improving the data generator of the benchmark using SHACL constraints |
Hi @dachafra, thanks for the quick reply and clarification 🙂 I'll try using the up-to-date mappings from this repo. Best regards |
Hi @Lars-H, |
Sure, I can do that. I'll make sure to re-run the process with the updated mappings and see which issues remain. Which mappings file should I use to materialize the RDF from a MySQL DB using |
It should be automatically output from the docker I guess. If not, you can use R2RML and Morph-KGC or Ontop instead of the rdfizer https://github.com/oeg-upm/gtfs-bench/blob/master/mappings/gtfs-rdb.r2rml.ttl |
Ok, that worked. The only issue I am seeing now is the mentioned |
no worries, I'll reopen this issue and just change the name |
Describe the bug
Thanks for providing these insightful resources. I have been using them lately and I have encountered some minor issues.
I tried to follow your description from your journal paper to materialize the KG as RDF. I have seen a couple of problems.
rdfizer
leads to non-absolute IRIs in the RDF.arrivalTime
anddepartureTime
is specified asxsd:duration
while the values are not valid durations (under D-entailment).ShapePoints
with the exact same latitude and longitude. (Also, there are only 960 distinct values for latitude and 1000 distinct values for longitude)To Reproduce
sql
directory into a MySQL DB using the provided scriptrdfizer
and the mapping file provided in thekgc-eval
repo. (Seerdfizer
config below)rapper
Expected behavior
The materialized RDF should be valid.
Screenshots or Video
Example of a non-absolute IRI:
Example of an invalid duration value:
Repeated
ShapePoint
geo-location. The following query yields?cnt = 5852988
.Resources (please complete the following information):
Additional material/context
rdfizer config:
Thanks for your support.
The text was updated successfully, but these errors were encountered: