Skip to content

Conversation

@pvgenuchten
Copy link
Contributor

@pvgenuchten pvgenuchten commented Oct 28, 2025

This PR suggests the introduction of rdflib to parse dcat records in either json-ld, xml or turtle serialisation.

  • if json and no context -> add a dcat-ap context
  • let rdflib detect derialisation and parse document to graph
  • parse the graph using element matching or sparql
  • parse wkt-bbox using shapely

We can later extend this work to:

  • add a serialisation parameter to the write method, to write out the knowledge graph in either json-ld, rdf+xml, or turtle (when converting from mcf to dcat
  • use rdflib when importing schema.org (in various serialisations)
  • parse Dublin Core records using this plugin

The PR introduces rdflib and shapely dependency, can also run conditionally, if import is available

@pvgenuchten pvgenuchten marked this pull request as draft October 28, 2025 14:32
@pvgenuchten pvgenuchten marked this pull request as ready for review November 14, 2025 13:04
@pvgenuchten pvgenuchten force-pushed the dcat-import_support branch 2 times, most recently from 6b3ba32 to 13d5d22 Compare November 14, 2025 21:03
@pvgenuchten pvgenuchten changed the title initial commit on import dcat records import functionality for dcat records, using rdflib Nov 27, 2025
draft - test still failing
introduces rdflib dependency, can also conditinally import
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant