Use SDMX-ML as data source #2

Open
csarven opened this issue Dec 18, 2013 · 2 comments

csarven commented Dec 18, 2013

Current mapping is TSV->RDF. SDMX-ML->RDF should be used instead. Brief rationale for the switch (from emails):

Aftab: "What is the added value of moving from TSV->RDF to SDMX-ML->RDF assuming that the resulting triples and properties remain the same?"

Sarven: "There is a more precise mapping, with fewer assumptions, from SDMX-ML to RDF using QB than there is via TSV. With the TSV approach, we hard-code a lot of (good) assumptions about what to do with the field names and cell values. Since QB is historically based on the SDMX information model, and the vocabulary terms are available in SDMX-RDF, it is a simpler way forward. SDMX-ML is also considered to be the source format of these agencies, from which they later generate other formats (e.g., TSV) - AFAIK! The resulting triples at this time are not the same.

Going forward, effort that's put into maintaining TSV->RDF is only good for Eurostat in EU-data-cloud. What we are trying to accomplish with Linked SDMX is to have a "one transformation to rule all statistical data". That's in contrast to writing a custom mapping for each CSV/TSV we encounter. At least that's the general direction."

Richard: "In the long run, I’d certainly prefer to see eurostat.linked-statistics.org use Linked SDMX, because that would move us from a one-off solution to something re-usable that has a better chance of being maintained in the long term."
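
To make the "fewer assumptions" point concrete, here is a rough sketch of why structured input helps: dimensions and the measure are named in the data itself, so nothing has to be guessed from column headers. Everything in it is an assumption for illustration: the XML is a simplified, namespace-less fragment loosely modelled on SDMX-ML generic data (not a valid SDMX-ML message), the `http://example.org/` URIs are hypothetical, and this is not the actual Linked SDMX transformation.

```python
import xml.etree.ElementTree as ET

QB = "http://purl.org/linked-data/cube#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
SDMX_MEASURE = "http://purl.org/linked-data/sdmx/2009/measure#"
BASE = "http://example.org/"  # hypothetical base URI, not the real deployment

# Simplified stand-in for an SDMX-ML generic data message (assumption).
SAMPLE = """
<DataSet id="demo">
  <Series>
    <SeriesKey>
      <Value concept="FREQ" value="A"/>
      <Value concept="GEO" value="IE"/>
    </SeriesKey>
    <Obs>
      <Time>2012</Time>
      <ObsValue value="4.2"/>
    </Obs>
  </Series>
</DataSet>
"""


def observations_to_triples(xml_text):
    """Return (subject, predicate, object) tuples, one group per observation."""
    root = ET.fromstring(xml_text)
    dataset_uri = BASE + "dataset/" + root.get("id")
    triples = []
    for series in root.iter("Series"):
        # The series key names each dimension explicitly.
        key = [(v.get("concept"), v.get("value")) for v in series.find("SeriesKey")]
        for obs in series.iter("Obs"):
            period = obs.findtext("Time")
            path = "/".join([value for _, value in key] + [period])
            obs_uri = f"{dataset_uri}/{path}"
            triples.append((obs_uri, RDF_TYPE, QB + "Observation"))
            triples.append((obs_uri, QB + "dataSet", dataset_uri))
            for concept, value in key:
                # Hypothetical property and code URIs derived from the concept id.
                triples.append((obs_uri, BASE + "property/" + concept,
                                BASE + "code/" + concept + "/" + value))
            # Period and value are kept as plain strings here (no datatypes).
            triples.append((obs_uri, BASE + "property/TIME_PERIOD", period))
            triples.append((obs_uri, SDMX_MEASURE + "obsValue",
                            obs.find("ObsValue").get("value")))
    return triples


for triple in observations_to_triples(SAMPLE):
    print(triple)
```

A TSV version of the same step would need hard-coded knowledge of which columns are dimensions, which cell holds the value, and how to split a combined key, which is where the per-file assumptions come from.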

csarven commented Dec 18, 2013

Some expectations from Linked SDMX:

cygri commented Dec 18, 2013

A good first step would be to adapt the scripts so that they produce RDF data using both the TSV→RDF and SDMX→RDF approaches in parallel. Then we can alert users, adapt example queries where needed, and so on, before completely switching over.
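
While both outputs exist side by side, one way to find the queries that need adapting might be to compare which properties each output uses. A minimal sketch, assuming both outputs have already been loaded as (subject, predicate, object) tuples; the function names and the sample triples are made up for illustration:

```python
def predicates(triples):
    """Collect the distinct predicates used in a list of triples."""
    return {p for _, p, _ in triples}


def compare_outputs(tsv_triples, sdmx_triples):
    """Report which predicates are unique to each output and which are shared."""
    tsv_p = predicates(tsv_triples)
    sdmx_p = predicates(sdmx_triples)
    return {
        "only_tsv": sorted(tsv_p - sdmx_p),
        "only_sdmx": sorted(sdmx_p - tsv_p),
        "shared": sorted(tsv_p & sdmx_p),
    }


# Hypothetical sample data, not taken from the real datasets.
TSV_SAMPLE = [
    ("ex:obs1", "ex:geo", "ex:IE"),
    ("ex:obs1", "ex:value", "4.2"),
]
SDMX_SAMPLE = [
    ("ex:obs1", "ex:geo", "ex:IE"),
    ("ex:obs1", "sdmx-measure:obsValue", "4.2"),
]

report = compare_outputs(TSV_SAMPLE, SDMX_SAMPLE)
print(report)
```

Predicates listed under `only_tsv` are the ones existing example queries are likely to depend on, so they are a reasonable starting point for the user notice.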
