Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BE] Test for broken links in harvested assets #42

Open
ioggstream opened this issue Aug 11, 2022 · 1 comment
Open

[BE] Test for broken links in harvested assets #42

ioggstream opened this issue Aug 11, 2022 · 1 comment

Comments

@ioggstream
Copy link
Contributor

ioggstream commented Aug 11, 2022

I expect

Periodically test that accessURLs are valid on schema.gov.it/sparql

prefix dcat: <http://www.w3.org/ns/dcat#> 

select distinct ?url where {
_:a1 dcat:accessURL ?url

} 

Current behavior

no tests

Possible solution

add a gh action running a sparql query and testing all the retrieved URLs

@Clou-dia Clou-dia changed the title Test for broken links in harvested assets [BE] Test for broken links in harvested assets Nov 2, 2023
@ioggstream
Copy link
Contributor Author

This is part of a set of basic tests that should be implemented
to ensure the correct working of the platform.

For example, running this query right now, I found this URL

https://schema.gov.it/lodview/controlled-vocabulary/classifications-for-organizations/ateco-2007WoD
-> https://unece-stardog.ichec.ie/Ateco2007Classification#!/query/

That is referencing a nxdomain.

Probably, accessURL domains should be declared/whitelisted in some way (e.g., to inhibit referencing potentially malicious resources).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: 🆕 New
Development

No branches or pull requests

2 participants