-
Notifications
You must be signed in to change notification settings - Fork 189
[ENH][BEP028] Specification update for BEP028 BIDS-Prov #2099
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
Include comment for ProvEntity versus Entity
|
Hi @rwblair, As discussed earlier, here are the issues I'm currently facing:
Thanks for your help :) |
for more information, see https://pre-commit.ci
|
@bclenet While thinking about the first two points on how to validate certain parts of the files I started to realize the nature of the changes that will be required of the schema to validate them. Here is my understanding of the main rules that this bep wants to enforce that require information outside of what has historically been used for validation, and the issues they raise.:
The following are all similar:
One thing I like about this proposal is that each json file is simple enough to be immediately understood by a human. I was playing around with alternative ways of organizing data from the examples that might be more amenable to the current expression language and they were all much more difficult to read at a glance. The UID in the Ids makes me think this was not meant to be produced or consumed by humans, but I'm a sucker for looking at any json file that comes across my path. Please let me know if I have misunderstood/misinterpreted any of the rules from the BEP. @effigies Any comments on my characterizations of the schema's short comings with respect to the above rules? @bclenet This only sort of addressed your first two issues, for the remaining four:
|
|
|
||
| The Resource Description Framework (RDF) is a method to describe and exchange graph data. | ||
|
|
||
| The terms defined in this part of the BIDS specification are based on the [W3C Prov](https://www.w3.org/TR/2013/REC-prov-o-20130430/) standard. Their relations with W3C Prov terms are defined in the [`provenance-context.json`]() file. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
provenance-context.json isn't mentioned any where else on the page. May want to clarify that it exists as json file in the specification itself like metaschema.json.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, we need to work on this section to be explicit about the context file. Thanks !
| Further datasets are available from | ||
| the [BIDS examples repository](https://bids-website.readthedocs.io/en/latest/datasets/examples.html#provenance). | ||
|
|
||
| ## Overview |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd remove this heading and move the sentences under it to the above section (#provenance), I think the opening sentences of a page are generally understood to be an overview.
|
|
||
| This description is based on the [W3C Prov](https://www.w3.org/TR/2013/REC-prov-o-20130430/) standard. | ||
|
|
||
| ### General principles |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These principles could be moved into the top level section, other sections of the standard (behavioral, phenotypic, etc) use requirement keywords in their opening salvos. I'd also remove the new lines between the sentences.
Co-authored-by: Ross Blair <[email protected]>
Co-authored-by: Ross Blair <[email protected]>
Co-authored-by: Ross Blair <[email protected]>
|
Hi @rwblair, Most of the rules you propose are related to the validation of the provenance metadata itself. I believe that these could be brought later to the validator / schema ? I agree that this is easier to validate once the metadata is aggregated; I understood you found the python code to do that. I also plan to write python code for defects detection in a provenance graph (e.g.: isolated nodes, undefined objects, etc.) which could later help for validation. About Ids, we need to be more precise in the specification about that. The idea is that we want to have BIDS URIs as much as possible, also because it's more human readable. But at the same time, we want to ensure uniqueness of the Id outside the dataset, which is not the case for BIDS URIs as currently defined. Does it make sense to you ? |
This is a work in progress PR proposing a specification update for BEP028 BIDS-Prov.
- [ ] being proofread
- [ ] validator error :
/prov/*NOT_INCLUDED- [ ] validator error :
/prov/*.jsonSIDECAR_WITHOUT_DATAFILE- [ ] validator error : derivative files are listed as NOT_INCLUDED / ALL_FILENAME_RULES_HAVE_ISSUES /FILENAME_MISMATCH / ENTITY_WITH_NO_LABEL