Store object_id with links if available #57

oruebel · 2022-12-24T13:40:32Z

Fix #54

This PR updates the storage of links/references to add the following information:

object_id: Object id of the reference object. May be None in case the referenced object does not have and assigned object_id (e.g., in the case we reference a dataset with a fixed name but without and assigned data_type (or neurodata_type in the case of NWB).
source_object_id: Object id of the source Zarr file indicated by the source key. The source should always have an object_id (at least if the source file is a valid HDMF formatted file).

TODO:

Updated the ZarrReference class to add a source_object_id and object_id keys
Updated ZARRIO.__get_ref to populate the source_object_id and object_id keys
Updated the storage documentation to document the source_object_id and object_id keys and update examples
Update CHANGELOG
Update tests

tests/unit/test_io_zarr.py

codecov-commenter · 2022-12-24T15:13:03Z

Codecov Report

All modified lines are covered by tests ✅

Comparison is base (e31f6a3) 85.66% compared to head (84240fb) 85.76%.

Additional details and impacted files

@@            Coverage Diff             @@
##              dev      #57      +/-   ##
==========================================
+ Coverage   85.66%   85.76%   +0.09%     
==========================================
  Files          13       13              
  Lines        3139     3189      +50     
==========================================
+ Hits         2689     2735      +46     
- Misses        450      454       +4

Files	Coverage Δ
src/hdmf_zarr/backend.py	`90.55% <100.00%> (+0.14%)`	⬆️
src/hdmf_zarr/utils.py	`96.49% <100.00%> (+0.69%)`	⬆️
tests/unit/base_tests_zarrio.py	`98.54% <100.00%> (+0.04%)`	⬆️

... and 1 file with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

oruebel · 2023-09-28T18:49:51Z

@mavaylon1 we should check whether this PR now also works with the fixes in #120 It would be nice we could include this in the release as well if it works. Otherwise, its fine to move this PR to the next release, but would be nice to push this over the finish line.

oruebel · 2023-10-01T11:29:42Z

Aside from adding/updating unit tests to check that the values for the object_id and source_object_id fields are correct, this PR should be ready.

oruebel · 2023-10-01T12:14:46Z

adding/updating unit tests to check that the values for the object_id and source_object_id fields are correct

Done

mavaylon1 · 2023-10-01T16:57:38Z

@oruebel I wanted to as about the case when the object_id is None. You said that would be the case when the data_type/neuro_datatype is not assigned for a dataset. I was thinking of an example of what that would be. Say we have TimeSeries, which is a dataset. This has a type, but it also contains a dataset "data" that does not. This would be an example where we have "object_id" as none if that was the target correct?

oruebel · 2023-10-01T19:06:44Z

. This has a type, but it also contains a dataset "data" that does not. This would be an example where we have "object_id" as none if that was the target correct?

Correct. For TimeSeries.data the object_id would be None because it is just a dataset within a type. However, the source_object_id should always be present since the file is always represented by a Container. The object_id and source_object_id are not really being used right now, but will be useful to validate links and possibly in the future to be able to retrieve external links dynamically.

Store object_id with links if available

505fe99

oruebel mentioned this pull request Dec 24, 2022

Save object id's as part of links and references #54

Closed

oruebel added 3 commits December 24, 2022 06:14

Store source_object_id for references

a01fd35

Update CHANGELOG

c881e7d

Update tests to pass

5e70553

oruebel commented Dec 24, 2022

View reviewed changes

tests/unit/test_io_zarr.py Outdated Show resolved Hide resolved

oruebel added category: enhancement improvements of code or code behavior priority: medium non-critical problem and/or affecting only a small set of users labels Jan 6, 2023

oruebel added this to the Next Release milestone Jan 6, 2023

Merge branch 'dev' into enh/add_oid_to_link_format

cd393e4

oruebel added 3 commits October 1, 2023 04:08

Merge branch 'dev' into enh/add_oid_to_link_format

372cbf3

Update changelog

3f4d0bb

Update changelog

6a6a814

Add test to check object_id and source_object_id on references

c94f9a3

Fix flake8

aacf7d4

oruebel marked this pull request as ready for review October 1, 2023 12:23

oruebel requested a review from mavaylon1 October 1, 2023 12:23

Merge branch 'dev' into enh/add_oid_to_link_format

31521da

mavaylon1 approved these changes Oct 1, 2023

View reviewed changes

Merge branch 'dev' into enh/add_oid_to_link_format

84240fb

mavaylon1 merged commit 0be6b04 into dev Oct 1, 2023
20 checks passed

mavaylon1 deleted the enh/add_oid_to_link_format branch October 1, 2023 21:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Store object_id with links if available #57

Store object_id with links if available #57

oruebel commented Dec 24, 2022 •

edited

Loading

codecov-commenter commented Dec 24, 2022 •

edited

Loading

oruebel commented Sep 28, 2023

oruebel commented Oct 1, 2023

oruebel commented Oct 1, 2023

mavaylon1 commented Oct 1, 2023

oruebel commented Oct 1, 2023

Store object_id with links if available #57

Store object_id with links if available #57

Conversation

oruebel commented Dec 24, 2022 • edited Loading

codecov-commenter commented Dec 24, 2022 • edited Loading

Codecov Report

oruebel commented Sep 28, 2023

oruebel commented Oct 1, 2023

oruebel commented Oct 1, 2023

mavaylon1 commented Oct 1, 2023

oruebel commented Oct 1, 2023

oruebel commented Dec 24, 2022 •

edited

Loading

codecov-commenter commented Dec 24, 2022 •

edited

Loading