“DNA Sample ID” should be unique #208

Open
Tracked by #587
pvangay opened this issue Feb 9, 2022 · 2 comments
Assignees
Labels
enhancement New feature or request

Comments

@pvangay

pvangay commented Feb 9, 2022

These values are pre-determined by JGI, but having validation for uniqueness can help in case the user accidentally overwrites the values.

@pvangay pvangay added the enhancement New feature or request label Feb 9, 2022
@turbomam
Member

I guess I should have said that I had anticipated this. Presumably it will apply to RNA Sample ID as well.

DH takes its uniqueness cues from the LinkML identifier attribute on each slot that should become a DH column. Unfortunately, the DH columns represent fields/slots on a single class, and each class can have only one identifier slot.

@cmungall It looks like there are some additional constructs in LinkML for communicating that values must be unique. I'll have to familiarize myself with them.

We could also come up with a new string serialization (SNTC pattern) for cases like this, such as {unique text}.

@sujaypatil96 Either of those techniques will require changes to SNTC (which I consider acceptable up until days before GSP) and/or changes to linkml2dataharmonizer.py, around here (which I was hoping to be more conservative about).

@pkalita-lbl
Collaborator

We can use the unique_keys construct in LinkML for this. DataHarmonizer understands that construct now in its validation code.
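
For illustration, here is a rough sketch of what a unique_keys check amounts to. It is not DataHarmonizer's actual validation code. The dictionary mirrors the shape of a LinkML class definition (`unique_keys` mapping a key name to its `unique_key_slots`), but the slot names, key names, and the `find_unique_key_violations` helper are all hypothetical. Skipping rows with an empty key value is also an assumption of this sketch, not confirmed behavior of LinkML or DH.

```python
from collections import defaultdict

# Hypothetical class definition shaped like a LinkML schema fragment.
# Slot and key names are illustrative, not taken from the real schema.
SCHEMA_CLASS = {
    "slots": ["sample_name", "dna_sample_id", "rna_sample_id"],
    "unique_keys": {
        "dna_sample_id_key": {"unique_key_slots": ["dna_sample_id"]},
        "rna_sample_id_key": {"unique_key_slots": ["rna_sample_id"]},
    },
}


def find_unique_key_violations(class_def, rows):
    """Return {key_name: {value_tuple: [row indexes]}} for duplicated keys."""
    violations = {}
    for key_name, key_def in class_def.get("unique_keys", {}).items():
        slots = key_def["unique_key_slots"]
        seen = defaultdict(list)
        for index, row in enumerate(rows):
            values = tuple(row.get(slot) for slot in slots)
            # Assumption: rows with an empty key slot are not compared.
            if any(value in (None, "") for value in values):
                continue
            seen[values].append(index)
        duplicates = {v: idx for v, idx in seen.items() if len(idx) > 1}
        if duplicates:
            violations[key_name] = duplicates
    return violations


rows = [
    {"sample_name": "s1", "dna_sample_id": "D-001", "rna_sample_id": "R-001"},
    {"sample_name": "s2", "dna_sample_id": "D-002", "rna_sample_id": ""},
    {"sample_name": "s3", "dna_sample_id": "D-001", "rna_sample_id": ""},
]

print(find_unique_key_violations(SCHEMA_CLASS, rows))
# → {'dna_sample_id_key': {('D-001',): [0, 2]}}
```

Because each unique key is declared separately, DNA Sample ID and RNA Sample ID can both be checked on the same class, which the single identifier slot could not express.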

@pkalita-lbl pkalita-lbl transferred this issue from microbiomedata/DataHarmonizer Jul 12, 2024
3 participants