-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: log duplicated ids to the column of rxnRetired #666
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Except the pair of MAR03784 and MAR03783 is not documented. We also performed this kind of documentation in the #646 and should performed in every PR with removing duplicates both for Rxns and Mets
@@ -819,7 +819,7 @@ rxns rxnKEGGID rxnBiGGID rxnEHMNID rxnHepatoNET1ID rxnREACTOMEID rxnRecon3DID rx | |||
"MAR04232" "" "" "RE0549C" "" "" "RE0549C" "MNXR103449" "HMR_4232" "RCR10536" "" 0 "" "" "HMR_4232" | |||
"MAR04233" "R03889" "" "" "r0645" "" "r0645" "MNXR95796" "HMR_4233" "RCR11396" "" 0 "" "RHEA:14469" "HMR_4233" | |||
"MAR04235" "R01938" "AMCOXO" "" "r0449" "" "AMCOXO" "MNXR95807" "HMR_4235" "RCR11397" "" 0 "" "" "HMR_4235" | |||
"MAR04243" "R02487" "GLUTCOADHm" "R02487M" "r0541" "" "GLUTCOADHm;r0541" "MNXR100293" "HMR_4243;HMR_4242" "RCR11398;RCR14356" "" 0 "" "RHEA:30847" "HMR_4243;HMR_4242" | |||
"MAR04243" "R02487" "GLUTCOADHm" "R02487M" "r0541" "" "GLUTCOADHm;r0541" "MNXR100293" "HMR_4243;HMR_4242;MAR04242" "RCR11398;RCR14356" "" 0 "" "RHEA:30847" "HMR_4243;HMR_4242" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am not sure about the reason this one appears not in the last column
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good spot! this is actually my mistake, now is corrected in 9ce2c5c
thank you for the careful inspection that really helps in avoiding mistakes!
yes, we probably can follow this practice for all pending PRs and onward. What do you think? |
From my side, this requires a bit more thinking. Let's take
|
no immediate answer, but this can definitely be figured out given the radically transparent history of data/code in Human-GEM (an investigation to the
no, the current action doesn't claim to solve everything, the aim is to move one step forward, and hopefully in the right direction.
I don't get your point (there is no new column added so far), please clarify what do you mean
agree |
To me, the purpose of the newly added column should be clear - almost self-explanatory even to a new user. This duplication of data to me signals that the purpose is not clear enough.
The proposed solution is indeed a step forward. The question is, should it end up immediately in
As a general logic, the TSV files contain a mapping between Human-GEM ids and ids in other databases/models. The newly introduced column is not following this logic, so I am questioning if it really belongs in this file. |
it should be noted that there is no new column added so far @mihai-sysbio |
Indeed! Sorry about the mistake, I relied on the online diff too much - it makes it look like the new column is called |
now figured out that the introduction of column
both columns
so far no new column has been introduced. Regarding the duplication between columns |
based on discussions on #615, from this PR (#666) and #663 onward, we started to document removed duplicate reactions and metabolites by appending their IDs to the rows of kept ones in the columns
|
The discussion about the implementation has yet to lead to a concrete description of the solution (= an issue). In my view tentative implementations have their place in feature branches, not in develop.
I see, yet I want to remember that this
There is no documentation of this practice in CONTRIBUTING.md, and we cannot expect new users to randomly stumble upon the discussion/PR. |
In my view, tentative implementation can be merged into develop if discussed (#615) and approved (#666). Of course, discussion and further change is still open as always
good point - go ahead to modify CONTRIBUTING.md please |
Sorry, I can't do that, as I don't fully understand how the different columns are meant to be used. |
Main improvements in this PR:
This PR improves documentation of deprecated reactions in #613 by adding their ids to corresponding rows in
reactions.tsv
, as a tentative implementation according to the discussion in #615Note: all the pairs listed in #607 have been recorded except
MAR00483 vs. MAR00482/MAR00449/MAR01169
, because I don't know how to document this irregular pairI hereby confirm that I have:
develop
as a target branch