-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about how to interpret rho and FDR #17
Comments
Hey Taylor! Thanks for your interest in propr, and for asking such detailed questions! I'll try to answer them quickly now, but I also invite follow-up questions. (1)
There is some analogy to draw between the two, and indeed higher values indicate stronger associations. The strength of the associations are defined with respect to the CLR transformation. When rho=1 for a pair i,j -- a plot of OTU i vs. OTU j will give a perfect line where the slope is 1 for OTUs expressed in CLR coordinates. (It can be helpful to think of the CLR as a kind of normalization that scales abundances according to the average for a sample. From this perspective, propr is sort of like a correlation between normalized variables.). (2)
I would read the table something like this -- if the cutoff were 0.65, the FDR, empirically speaking, would be 0.0001. So you can select it very safely! On the other hand, if the cutoff were 0.35, the FDR would be 20%. If you wanted an FDR of, say 5%, the 0.65 cutoff is too conservative. In this case, I would re-run
This will try additional cutoffs from 0.35 to 0.65. One of them may have your target FDR. (3)
Great! I am glad it is useful! FYI we have written a pre-print about the methodological justification for "double CLR" approach here -- https://www.biorxiv.org/content/10.1101/847475v1 ``but I'm curious if I can calculate proportionality between environmental variables as well...we also took measurements like pH, nitrogen, latitude, longitude, etc., and I would like to know if any ASVs/transcripts are proportional to these variables.'' Proportionality is useful for finding associations between components belonging to a composition. Variables like nitrogen, latitude, longitude, etc. are not components, but rather real measurements. In this case, I would CLR transform my OTUs manually, and fit a linear model per usual. The pseudo-code might look something like this:
Or you could popular multivariable ecology methods like RDA from vegan package:
|
Thank you! these explanations were extremely helpful. |
Hi @tpq, thank you for the wonderful CoDa methods you have implemented in propr! I read Microbiome Datasets Are Compositional: And This Is Not Optional at the beginning of my PhD and am excited to be using the methods now.
I've read your the following posts/papers, but I still have some questions about how to interpret and use propr.
Is there an analogy to draw between
propr
and e.g. R^2 from spearman correlation, whereabs(rho@result$propr)
indicates higher proportionality? I know the math is completely different, but I have no intuition for the meaning ofpropr
orlrv
. (I've seen you state that you're skeptical of negative rho, so I'm assuming these should just be ignored.)I also don't understand how to interpret FDR from
updateCutoffs(rho)
. Currently I'm arbitrarily usingrho95 <- rho[">", .95]
because I'm not sure how to interpret this table. I think this is telling me I could use0.65
given that the FDR falls below.05
at this value...but I wanted to check.Lastly, I've seen your implementation for multi-omics data sets here and have it working. I have 16s, ITS, and metatranscriptomic data from the same N samples. I understand how to calculate proportionality between these datasets using your implementation, but I'm curious if I can calculate proportionality between environmental variables as well...we also took measurements like pH, nitrogen, latitude, longitude, etc., and I would like to know if any ASVs/transcripts are proportional to these variables.
The text was updated successfully, but these errors were encountered: