Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Errors of importing fitting data #1

Open
lilihub opened this issue Apr 3, 2022 · 5 comments
Open

Errors of importing fitting data #1

lilihub opened this issue Apr 3, 2022 · 5 comments

Comments

@lilihub
Copy link

lilihub commented Apr 3, 2022

Hi Crgagne,

This is Lili, a PhD student from Dublin City University. I'm really interested in the paper you published on eLief and am so impressed by the modeling analysis you made.

It was said in the left plot of figure 3 that the group mean learning rates are in logit space. I'm confused because I thought if it's in logit, the values should all be in the range 0 to 1. So I am trying to replicate your plots in figure 3. However, it gave me errors when I was trying to import the pickle files of the fitting results (please see the following error codes in the Figures_3-4_Behavioral_Model_Exp1_Results.ipynb file and the screenshot of the error information). I have followed the tutorial created the conda environment and installed the related packages. Could not figure out what was wrong on my side. Could you please help with this? Or at least could you please explain a bit the meaning of the logit space? I'm doing the modeling using Rstan apart from replicating your results with your pymc3 codes, so wanna make sure the Rstan models are correct by comparing the posterior distribution reported in your paper.

model_name = 'model=11_covariate=Bi3itemCDM_date=2020_4_27_samples=2000_seed=3_exp=1.pkl'
with open('../fitting_behavioral_model/model_fits/'+model_name, "rb" ) as buff:
model_output = pickle.load(buff)

image
image

@crgagne
Copy link
Owner

crgagne commented Apr 4, 2022

Hi Lili,

Thanks for reaching out and for your interest in our work. To answer the question about the logits in Figure 3.

  • I might be using the non-standard terminology here. To me, since the logit transformation is log(p / 1-p) and maps learning rates from [0,1] to the reals R, the logit space meant that the values were in R (i.e., unconstrained). To go from R to [0,1], I used inverse-logit (or logistic) transform, so that might be called inverse-logit space. So in Figure 3, I'm plotting the values before I use the inverse-logit transform, so they'll be in R and not constrained to be between 0 and 1. Happy to clarify if that doesn't make sense.
  • I'm stumped by what might be occurring in the import error. It might be an issue with the way theano was compiled. Have you looked into issues like (I'm getting an ImportError in Theano in Anaconda Theano/Theano#6532). I'll think about this a bit more and see if I can come up with anything. Sorry about this issue.

Sincerely,
Chris

@lilihub
Copy link
Author

lilihub commented Apr 4, 2022

Hi Chris,

Sorry for taking your first name wrong :) and thank you so much for your reply! I really appreciate your time explaining this.

Does it mean you used the posterior samples of the baseline and other learning rates directly without doing any transformation to plot them in Figure 3?
In the modeling files, e.g. the following sentence in models_1.py under model_code folder, lr_t is the learning rate which was mapped to (0,1) using inverse-logit transform, but lr_baseline, lr_goodbas, etc were not converted to logit range, so they are actually in the R space. You took the posterior sample of these parameters directly and plotted them in Figure 3. Am I understanding it correctly? This question is important to me because I am trying to replicate your models in Rstan and testing them on a smaller sample. The range of the posterior distribution I obtained for the baseline learning rate is much larger. But in your results, all learning rates components are perfectly constrained in (0,1) or (-1,0), which makes me doubt the correctness of my implementation.
lr_t = lr_baseline +
outcome_valence_tlr_goodbad +
stabvol_t
lr_stabvol +
rewpain_tlr_rewpain +
outcome_valence_t
stabvol_tlr_goodbad_stabvol +
outcome_valence_t
rewpain_tlr_rewpain_goodbad +
stabvol_t
rewpain_tlr_rewpain_stabvol +
outcome_valence_t
stabvol_trewpain_tlr_rewpain_goodbad_stabvol
lr_t = pm.invlogit(lr_t)

Another question about model 1 is that it was mentioned in the paper that there were only three parameters in model 1 but in model 1.py, there are four extra parameters lr_c_t, eps_t, Amix_t, Bc_t and I could not find the risk parameter.

Thanks for the shared link. I will check it and see if I can figure the error out.

Very sorry that I have so many questions and bothering you again.

Best,
Lili

@lilihub
Copy link
Author

lilihub commented Apr 4, 2022

Hi Chris,

These are the group-level posterior distributions (without doing any transformation on the posterior samples) of the learning rate components I obtained for a much smaller sample. As you see, their ranges are much larger, so that's why I'm wondering if you did any transformation (didn't see any transformation in the plotting code though).
image

Thanks again,
Lili

@crgagne
Copy link
Owner

crgagne commented Apr 5, 2022

Hi Lili,

No problem at all for asking questions. I'm happy that you are interested in the models/code. In looking into your question about model_1.py, it looks like the file was misnamed in the version of the repo that you have. My sincerest apologies for that -- when I uploaded the repo to github for sharing, I changed the file names to be more intuitive and accidentally swapped the names for models_1.py and models2thr9_11.py. I adjusted this and re-pushed the repo, so the now current version is correct.

I'm running the model on my computer now and it seems to be running without issue. What's your email? I'll send you the environment.yml in case you want to try installing the conda environment that way. It won't upload here for some reason.

For fIgure 3a, I'm not doing any transformations. For figure 3b, I add the samples for the components up, using +1 -1 depending on the condition, with the formula:
lr_t = lr_baseline +
outcome_valence_tlr_goodbad +
stabvol_tlr_stabvol +
rewpain_tlr_rewpain + ....

And then inverse-logit transform.

There's a few reasons that you might be seeing larger ranges of values than for you model. One might be because we also have separate components (e.g. lr_baseline, etc.) that get multiplied by the factor scores, so if you don't have these, then the basic components might have a larger range. The range is also probably highly depend on the number of participants -- with more participants, the range tends to be smaller for these group level parameters, because there is more evidence for the mean across participants (e.g. for lr_baseline).

@lilihub
Copy link
Author

lilihub commented Apr 5, 2022

Hi Chris,

Thank you so much for your quick reply. It will be great if you can share the environment file. Thanks for updating the repo!
my email: [email protected]

"lr_baseline multiplied by the factor score" refers to the following equation, right?
image

I do have sth similar to this in my model. Here A_base_cov corresponds to the weight parameters (beta_g, beta_d, etc) and cov corresponds to the clinical factors (not the same factors as in your paper, they are the clinical scores for the Parkinson subjects in our study instead. mu_pr corresponds to the mu in your equation.

image

These clinical factors are standardized before fitting to the model). I do agree with the second possible reason. I only have 20 participants tested on this model, which is much smaller than that of yours. I'm so happy to confirm with you about this. At least I got a bit more confidence about my implementation:).

Have a nice day!

Best,
Lili

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants