-
Notifications
You must be signed in to change notification settings - Fork 81
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Duplicated locations when submitting via ODK #792
Comments
502 error is due to same Unicode error as #770; Django cannot handle character encoding and uWsgi terminates. |
Thanks @amplifi. The difference with #770 is that I have permissions to see resources in that project. However, main issue is how those locations have been duplicated. I am not able to reproduce this again, but it'd be nice to know whether this is a corner case or not, or if we should be worried about this. I tried to reproduce this again many times but I didn't see it. Maybe if you can take a quick look at the logs at that timestamp (20161005 - 184500 CEST)? (not at the 502 error, but when the locations were duplicated). Thanks! |
Ok, it happened again. These are the steps I've followed:
Link to the project: You can see two clusters of 3 locations each. Those are exactly replicas of the actual location submitted . I will keep testing this. Setting this as high-priority. @linzjax, I will let you know if I find something else... multiple_location_minus_tenure_questionnaire_0.2.xlsx Timestamp: 20161006 - 113600 CEST (approx) |
100% reproducible. I've followed exactly the same steps as before in a new project with the same results (parties and locations submitted via ODK stored 3 times in the project). In a second submission, trying again submitting one party a 2 locations, resulted in 3 parties added and 6 locations in the project. Link to the project: |
@dpalomino The similarities with #770 were only in relation to the 502 error. There were no log entries aside from those posted in #platform-errors. We're not currently tracking errors from ODK. |
So after poking this quite a bit, it looks like for whatever reason, ODK is sending multiple submissions per xform submission... I could try and come up with something to prevent this, but I don't know that it would A) be any good or B) be done by the end of this week. |
@wonderchook didn't we run into this issue during testing for the first release? |
@linzjax I think so but couldn't track it down. I believe this is a known bug with ODK that others have had issues with as well. I'm inclined to say we need to come up with something to prevent this but not by the end of the week. @dpalomino I think this should be in sprint 10, what are your thoughts? |
Hi @wonderchook, @linzjax I was trying to find the ODK bug in their github repository, but I didn't find it. Does anyone have a link to that issue or to some place where we can check the steps-to-reproduce? I think it'd be important to know how this can be reproduced. If it is something to do with timeouts (because of several resources attached for instance), or for repeat groups, or a combination of both... I will try to do more testing today and try to provide more feedback later on today. |
Hi, After some more testing we've found out that:
My guess (only my guess) is that there would be some timeout expiring that makes the app to resend the form. In any case I think this is important, taking 4 pictures or more when collecting data is not unfrequent. You can see more details about the testing done here. Anyone has an idea about what could be the root cause of this? And how to work around it? |
It seems that the root cause for this is having attached resources over 10MB all together. Then the submission is automatically split (see the discussion in this thread). Adding an instanceId in the form like suggested here should assist to de-duplicate the submissions on the platform side. Thoughts? Do you think it'd be feasible to implement this "de-duplication" process when receiving and processing the forms? @linzjax @bjohare |
If we're going to allow submissions over 10MB to be split, we should seriously consider adding a per-file size limit for resources. There's real potential for project resources to exceed what the platform can display back to the user, particularly when slow/rural connection speeds are at play. It could prevent page load altogether. |
Perhaps we should have limits for mobile submissions specifically? I can think of bigger files being necessary in some situations, so I don't think we should disable this completely. |
@dpalomino we could add the |
That would be great @bjohare. Do you think this could be something reasonable for Sprint 10? Meanwhile we would advise partners not to include many resources. @amplifi @wonderchook , I think we can ask partners to use low resolution photos etc. I think including a file size limit could bring some confusion to users (as they won't probably know why it's failing and they might blame the platform). IMO I would wait to have a bigger scale to setup these limits... what do you think? |
@dpalomino I don't think we can require low resolution photos. I think we can suggest it, but there are going to be situations where high resolution is required. |
@dpalomino yes, should be ok for Sprint 10 |
At least for resources, this should not be a problem for the platform at all. Any links to resource files go directly to S3. The only possible problem for the platform are large images since we generate thumbnails for them so it's possible that uploading a gigantic image may result into a timeout. Other files types should be no problem. |
Yes @wonderchook. I meant to recommend to use at least not high-resolution images for the most typical cases (take a picture of a deed, of the landowner, etc). Not to make this mandatory of course, but recommend to configure their smartphone for not using the highest resolution possible. |
perhaps @bethschechter can add a section to the documentation regarding smartphone settings for this |
Assigning tentatively for Sprint 10 (thanks @bjohare!) . |
Steps to reproduce the error
STR are not clear at all. This has happened once when testing the repeat group feature, although we think it is totally unrelated with this.
We were submitting a form with:
However these locations where actually submitted and indeed appeared duplicated several times.
Interestingly when trying to access to the resources section, it returns a 502 error.
Link to the project:
https://platform-staging.cadasta.org/organizations/david-org-second-org/projects/testing-repeats-multiple-locations-minus-tenure/resources/
Approx timestamp of the issue: 20161005 - 184500 CEST
Don't know if this is a corner case or not if it is something more serious. Has anyone seen something like this?
The text was updated successfully, but these errors were encountered: