AAE Failing hourly, with exit value {badarg,46}, [{base64,decode_binary[{base64.erl... [JIRA: RIAK-1557] #450
Comments
Hey @Boardom. You should be able to see the entire error in the crash.log file. That'd help us take a look at what's going on. If you need more info about Logs in Riak, check out http://docs.basho.com/riak/latest/community/faqs/logs/. |
Yup. I had already looked there... The only difference is that one prints out the ===ERROR REPORT==== and then the line. Any other suggestions? |
@Boardom have you tried deleting the |
We actually pushed updated schemas, which appears to have resolved the issue... Unfortunate, as there's probably a data handling issue somewhere... Thanks for the help. Feel free to close. |
@Boardom yeah... surprised that there were no more descriptive data handling/corruption errors in any of the logs, especially the solr.log, if the validation issues occurred on the Solr side, which would be my guess here. We have an issue open to better expose these Solr errors in the logs, #446. Thanks. |
I looked into it a bit a while back, and it stemmed back to something with |
Could you please re-open this ticket? It would seem that the AAE issues I mentioned on the mailing list here: http://lists.basho.com/pipermail/riak-users_lists.basho.com/2015-February/016782.html are caused by this very bug. Every time I clear the yz_anti_entropy directory and attach to a node to run: (yz_entropy_mgr:init([])). I get the badarg,46 error in the logs. |
@stevegaron it seems like the issue may be in the data itself, though, causing the AAE errors; as per #446, we need a better way to log the issues w/ the data. |
I have the same issue on both my prod server and my dev server. My dev server is nearly empty. I can run any debugging on my dev server. Can I run AAE manually in a shell so I can see the full error message? |
@stevegaron I'd try the commands (and set_envs) shown in this video spot: https://www.youtube.com/watch?v=ETJqu5SmwOc#t=2290 where he's calling |
@stevegaron I'm wondering if one of your keys contains spaces, as it may be due to this, #436. If so, I'll be working on that fix next week. |
Yes I have 5000 ish keys with spaces... |
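For anyone else checking whether their data is affected, here is a minimal sketch of how a list of keys could be screened for whitespace. It assumes the keys have already been exported to a plain Python list by some other means; `keys_with_whitespace` and `sample_keys` are hypothetical names used for illustration and are not part of Riak or Yokozuna.

```python
import re

# Hypothetical helper for illustration; not part of Riak or Yokozuna.
# Matches any whitespace character (space, tab, newline, ...).
_WHITESPACE = re.compile(r"\s")


def keys_with_whitespace(keys):
    """Return the keys that contain at least one whitespace character."""
    return [key for key in keys if _WHITESPACE.search(key)]


if __name__ == "__main__":
    # Made-up sample keys, assumed to have been exported from a bucket.
    sample_keys = ["user:1001", "order 42", "report-2015-02", "first last"]
    for key in keys_with_whitespace(sample_keys):
        print(repr(key))
```

The sketch only identifies candidate keys; how they are listed from the cluster and whether they are removed or renamed is left to the operator.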
@stevegaron ok... we already have something close to a fix for this on the Java side. I should be able to PR something next week. /cc @Boardom @seancribbs. Sorry for the inconvenience. |
@zeeshanlakhani I'll remove all keys with spaces in my dev cluster and let you know if AAE finishes. Will this fix make it into 2.0.5? |
@zeeshanlakhani I've started to see numbers showing up in the 'riak-admin search aae-status' output and I haven't seen any errors so far... looks promising. Hopefully you guys can release 2.0.5 with that patch in fairly soon. |
@stevegaron We are trying to solidify and verify 2.0.5 right now. We discussed this issue today in our weekly meeting and decided that, if the fix doesn't make it into 2.0.5, we will at least make a patch available to you and other affected users. We will update this issue as soon as we know whether the patch will be in 2.0.5. Thank you for working with @zeeshanlakhani on confirming the bug. |
@stevegaron The fix did not make it into 2.0.5, but @zeeshanlakhani tells me the patch is nearly ready. We'll update this again soon. |
Fellas, just noticed the space issue wasn't raised as a 'Known Issue' in the 2.0.5 |
@Boardom Release Notes have been updated... and the patch is in the review stage: #459. |
PR -> #459 _[posted via JIRA by Zeeshan Lakhani]_ |
PR -> #459... S3 updated w/ yokozuna-2.jar _[posted via JIRA by Zeeshan Lakhani]_ |
Code review done, awaiting CI. _[posted via JIRA by Sean Cribbs]_ |
Updated "fix version" to 2.0.6 since Zeeshan added the related GH to the 2.0.6 release notes based on this fix landing in 2.0.6. _[posted via JIRA by Patricia Brewer]_ |
We are getting these error messages hourly, I assume when trying to rebuild the AAE tables....
Unfortunately, Erlang appears to cut off the error logs, so that's all I can provide currently...
Would appreciate any tips on how to proceed in terms of debugging the issue.
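One possible way to read the truncated term, offered as a sketch and not a diagnosis: it assumes the integer in {badarg,46} is the byte value that base64 decoding rejected, which the truncated log does not confirm. Under that assumption, the snippet below maps the value to its character and checks it against the standard base64 alphabet from RFC 4648. `describe_bad_byte` is a hypothetical helper name.

```python
import string

# Standard base64 alphabet (RFC 4648) plus the '=' padding character.
BASE64_ALPHABET = set(string.ascii_letters + string.digits + "+/=")


def describe_bad_byte(value):
    """Describe a byte value taken from an error term such as {badarg,46}.

    Assumption: the integer is the byte that the decoder rejected.
    """
    char = chr(value)
    return {
        "value": value,
        "char": char,
        "in_base64_alphabet": char in BASE64_ALPHABET,
    }


if __name__ == "__main__":
    # 46 is the ASCII code for '.', which is not a standard base64 character.
    print(describe_bad_byte(46))
```

If the assumption holds, the decoder was handed input containing a '.' where base64 text was expected, which narrows the search to whatever produced that input.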