Option to disable or redirect log messages #11

Open
jakirkham opened this issue Apr 17, 2018 · 10 comments
Labels
question A question for IBM developers and LSF users

Comments

@jakirkham

Am seeing logging messages sent to terminal much like those described in this mailing list post (snippet of one example below). Is there some option to turn them off or redirect them to a log file at run time? Thanks.

Job <38534717> is submitted to default queue <normal>.
E #7a96 [     0.00]  * fsd_exc_new(1006,Vector have no more elements.,0)
@adamsla
Contributor

adamsla commented May 7, 2018

@jakirkham,

Please see the following IBM KB article. You can also find other references by searching Google using the following search string: "BSUB_QUIET site:ibm.com", for our environment variable reference.

Small KB article:
http://www-01.ibm.com/support/docview.wss?uid=isg3T1015935

Environment variable reference:
https://www.ibm.com/support/knowledgecenter/en/SSWRJV_10.1.0/lsf_config_ref/lsf_envars_ref.html
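
For a Python caller, a minimal sketch of setting the variable for the submitting process only might look like the following. It assumes LSF only checks whether `BSUB_QUIET` is set, so the value `1` is arbitrary, and the helper name `quiet_lsf_env` is hypothetical.

```python
import os


def quiet_lsf_env(base=None):
    """Return a copy of an environment mapping with BSUB_QUIET set."""
    env = dict(os.environ if base is None else base)
    # Assumption: any value works; LSF is only expected to check that the
    # variable is present.
    env["BSUB_QUIET"] = "1"
    return env


# Usage sketch (needs bsub on PATH, so it is left commented out):
# import subprocess
# subprocess.run(["bsub", "sleep", "10"], env=quiet_lsf_env(), check=True)
```

When jobs are submitted in-process through a DRMAA binding instead of a `bsub` subprocess, setting `os.environ["BSUB_QUIET"]` before the session is created is presumably the equivalent step, but that is an assumption worth verifying on your cluster.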

@adamsla adamsla added the question A question for IBM developers and LSF users label May 7, 2018
@jakirkham
Author

Thanks @adamsla.

So that does get rid of the "Job <38534717> is submitted to default queue <normal>." log messages. Though it does not get rid of the "E #7a96 [     0.00]  * fsd_exc_new(1006,Vector have no more elements.,0)" log messages. Do you have any idea as to how to get rid of this second set of log messages?

@adamsla
Contributor

adamsla commented May 7, 2018

You can redirect job output either to a file or /dev/null depending on what you want with that data. I suspect this is standard error. So, a few options:

-e /dev/null
-e /some/path/%J.e

Choose your poison.
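
If the submission is scripted, the two options above could be assembled roughly as follows. This is only a sketch: `bsub_argv` is a hypothetical helper, and it assumes the job is submitted by running `bsub` directly, not through DRMAA.

```python
import shlex


def bsub_argv(command, stderr_path="/dev/null", stdout_path=None):
    """Build a bsub argument list that redirects the job's standard error."""
    argv = ["bsub", "-e", stderr_path]
    if stdout_path is not None:
        argv += ["-o", stdout_path]
    # Split the job command the way a POSIX shell would.
    argv += shlex.split(command)
    return argv


# Discard stderr entirely, or keep one file per job ID:
discard = bsub_argv("sleep 10")
per_job = bsub_argv("sleep 10", stderr_path="/some/path/%J.e")
```

The resulting list can be passed to `subprocess.run` on a host where LSF is installed.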

@adamsla
Contributor

adamsla commented May 7, 2018

Also, check the user manual. That might already be supported as a standard drmaa option.

@adamsla
Contributor

adamsla commented May 7, 2018

LSF supports a variety of replacement variables:

%J - Job Id
%I - Job Index (for array jobs)
%H - Job Execution Host
%U - Job Execution User

You can use these variables programmatically in the paths for your standard output (-o), standard error (-e), and local working directory (-cwd).
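
LSF performs the substitution itself at run time. To preview which file a templated path would resolve to, a small stand-in like the one below can help. The function name `expand_lsf_pattern` is hypothetical, and it covers only the four variables listed above.

```python
import re


def expand_lsf_pattern(pattern, job_id, index=0, host="", user=""):
    """Illustrative stand-in for LSF's own expansion of %J, %I, %H and %U."""
    values = {"J": str(job_id), "I": str(index), "H": host, "U": user}
    # Replace every placeholder in a single pass.
    return re.sub(r"%([JIHU])", lambda m: values[m.group(1)], pattern)


stderr_file = expand_lsf_pattern("/some/path/%J.e", 38534717)
array_file = expand_lsf_pattern("logs/%J_%I.out", 42, index=7)
```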

@jakirkham
Author

So we are already dumping stdout and stderr to log files with DRMAA, but we are still seeing this logged outside of those files.

@adamsla
Contributor

adamsla commented May 11, 2018

What does the deck look like? This is an end-of-list exception. So, it's a drmaa internal error, which leads me to believe there is something wrong with the way you have submitted the workload.

@jakirkham
Author

Sorry to be dense, but what do you mean by "deck"?

@adamsla
Contributor

adamsla commented May 11, 2018

How are you submitting the jobs where this error is occurring?

@jakirkham
Author

jakirkham commented Jun 29, 2018

There are a few layers of indirection unfortunately. Jobs are started by a runBulkJobs call in a library called dask-drmaa, which is calling into a Python DRMAA binding library that calls into libdrmaa roughly here. The c there is just a wrapper for calling C functions and is uninteresting for our purposes.
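
Since the messages seem to come from the native library inside the submitting process, not from the jobs themselves, one possible workaround on the Python side is to redirect the process-level stderr file descriptor around the call. The sketch below assumes libdrmaa writes these lines to file descriptor 2, which is not confirmed here. The name `redirect_native_stderr` is hypothetical.

```python
import contextlib
import os
import sys


@contextlib.contextmanager
def redirect_native_stderr(path):
    """Temporarily point file descriptor 2 at `path`.

    Unlike contextlib.redirect_stderr, this also affects writes made by
    C libraries loaded into the process.
    """
    target_fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_APPEND, 0o644)
    sys.stderr.flush()
    saved_fd = os.dup(2)  # remember the original stderr
    try:
        os.dup2(target_fd, 2)
        yield
    finally:
        sys.stderr.flush()
        os.dup2(saved_fd, 2)  # restore the original stderr
        os.close(saved_fd)
        os.close(target_fd)


# Usage sketch; `session` and `jt` stand for an existing DRMAA session and
# job template and are assumptions, so the call is left commented out:
# with redirect_native_stderr("drmaa-native.log"):
#     job_ids = session.runBulkJobs(jt, 1, 10, 1)
```

This changes stderr for the whole process while the block is active, so other threads writing to stderr would be captured as well.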
