Backfill SparkJobRun.log_uri for old runs #520

robhudson · 2017-05-31T18:44:08Z

In #477 we will start recording log_uri for each Spark job run. Runs created before that lands, however, have no relation from the run to its log. The EMR cluster details API can provide the LogUri we need to populate these, but that means one API call per run, so we want to be mindful of API limits. The idea would be to create a Celery task that backfills in batches of x runs, with one batch scheduled every y minutes.
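A rough sketch of the batching idea, in case it helps the discussion. It assumes each old run has its EMR cluster id stored somewhere we can read, and that `describe_cluster` behaves like the boto3 EMR client method of that name (returning the URI under `Cluster.LogUri`). The helper names `chunked`, `schedule_batches`, and `backfill_batch` are made up for illustration and are not existing ATMO code.

```python
from typing import Callable, Dict, Iterator, List, Sequence, Tuple


def chunked(items: Sequence[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield list(items[start:start + size])


def schedule_batches(
    cluster_ids: Sequence[str], batch_size: int, interval_minutes: int
) -> List[Tuple[int, List[str]]]:
    """Return (delay_in_seconds, batch) pairs, one batch per interval."""
    return [
        (index * interval_minutes * 60, batch)
        for index, batch in enumerate(chunked(cluster_ids, batch_size))
    ]


def backfill_batch(
    cluster_ids: Sequence[str], describe_cluster: Callable[..., dict]
) -> Dict[str, str]:
    """Look up the LogUri for each cluster id, one API call per id.

    `describe_cluster` is assumed to accept ClusterId=... and return a dict
    shaped like {"Cluster": {"LogUri": "s3://..."}}. Ids without a LogUri
    are skipped rather than stored as empty values.
    """
    found = {}
    for cluster_id in cluster_ids:
        response = describe_cluster(ClusterId=cluster_id)
        log_uri = response.get("Cluster", {}).get("LogUri")
        if log_uri:
            found[cluster_id] = log_uri
    return found
```

With Celery, each pair from `schedule_batches` could be handed to a task via `apply_async(args=[batch], countdown=delay)`, which keeps the API calls to at most x per y minutes.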

jezdez · 2017-05-31T19:46:11Z

So I'm not sure if #477 is enough, since what we do is create logfiles as part of the job processing, in batch.sh: https://github.com/mozilla/emr-bootstrap-spark/blob/b3c7412b2f6b61c02b27125d6cad5935c16985ad/ansible/files/steps/batch.sh#L51 (and following lines)

Here's where the logs are uploaded to S3: https://github.com/mozilla/emr-bootstrap-spark/blob/b3c7412b2f6b61c02b27125d6cad5935c16985ad/ansible/files/steps/batch.sh#L112

jezdez · 2017-05-31T19:46:40Z

The log_uri, as I understand it, is the log of the cluster bootstrapping, which is not the same as the logfiles of the actual job.

robhudson · 2017-05-31T21:28:19Z

Thanks for pointing out that the LogUri we pass isn't the job logs we display in ATMO.

Would it be worth it to try to match up the log files in the S3 bucket with the historical job runs? Maybe listing all the log files sorted by time and attaching them to the runs sorted by time would be close enough?
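To make the time-ordered matching concrete, here is a minimal sketch. It assumes we can get a (run id, start time) pair for each historical run and a (S3 key, last-modified time) pair for each log file. The function name and the input shapes are hypothetical.

```python
from datetime import datetime
from typing import Dict, List, Tuple


def match_logs_to_runs(
    runs: List[Tuple[int, datetime]], logs: List[Tuple[str, datetime]]
) -> Dict[int, str]:
    """Pair the n-th oldest run with the n-th oldest log file.

    Both lists are sorted by their timestamp and zipped together, so any
    extra runs or extra logs at the end are left unmatched.
    """
    runs_sorted = sorted(runs, key=lambda run: run[1])
    logs_sorted = sorted(logs, key=lambda log: log[1])
    return {
        run_id: key
        for (run_id, _), (key, _) in zip(runs_sorted, logs_sorted)
    }
```

One caveat: this is only "close enough" if every run produced exactly one log. A single missing log file would shift every later pairing by one.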

Or just drop it altogether?

rafrombrc · 2017-06-14T18:15:44Z

We've got another approach for handling #477 (using cluster id to generate the log URLs) but we won't be able to do the backfill.

rafrombrc closed this as completed Jun 14, 2017

robhudson mentioned this issue Jun 14, 2017

Record log URI in spark job runs #477

Open
