Spark History Server unable to read rolling logs from COS using stocator #321

Open
shivprakashy opened this issue Dec 23, 2022 · 0 comments


shivprakashy commented Dec 23, 2022

Spark History Server (SHS) fails to read rolling event log files when it is configured to use stocator.
The two configurations that were tried are listed below (SHS reads its configuration from spark-defaults.conf).

Non-Working Configuration

spark.hadoop.fs.cos.hbdevshivcos.endpoint https://s3.us-south.cloud-object-storage.appdomain.cloud
spark.hadoop.fs.cos.hbdevshivcos.access.key <hmac-access-key>
spark.hadoop.fs.cos.hbdevshivcos.secret.key <hmac-secret-key>
spark.hadoop.fs.stocator.cos.impl com.ibm.stocator.fs.cos.COSAPIClient
spark.hadoop.fs.cos.impl com.ibm.stocator.fs.ObjectStoreFileSystem
spark.hadoop.fs.stocator.scheme.list cos
spark.hadoop.fs.stocator.cos.scheme cos
spark.history.fs.logDirectory cos://hbdev-shiv.hbdevshivcos/spark-events
spark.eventLog.dir cos://hbdev-shiv.hbdevshivcos/spark-events

Working Configuration (started with $ start-history-server --properties-file shs.properties; the content of shs.properties is below)

spark.hadoop.fs.s3a.endpoint=https://s3.us-south.cloud-object-storage.appdomain.cloud
spark.hadoop.fs.s3a.access.key=<testaccesskey>
spark.hadoop.fs.s3a.secret.key=<testsecretkey>
spark.hadoop.fs.s3a.impl=org.apache.hadoop.fs.s3a.S3AFileSystem
spark.hadoop.fs.s3a.scheme.list=s3a
spark.hadoop.fs.s3a.scheme=s3a
spark.history.fs.logDirectory=s3a://hbdev-shiv/spark-events
spark.eventLog.dir=s3a://hbdev-shiv/spark-events

The difference between the s3a and COS (stocator) implementations when reading rolling log files needs to be investigated and fixed.
More details to follow...
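
As a possible starting point for that investigation, here is a minimal sketch of a layout check for a single rolling event log directory. It assumes the naming that Spark's rolling event log writer uses (a directory named eventlog_v2_<appId> that holds an appstatus_ file and one or more events_ files), and it assumes that SHS only treats the entry as a rolling log when the file system reports it as a directory. Whether stocator and s3a actually differ on any of these points is a guess, not something confirmed in this report. The Entry class and check_rolling_log_dir function are hypothetical helpers, and the listings have to be collected separately for each scheme (for example by hand from `hadoop fs -ls` output).

```python
from dataclasses import dataclass
from typing import List

# Name prefixes assumed from Spark's rolling event log writer (Spark 3.0+).
ROLLING_DIR_PREFIX = "eventlog_v2_"
APPSTATUS_PREFIX = "appstatus_"
EVENTS_PREFIX = "events_"


@dataclass
class Entry:
    """Hypothetical stand-in for one entry of a file system listing."""
    name: str
    is_directory: bool


def check_rolling_log_dir(entry: Entry, children: List[Entry]) -> List[str]:
    """Return the reasons a listing would not look like a rolling event log."""
    problems = []
    if not entry.name.startswith(ROLLING_DIR_PREFIX):
        problems.append("name does not start with " + ROLLING_DIR_PREFIX)
    if not entry.is_directory:
        problems.append("entry is not reported as a directory")
    # Only plain files count as appstatus or event log files.
    files = [c.name for c in children if not c.is_directory]
    if not any(n.startswith(APPSTATUS_PREFIX) for n in files):
        problems.append("no appstatus_ file in listing")
    if not any(n.startswith(EVENTS_PREFIX) for n in files):
        problems.append("no events_ file in listing")
    return problems


# Example: the same children, with the parent reported two different ways.
children = [Entry("appstatus_app-1", False), Entry("events_1_app-1", False)]
as_dir = check_rolling_log_dir(Entry("eventlog_v2_app-1", True), children)
as_file = check_rolling_log_dir(Entry("eventlog_v2_app-1", False), children)
```

Running the check on the listing obtained through cos:// and on the one obtained through s3a:// for the same bucket would show whether the two connectors present the directory differently. An empty result for both would point the investigation elsewhere.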

@shivprakashy shivprakashy changed the title Spark History Server not able to read rolling logs Spark History Server unable to read rolling logs from COS using stocator Dec 23, 2022