Kusto Stream Ingestion #301
Hello @jrob5756, thanks for the PR. We're going to have a look at this and provide a review in a week or two (we are in the process of a version bump for JDK 11 and up).
We need to discuss internally the way we want to deal with streaming,
Thanks @ohadbitt. Good call on the microbatch vs. partition comment; I updated the PR summary to reflect that. Regarding the Queued fallback approach, I had the same thoughts while implementing these changes. As you pointed out, there are pros and cons to both strategies. If the team is more comfortable splitting the data into 4 MB chunks, I can certainly do that, as it would not be difficult to implement. Let me know!
@ag-ramachandran quick update. We discussed this offline, but I'm adding comments here as well for transparency. Since the PR was created, the code has been updated to chunk the requests as discussed above. I also saw that there have been some recent structural changes that were incompatible with the PR; I've updated the code, so we should be squared away there as well. I await your review.
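For readers following the chunking discussion, here is a minimal sketch of the general idea of splitting serialized rows into requests of at most 4 MB. It is not the PR's implementation: the names `chunk_rows` and `MAX_CHUNK_BYTES` are hypothetical, the rows are assumed to be already serialized to bytes, and the sketch is in Python rather than the connector's own language.

```python
from typing import Iterable, Iterator, List

# Assumption: the 4 MB figure mentioned in the thread, expressed in bytes.
MAX_CHUNK_BYTES = 4 * 1024 * 1024


def chunk_rows(rows: Iterable[bytes],
               max_bytes: int = MAX_CHUNK_BYTES) -> Iterator[List[bytes]]:
    """Group serialized rows into chunks whose total size stays within max_bytes."""
    chunk: List[bytes] = []
    size = 0
    for row in rows:
        if len(row) > max_bytes:
            # A single oversized row cannot fit in any chunk; surface it
            # to the caller instead of silently dropping or truncating it.
            raise ValueError("a single row exceeds the chunk size limit")
        if chunk and size + len(row) > max_bytes:
            # Adding this row would overflow the current chunk, so emit it first.
            yield chunk
            chunk, size = [], 0
        chunk.append(row)
        size += len(row)
    if chunk:
        # Emit whatever is left after the last row.
        yield chunk
```

Each yielded chunk would then be sent as one streaming request. How an oversized single row should be handled (fail, or fall back to queued ingestion) is the design choice debated above; this sketch simply raises.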
@jrob5756 @ag-ramachandran We are working on a POC to stream large volumes of data using the Spark-Kusto connector (Databricks) into ADX streaming tables, with a requirement of very low ingestion latency (ms). Thank you for your time and dedication in developing this feature. Looking forward to your update.
Hello @Raghu0210, we should get this picked up by the end of this year. The challenge has been the changes we have been making in the connector, which left this at a lower priority. We will keep this thread updated.
Pull Request Description
This pull request contains code (and tests) to introduce Kusto streaming ingestion support in the Spark Kusto Connector.
Stream ingestion is enabled by setting the KUSTO_WRITE_MODE option to Stream.
Notes
Breaking Changes:
Features:
Fixes: