[NUTCH-2856] Implement a protocol-smb plugin based on hierynomus/smbj #826
base: master
Draft version of a protocol-smb plugin. Lots of todo comments still, but it seems to work.
Moving this to DRAFT status and acknowledging the PR. Thank you, @HiranChaudhuri.
Hi @HiranChaudhuri, I added quite a few comments for your consideration. Thanks for submitting this PR 👍
Please ping me once you're ready and we can go for round 2 of peer review.
Further out, I think we could implement some testing for this protocol plugin. We could use testcontainers
and essentially spin up a local Samba server using @nddipiazza's smbj-inttest image. We can come back to this once the PR has evolved a bit.
src/plugin/protocol-smb/src/java/org/apache/nutch/protocol/smb/Smb.java (4 resolved review threads, outdated)
src/plugin/protocol-smb/src/java/org/apache/nutch/protocol/smb/SmbURLConnection.java (1 resolved review thread)
Improve error handling
Rename class as requested
Added license header
Improve URL parsing
Added robots.txt
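For readers following the URL parsing item above, here is a minimal sketch of how an smb:// URL could be split into host, port, share and path using only `java.net.URI`. This is not the code from the PR: the class name `SmbUrlParts`, its fields, and the fallback to port 445 (the standard SMB port) are assumptions made for illustration.

```java
import java.net.URI;

/** Hypothetical helper, not part of the PR: splits an smb:// URL into its parts. */
public final class SmbUrlParts {
    public final String host;
    public final int port;
    public final String share;
    public final String path;

    private SmbUrlParts(String host, int port, String share, String path) {
        this.host = host;
        this.port = port;
        this.share = share;
        this.path = path;
    }

    public static SmbUrlParts parse(String url) {
        // URI.create throws IllegalArgumentException on malformed input
        // (for example, unescaped spaces).
        URI uri = URI.create(url);
        if (!"smb".equalsIgnoreCase(uri.getScheme())) {
            throw new IllegalArgumentException("Not an smb URL: " + url);
        }
        String host = uri.getHost();
        if (host == null) {
            throw new IllegalArgumentException("Missing or unparsable host: " + url);
        }
        // Assumption: fall back to 445 when the URL carries no explicit port.
        int port = uri.getPort() == -1 ? 445 : uri.getPort();

        // getPath() returns the percent-decoded path, e.g. "/docs/reports/q1.pdf".
        String raw = uri.getPath() == null ? "" : uri.getPath();
        String trimmed = raw.startsWith("/") ? raw.substring(1) : raw;

        // The first path segment is treated as the share, the rest as the path inside it.
        int slash = trimmed.indexOf('/');
        String share = slash < 0 ? trimmed : trimmed.substring(0, slash);
        String path = slash < 0 ? "" : trimmed.substring(slash + 1);
        return new SmbUrlParts(host, port, share, path);
    }
}
```

With this sketch, `SmbUrlParts.parse("smb://fileserver/docs/reports/q1.pdf")` yields host `fileserver`, share `docs` and path `reports/q1.pdf`. Hosts that `java.net.URI` cannot parse as a server name (for example, names containing underscores) are rejected here rather than guessed at, and the real plugin may well handle such cases differently.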
Thanks for all of the responses, @HiranChaudhuri. Please see my updated suggestions.
Hi @HiranChaudhuri now that you've activated the plugin
Please let me know your thoughts on this.
The container looks good. I have no clue about the @Rule annotation and am interested to see how this all fits together.
Trivial header requests. Thanks
@@ -0,0 +1,6 @@
<?xml version="1.0" encoding="UTF-8"?>
Please provide ALv2 header
Done
@@ -0,0 +1,120 @@
package org.apache.nutch.protocol.smb;
Please provide ALv2 header
Done