-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Crawler not catching all links? #17
Comments
@huglester Given url is under construction atm. Can you give fresh example please? Thanks. |
@umpirsky please try now:) |
Very strange. Have you tried pressing F5? |
@huglester Ctrl + F5 did the trick. After commenting out https://github.com/umpirsky/centipede-crawler/blob/master/src/Centipede/Extractor/UrlExtractor.php#L13 I got:
Looks like there is invalid html. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
For example, script won't find any childs here:
Why so? there are more links on that page. but it only checks the mainpage?
I have checked the UrlExtractor class, the extract() method receives valid html,
but DOMDocument does not extract the links. Maybe we should use symfony/dom-crawler ?
Thanks!
The text was updated successfully, but these errors were encountered: