Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Crawl error: missing context with id #567

Open
edsu opened this issue May 9, 2024 · 0 comments
Open

Crawl error: missing context with id #567

edsu opened this issue May 9, 2024 · 0 comments

Comments

@edsu
Copy link
Contributor

edsu commented May 9, 2024

I've seen this a couple times witb v1.1.1 and thought it might be worth noting down. It doesn't seem to be easily reproducible unfortunately:

Error: INTERNAL ERROR: missing context with id = 14
    at assert (file:///app/node_modules/puppeteer-core/lib/esm/puppeteer/util/assert.js:15:15)
    at FrameManager.executionContextById (file:///app/node_modules/puppeteer-core/lib/esm/puppeteer/cdp/FrameManager.js:199:9)
    at #onBindingCalled (file:///app/node_modules/puppeteer-core/lib/esm/puppeteer/cdp/Page.js:640:44)
    at file:///app/node_modules/puppeteer-core/lib/esm/third_party/mitt/mitt.js:36:7
    at Array.map (<anonymous>)
    at Object.emit (file:///app/node_modules/puppeteer-core/lib/esm/third_party/mitt/mitt.js:35:20)
    at CdpCDPSession.emit (file:///app/node_modules/puppeteer-core/lib/esm/puppeteer/common/EventEmitter.js:77:23)
    at CdpCDPSession._onMessage (file:///app/node_modules/puppeteer-core/lib/esm/puppeteer/cdp/CDPSession.js:79:18)
    at Connection.onMessage (file:///app/node_modules/puppeteer-core/lib/esm/puppeteer/cdp/Connection.js:138:25)
    at Immediate.<anonymous> (file:///app/node_modules/puppeteer-core/lib/esm/puppeteer/node/NodeWebSocketTransport.js:37:36)

Node.js v20.11.1

In case it's useful here was my command, after removing the original site being crawled:

docker compose run --build -p 9037:9037 crawler crawl --url http://www.example.com --scopeType prefix --generateWACZ --screencastPort 9037 --collection buffon --scopeExcludeRx 'search=.=search='  --scopeExcludeRx '.*fig=.*fig=.*' --workers 4 --pageLoadTimeout 30 --text to-warc --screenshot view
@edsu edsu changed the title missing context with id Crawl error: missing context with id May 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Triage
Development

No branches or pull requests

1 participant