-
-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New production crawler #1164
Comments
I'll create a 4Gb one as the production one, to save us money |
Pointed ota.tosdr.org to 159.223.154.84 in DNS |
Copied most updated instructions from #1160 :
(pick a good hard-to-guess password that will be used for sudo later)
|
I accidentally picked Ubuntu 24.04 and confirmed that this way of installing Puppeteer on a server really doesn't work for that OS version. Rebuilding it as Ubuntu 20.04 now. |
|
Running
in a screen now, let's see how it does on a 4Gb server. |
The run took 35 minutes, so that's great! Will add the github user and schedule it hourly. |
Cron job is running now
I set up a git ssh key, will see if the engine will do a git push. if not, we can add it to the /home/crawler/hourly.sh script |
The robustness of the cronjob is still not acceptable for production usage, so this work is now blocked on #1174 |
Done |
We've set up the server at 206.81.0.208 with a user 'runner' that has both @madoleary's and my ssh key authorized.
It can run
cd ~/server ; npx ota track
in a screen.I pull the data to my laptop and then relay it to github:
And same for tosdr-snapshots.
The text was updated successfully, but these errors were encountered: