⭐️ Create a Github Action for Automatic LinkedIn Scraping #81

Open
1 of 5 tasks
EssamWisam opened this issue Nov 21, 2024 · 3 comments

@EssamWisam
Owner

EssamWisam commented Nov 21, 2024

Requirements:

  • Create a new dummy LinkedIn account for CMPDocs, or prepare an existing one
  • Install a separate browser and stay permanently logged in with it so the cookies never expire
  • Check whether the cookie can be stringified and stored in GitHub secrets so that the action can load it
  • Ensure that, given the cookie and a fresh environment, installing the requirements and running python scripts/linkedin-scraper/run.py is sufficient to update all files
  • Write the GitHub action to simply run those commands and commit the changes
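
A rough sketch of the "stringified cookie in a secret" idea from the list above, in case it helps. It assumes the action exposes the secret to the script as an environment variable; the variable name LINKEDIN_COOKIES_JSON and the target file cookies.json are hypothetical and not taken from the repo.

```python
import json
import os
from pathlib import Path


def write_cookies_from_env(env_var, target, environ=os.environ):
    """Parse a JSON cookie string from the environment and write it to `target`.

    `env_var` and `target` are placeholders chosen for illustration only.
    """
    raw = environ.get(env_var)
    if not raw:
        raise RuntimeError(f"{env_var} is not set; cannot restore cookies")

    cookies = json.loads(raw)  # raises if the secret is not valid JSON
    if not isinstance(cookies, list):
        raise ValueError("expected a JSON list of cookie objects")

    target = Path(target)
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_text(json.dumps(cookies, indent=2), encoding="utf-8")
    return cookies


# Hypothetical usage inside the action, before running the scraper:
# write_cookies_from_env("LINKEDIN_COOKIES_JSON", "cookies.json")
```

Failing early when the variable is missing should make a misconfigured secret show up as a clear error in the action log instead of as a failed login later.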
@EssamWisam EssamWisam self-assigned this Nov 21, 2024
@EssamWisam
Owner Author

@Iten-No-404 could you describe the browser cookie generation mechanism that the code currently assumes? It seems to differ from how I did it in another application.

@Iten-No-404
Collaborator

Sure, @EssamWisam. I used a Chrome extension called "Export cookie JSON file for Puppeteer", which can be found here. It produces a JSON file with the format below:
[Image: screenshot of the exported JSON cookie format]

@EssamWisam
Owner Author

Hmmmm. I remember using the browser-cookie library, and it was able to access browser cookies (though not in a JSON format, from what I saw). The idea is that it works cross-browser (e.g., Firefox, where I am not sure this extension exists). So I will try extracting them this way first, and if it doesn't work well I can slightly change the cookie code to handle the browser-cookie library's style.
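
For reference, one possible way to bridge the two styles, as a sketch only. It assumes the browser cookie library hands back a standard http.cookiejar.CookieJar, and that the JSON format wants Puppeteer-style keys (name, value, domain, path, expires, secure, httpOnly). Both are assumptions, since the screenshot of the actual format is not reproduced here. The helper name cookiejar_to_json_records is made up.

```python
import json
from http.cookiejar import Cookie, CookieJar


def cookiejar_to_json_records(jar):
    """Convert a CookieJar into a list of JSON-serializable dicts.

    The key names are an assumed Puppeteer-like layout, not the confirmed
    format that the scraper code expects.
    """
    records = []
    for cookie in jar:
        records.append({
            "name": cookie.name,
            "value": cookie.value,
            "domain": cookie.domain,
            "path": cookie.path,
            # Session cookies have no expiry; -1 is used here as a marker.
            "expires": cookie.expires if cookie.expires is not None else -1,
            "secure": bool(cookie.secure),
            "httpOnly": cookie.has_nonstandard_attr("HttpOnly"),
        })
    return records


def cookiejar_to_json(jar):
    """Serialize the converted records as a JSON string."""
    return json.dumps(cookiejar_to_json_records(jar), indent=2)
```

If the library output goes through a converter like this, the existing cookie-loading code might not need to change at all, which may be simpler than teaching it a second format.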
