Scrape-Linkedin

Python script that scrapes of all the Careem employees in Pakistan.

The scrape_linkedin.py file uses Selenium and BeautifulSoup to extract information about Careem Employees in Pakistan. The information include name, title, location, profile. The output data is stored in a csv file, i.e., output_search.csv.

The intodb.py file loads the output_search.csv file into pandas dataframe. It cleans the data like removes extra whitespaces, capitalize each first letter of each word in the name. It stores the information in a sqlite database. The database name is employees_details.db, and the table name is careem_employees. The exported data from the database is stored in careem_employees.csv file.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
careem_employees.csv		careem_employees.csv
intodb.py		intodb.py
output_search.csv		output_search.csv
scape_linkedin.py		scape_linkedin.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scrape-Linkedin

About

Releases

Packages

Languages

Aksa12/Scrape-Linkedin

Folders and files

Latest commit

History

Repository files navigation

Scrape-Linkedin

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages