In this challenge I took on the role of a data engineer at a company whose employee information is spread across six different .csv files. My task was to link these files in a meaningful way, so I started with an entity-relationship diagram (ERD) to visualize what is in each CSV and how the files relate to one another. That ERD is below.
Next, I used PostgreSQL as my database and wrote a table schema to hold the .csv data. I created six tables, one per CSV, taking note of data types, primary keys, and foreign keys. After importing the data, I performed a series of analyses directly in PostgreSQL, ranging from standard to more complex. The questions included: listing employee information, filtering for employees hired in 1986, listing the manager of each department, finding employees with the first name Hercules and a last name starting with the letter "B", listing employees in the Sales department, and counting employees who share a last name. Some of these questions required queries with multiple JOINs, GROUP BYs, and WHERE clauses. All the queries are in the "queries.sql" file.
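To give a flavor of that setup, here is a minimal sketch of two of the six tables and two representative queries. The table and column names (employees, salaries, dept_emp, departments, emp_no, hire_date, and so on) are assumptions based on the ERD, not necessarily the exact names in queries.sql.

```sql
-- Sketch of two of the six tables, showing the primary/foreign key
-- relationships captured in the ERD (column names are assumptions).
CREATE TABLE employees (
    emp_no      INT         PRIMARY KEY,
    first_name  VARCHAR(50) NOT NULL,
    last_name   VARCHAR(50) NOT NULL,
    hire_date   DATE        NOT NULL
);

CREATE TABLE salaries (
    emp_no  INT NOT NULL REFERENCES employees (emp_no),
    salary  INT NOT NULL
);

-- A "standard" query: employees hired in 1986.
SELECT emp_no, first_name, last_name, hire_date
FROM employees
WHERE EXTRACT(YEAR FROM hire_date) = 1986;

-- A more complex query combining JOINs and a WHERE clause:
-- employees in the Sales department (assumes dept_emp and departments
-- tables linking emp_no to dept_no, per the ERD).
SELECT e.emp_no, e.first_name, e.last_name
FROM employees AS e
JOIN dept_emp    AS de ON de.emp_no = e.emp_no
JOIN departments AS d  ON d.dept_no = de.dept_no
WHERE d.dept_name = 'Sales';
```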
After querying my database, I used Pandas to analyze the data further. In Bonus.ipynb, I used SQLAlchemy to connect to the database and pull three of the tables into a Jupyter notebook. I wanted to visualize the salary distribution of the employees and determine the average salary per title, so I merged the imported tables and created my two plots. A sketch of that workflow is below, followed by what I found.
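This is a minimal sketch of the notebook workflow, not the exact contents of Bonus.ipynb; the connection string, credentials, database name, and column names (emp_no, salary, title) are placeholder assumptions.

```python
# Sketch of the Bonus.ipynb workflow: connect, load, merge, plot.
import pandas as pd
import matplotlib.pyplot as plt
from sqlalchemy import create_engine

# Connect to the local PostgreSQL database (credentials and database
# name are placeholders, not the real ones).
engine = create_engine("postgresql://postgres:password@localhost:5432/employee_db")

# Pull three of the tables into DataFrames.
employees = pd.read_sql("SELECT * FROM employees", engine)
salaries = pd.read_sql("SELECT * FROM salaries", engine)
titles = pd.read_sql("SELECT * FROM titles", engine)

# Merge employees with their salaries and titles on the shared key.
merged = employees.merge(salaries, on="emp_no").merge(titles, on="emp_no")

# Plot 1: histogram of the salary distribution.
merged["salary"].plot.hist(bins=20, title="Salary Distribution")
plt.xlabel("Salary")
plt.show()

# Plot 2: average salary per title.
merged.groupby("title")["salary"].mean().plot.bar(title="Average Salary by Title")
plt.ylabel("Average Salary")
plt.tight_layout()
plt.show()
```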