Analysis: Collecting Data on Issue Completion per Prework Author and Creating Looker Dashboards to Uncover Insights #4152

kimberlytanyh · 2023-03-12T17:36:12Z

Dependency

Create Project Board Looker Dashboard for Different Roles #4921. Resume when the dashboard is ready

Overview

We need to collect data on the authors of all the prework issues in our repository to perform data analysis.

Action Items

Of the 202 people, how many people left the team?
How many people started and got to "Complexity: Large" issues (completed at least 2 for combined first and second good issue, and one of every other complexity type)?

Perform above analysis again on only closed prework issues.
Clean data and get number and percentage of closed large issues that were unassigned in Google Sheets
Create Google spreadsheet with list of issues that have more than one complexity label and unassigned closed large issues.
Perform cohort analysis on closed prework authors

Clean data from API and create dataset
Import into Google Drive and visualize data in Google Sheets

Research how to connect data to Looker Studio in a way that new data can come in and Looker visualizations are automatically updated.
Create new repository with Sophia and Chelsey's help that has GitHub Actions that perform cron job so that Python script can be run automatically daily for fresh data.
Add automation components to Python script and verify data cleaning accuracy.
Create Looker dashboard with data pulled in.
Refine the Looker dashboard so that it is more intuitive
Investigate correlation between number of issues available and cohort performance:
- Design analysis and investigate where/how data can be obtained

Might be separated into another issue

Get project board column data from GitHub and clean the data
Set up data source and create Looker dashboard to show live number of issues available per role
Create separate dashboard pages for developers (front end, back end, front and back end, and dev lead)
Create documentation of process for GitHub class using Hack for LA template
Set up automation of running of Python script so that dashbboard updates automatically

Resources/Instructions

GitHub API Documentation
GitHub Rate Limiting
Link to GitHub Data Analysis Folder
Spreadsheet with accurate numbers as of 03/26/2023
Link to process documentation
Using Google Sheets API to add and refresh dataframe in Python to Google Sheets:
https://www.youtube.com/watch?v=sVURhxyc6jE
https://medium.com/@jb.ranchana/write-and-append-dataframes-to-google-sheets-in-python-f62479460cf0
https://www.youtube.com/watch?v=3wC-SCdJK2c
Slides documentation process from Python to GitHub

kimberlytanyh · 2023-03-12T17:41:45Z

After finish drafting this issue, add the label "Ready for Product".

ExperimentsInHonesty · 2023-03-16T00:59:48Z

@kimberlytanyh Add a step to add data to a google sheet on the Team Google Drive. Add a link to the folder it will go in, under the resources section.

kimberlytanyh · 2023-03-19T17:05:36Z

Weekly Update:

Progress: Retrieved data and calculated count of issues per complexity label. Left with converting rows to columns so that we can see the distribution in one row per assignee, and completing documentation.
Blockers: None
Availability: Mon - Fri, 12:00-5:00PM
ETA: ~21 hours. 1-3 hours for remaining deliverables.

kimberlytanyh · 2023-03-31T00:25:59Z

Weekly Update:

Progress: Adjusted data cleaning method and calculated count of issues per complexity label. Exported dataset as csv and uploaded to the drive. Manually checked accuracy of data. Working on data analysis now.
Blockers: None
Availability: Thurs-Saturday, Anytime
ETA: ~17 hours

ExperimentsInHonesty · 2023-04-06T18:38:04Z

@kimberlytanyh we are in the process of changing the labels on issues currently labeled Complexity: Good second issue to good first issue

Why?

to improve current and future data analysis
to make it easier for devs to know how many issues we need to make at a given time
to make it easier for devs who are looking for their next issue to find one

What you need to know

The issue where you can stay informed of the progress is here: Roll Out plan: Make issues required to change all the places that Good Second Issues appears with Good First Issue #4432
The changes I made so far, are that I rolled out the change to all the closed issues and PRs. So there should be no closed Complexity: Good second issue Issue or PR
You will have to revise your analysis to calculate if team members have done two good first issues instead of first grouping good first issue and Complexity: Good second issue together

kimberlytanyh · 2023-04-06T20:43:23Z

@ExperimentsInHonesty Thank you for the heads up! I will adjust my code for the next round of analysis rerun accordingly.

kimberlytanyh · 2023-04-15T17:18:30Z

Weekly Update:

Progress: Identified means for identifying pull requests in retrieved issues through GitHub API. Will re-perform all analyses done and try to improve accuracy of datasets.
Blockers: None
Availability: Saturday
ETA: ~6 hours

kimberlytanyh · 2023-04-16T17:45:38Z

@ExperimentsInHonesty As discussed in the Sunday Team Meeting, below are the labels to be added to prework/tracking issues for better data analysis:

Team Member Progression

Join the team
Set up development environment
Self-assigning an issue and communicating availability
Progress Report
Good first issue (2nd)
Small
Issue making: Level 1
Medium
Issue making: Level 2
Large
Join the merge team
Extra Large
Issue making: Level 3
Issue making: Level 4

kimberlytanyh · 2023-06-10T21:34:50Z

Progress: In the process of changing one more section of the code for automation and double checking accuracy of data after cleaning (need to improve accuracy of crediting the right amount of small issues for agenda issues that have multiple assignees). Next step is to add the Python script for automation and clean and create dataset for the live dashboard on number of issues available.

Blockers: None yet.
Availability: 6-8 hours
ETA: A few more weeks since it is an evolving and ongoing issue.

github-actions · 2023-06-30T07:19:27Z

@kimberlytanyh

Please add update using the below template (even if you have a pull request). Afterwards, remove the '2 weeks inactive' label and add the 'Status: Updated' label.

Progress: "What is the current status of your project? What have you completed and what is left to do?"
Blockers: "Difficulties or errors encountered."
Availability: "How much time will you have this week to work on this issue?"
ETA: "When do you expect this issue to be completed?"
Pictures (optional): "Add any pictures of the visual changes made to the site so far."

If you need help, be sure to either: 1) place your issue in the developer meeting discussion column and ask for help at your next meeting, 2) put a "Status: Help Wanted" label on your issue and pull request, or 3) put up a request for assistance on the #hfla-site channel. Please note that including your questions in the issue comments- along with screenshots, if applicable- will help us to help you. Here and here are examples of well-formed questions.

_{You are receiving this comment because your last comment was before Tuesday, June 27, 2023 at 12:17 AM PST.}

kimberlytanyh · 2023-07-02T15:33:15Z

Progress: Completed documentation of process for live issue availability dashboard (for GitHub class). Left to do: Edit Python script to add in data from other columns, add it to repository for automation, and finish creating dashboard.
Blockers: None yet. Might have to consult Data Science COP about auto running automation script.
Availability: 21 hours next week Mon-Fri.
ETA: By next week or two.

mayankt153 · 2024-09-22T06:02:13Z

github-actions bot added Feature Missing This label means that the issue needs to be linked to a precise feature label. role missing labels Mar 12, 2023

kimberlytanyh added ready for product and removed Draft Issue is still in the process of being created labels Mar 12, 2023

kimberlytanyh changed the title ~~Prework Analysis~~ Prework Analysis: Collecting Data on Issue Completion per Prework Author Mar 12, 2023

ExperimentsInHonesty added this to the 08. Team workflow milestone Mar 16, 2023

ExperimentsInHonesty assigned kimberlytanyh Mar 16, 2023

ExperimentsInHonesty removed the ready for product label Mar 16, 2023

github-actions bot added the Status: Updated No blockers and update is ready for review label Mar 17, 2023

github-actions bot removed the Status: Updated No blockers and update is ready for review label Mar 24, 2023

ExperimentsInHonesty mentioned this issue Apr 6, 2023

Roll Out plan: Make issues required to change all the places that Good Second Issues appears with Good First Issue #4432

Open

27 tasks

github-actions bot added the To Update ! No update has been provided label Apr 14, 2023

github-actions bot removed the To Update ! No update has been provided label Apr 21, 2023

github-actions bot added the 2 weeks inactive An issue that has not been updated by an assignee for two weeks label Jun 9, 2023

kimberlytanyh added Status: Updated No blockers and update is ready for review and removed 2 weeks inactive An issue that has not been updated by an assignee for two weeks labels Jun 10, 2023

github-actions bot removed the Status: Updated No blockers and update is ready for review label Jun 16, 2023

github-actions bot added To Update ! No update has been provided 2 weeks inactive An issue that has not been updated by an assignee for two weeks and removed To Update ! No update has been provided labels Jun 23, 2023

kimberlytanyh added Status: Updated No blockers and update is ready for review and removed 2 weeks inactive An issue that has not been updated by an assignee for two weeks labels Jul 2, 2023

kimberlytanyh changed the title ~~Prework Analysis: Collecting Data on Issue Completion per Prework Author~~ Prework Analysis: Collecting Data on Issue Completion per Prework Author and Creating Looker Dashboards to Uncover Insights Jul 2, 2023

kimberlytanyh changed the title ~~Prework Analysis: Collecting Data on Issue Completion per Prework Author and Creating Looker Dashboards to Uncover Insights~~ Analysis: Collecting Data on Issue Completion per Prework Author and Creating Looker Dashboards to Uncover Insights Jul 2, 2023

ExperimentsInHonesty added the Dependency An issue is blocking the completion or starting of another issue label Jul 2, 2023

kimberlytanyh mentioned this issue Jul 7, 2023

Create Project Board Looker Dashboard for Different Roles #4921

Open

22 tasks

kimberlytanyh removed their assignment Jul 7, 2023

ExperimentsInHonesty added the feature: progress tracker dashboard label Apr 9, 2024

kellyc9 mentioned this issue Apr 9, 2024

Epic: Dashboards planning #6614

Open

16 tasks

ExperimentsInHonesty added this to P: HfLA Website: Project Board Jun 23, 2024

ExperimentsInHonesty moved this to Ice box in P: HfLA Website: Project Board Jun 23, 2024

Eleftherios06 added this to P: HfLA Dashboards: Project Board Jun 30, 2024

Samhitha444 removed this from P: HfLA Dashboards: Project Board Aug 25, 2024

Samhitha444 added this to P: HfLA Dashboards: Project Board Aug 25, 2024

mayankt153 moved this to New Issue Approval in P: HfLA Dashboards: Project Board Aug 25, 2024

ExperimentsInHonesty added feature: skills / productivity ladder dashboard and removed feature: ladder progress dashboard labels Sep 8, 2024

ExperimentsInHonesty mentioned this issue Oct 6, 2024

Epic: Skills/Productivity Dashboard #7539

Open

7 tasks

Samhitha444 mentioned this issue Oct 27, 2024

Epic: Issues Dashboard #7505

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Analysis: Collecting Data on Issue Completion per Prework Author and Creating Looker Dashboards to Uncover Insights #4152

Analysis: Collecting Data on Issue Completion per Prework Author and Creating Looker Dashboards to Uncover Insights #4152

kimberlytanyh commented Mar 12, 2023 •

edited by Kphalguni

Loading

kimberlytanyh commented Mar 12, 2023

ExperimentsInHonesty commented Mar 16, 2023

kimberlytanyh commented Mar 19, 2023 •

edited

Loading

kimberlytanyh commented Mar 31, 2023

ExperimentsInHonesty commented Apr 6, 2023

kimberlytanyh commented Apr 6, 2023

kimberlytanyh commented Apr 15, 2023

kimberlytanyh commented Apr 16, 2023 •

edited

Loading

kimberlytanyh commented Jun 10, 2023 •

edited

Loading

github-actions bot commented Jun 30, 2023

kimberlytanyh commented Jul 2, 2023

mayankt153 commented Sep 22, 2024 •

edited

Loading

Analysis: Collecting Data on Issue Completion per Prework Author and Creating Looker Dashboards to Uncover Insights #4152

Analysis: Collecting Data on Issue Completion per Prework Author and Creating Looker Dashboards to Uncover Insights #4152

Comments

kimberlytanyh commented Mar 12, 2023 • edited by Kphalguni Loading

Dependency

Overview

Action Items

Might be separated into another issue

Resources/Instructions

kimberlytanyh commented Mar 12, 2023

ExperimentsInHonesty commented Mar 16, 2023

kimberlytanyh commented Mar 19, 2023 • edited Loading

kimberlytanyh commented Mar 31, 2023

ExperimentsInHonesty commented Apr 6, 2023

Why?

What you need to know

kimberlytanyh commented Apr 6, 2023

kimberlytanyh commented Apr 15, 2023

kimberlytanyh commented Apr 16, 2023 • edited Loading

kimberlytanyh commented Jun 10, 2023 • edited Loading

github-actions bot commented Jun 30, 2023

kimberlytanyh commented Jul 2, 2023

mayankt153 commented Sep 22, 2024 • edited Loading

kimberlytanyh commented Mar 12, 2023 •

edited by Kphalguni

Loading

kimberlytanyh commented Mar 19, 2023 •

edited

Loading

kimberlytanyh commented Apr 16, 2023 •

edited

Loading

kimberlytanyh commented Jun 10, 2023 •

edited

Loading

mayankt153 commented Sep 22, 2024 •

edited

Loading