Is a tool to retrieve data from Github for our repositories and aggregate data on pull requests.
- Add a line in
gradle.properties
with a valid github access token with the permissions to read the repositories data
githubAccessToken=1234567890asdfgh
- Set the list of repositories to scan in
gradle.properties
- run
./gradlew runDefault
- the data will be generated in the folder
data
There are three levels of data collected or aggregated:
_all.csv
contains all the aggregated data, over the entire period of observation, 11 weeks, per repository
In each repository folder, repository_name.csv
contains the aggregated data per week
In each repository folder, repository_name_week.csv
contains the raw data point with stats of the specific pull request
All the times are expressed in days, 24h, so 0.5 means 12 hours. Every time measurement has already been cleaned by weekends, but night hours are still present