-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create computed nCommit & nAuthor per month by project #25
Comments
OK, this may be a good candidate for CH as the number of records would exceed billion? |
Also, what about number of blobs, types of files changed, APIs involved? |
Number and type of files changed, along with blobs and API's, would be very useful! I propose a format here, in an existing clickhouse table conversation/issue, since this is now a clickhouse related issue. |
/da1_data/basemaps/gz/P2mncFullS*.s |
This basemap is wonderful, thank you! My group from the hackathon, Inflection Points, is interested in continuing research. We are interested in rootFork projects, though, which are often not the deforked (P) project. Blobs changed would also be a great metric to have. Could the map be expanded to include both of these? |
Take a look at new P_metadata.S in mongodb: it might be good for sampling, though it does not have monthly blobs. You can always find P for rootFork via p2P map. Otherwise these projects should be nearly identical. Do you have instances where they are not? |
Is WoC.P_metadata.S an improved version of WoC.proj_metadata.S? They appears to have similar fields, with the addition of monthly commit count and core size. I have questions about these additions, though.
I'll take it to be true that P is similar enough to the RootFork project that computing time series for both is not necessary. |
|
@audrism
When analyzing a project's activity over time, commit counts and unique author counts per month (not counting one author multiple time if they have made multiple commits) are great metrics. Could we pre-compute this metric in one of our databases to allow for a query of
Project.month.nCommits
andProject.month.nAuthors
?The text was updated successfully, but these errors were encountered: