[Bug][Module Name] Enterprise Github Plugin Collects only X number of Data - Devlake Version: v1.0.1@e061ef2 #8131

Closed
2 of 3 tasks
BhoopalanI opened this issue Oct 7, 2024 · 3 comments
Assignees
Labels
component/plugins This issue or PR relates to plugins severity/p1 This bug affects functionality or significantly affect ux type/bug This issue is a bug

Comments

@BhoopalanI

Search before asking

  • I had searched in the issues and found no similar issues.

What happened

I have a GitHub repo with more than 1200 PRs, and I'm trying to load its data into my DevLake DB. However, DevLake collects only the most recent 800 PRs; the oldest 400+ PRs aren't collected. I verified this by querying the DB with the following queries:

select * from _raw_github_api_pull_requests;

select * from pull_requests;
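
A quicker way to run the same check is to ask for the row count and the oldest creation date instead of selecting every row. The sketch below is only an illustration: the helper name `summarize_pull_requests` is hypothetical, the `created_date` column is an assumption about the `pull_requests` schema, and the demo uses an in-memory SQLite database with made-up rows rather than a real DevLake DB.

```python
import sqlite3


def summarize_pull_requests(conn):
    """Return (row_count, oldest_created_date) for the pull_requests table.

    Assumes a DB-API connection and a `created_date` column; both are
    assumptions for illustration, so adjust them to the actual schema.
    """
    cur = conn.cursor()
    cur.execute("SELECT COUNT(*), MIN(created_date) FROM pull_requests")
    count, oldest = cur.fetchone()
    return count, oldest


# Self-contained demo: in-memory SQLite database with made-up rows.
demo = sqlite3.connect(":memory:")
demo.execute("CREATE TABLE pull_requests (id TEXT, created_date TEXT)")
demo.executemany(
    "INSERT INTO pull_requests VALUES (?, ?)",
    [("pr-1", "2024-04-10"), ("pr-2", "2024-06-01"), ("pr-3", "2024-09-30")],
)
count, oldest = summarize_pull_requests(demo)
print(count, oldest)
```

If the oldest date returned sits at a suspiciously round cutoff, that suggests a date filter rather than a row limit.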

I have also attached the log of the Github data collection.

Version: v1.0.1@e061ef2
oct_3_local.zip

What do you expect to happen

The GitHub plugin should collect all the data, from the oldest to the most recent.

How to reproduce

Collect data from a large repository and run the queries above. The older data is not collected.

Anything else

No response

Version

v1.0.1@e061ef2

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Code of Conduct

@BhoopalanI BhoopalanI added the type/bug This issue is a bug label Oct 7, 2024

dosubot bot commented Oct 7, 2024

Hello @BhoopalanI! 👋 I'm here to help you debug, answer questions, and even guide you on becoming a contributor.

A similar issue was previously reported and closed. The problem with the Enterprise Github Plugin collecting only the most recent 800 pull requests is due to the API request configuration. The direction parameter is set to desc, which retrieves pull requests in descending order, starting from the most recent. The PageSize is set to 100, and the pagination logic likely limits the total number of pull requests collected to 800 due to the number of pages processed or a similar constraint [1].


@dosubot dosubot bot added component/plugins This issue or PR relates to plugins severity/p1 This bug affects functionality or significantly affect ux labels Oct 7, 2024
@klesh
Contributor

klesh commented Oct 8, 2024

Please check whether the missing PRs were created earlier than the Time After value set in the Sync policy section of your project.
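
To illustrate why a Time After cutoff can look like a row limit, here is a minimal sketch. It is not DevLake's actual code: `split_by_time_after` is a hypothetical helper, the dates are made up, and treating the cutoff as inclusive is an assumption.

```python
from datetime import datetime, timezone


def split_by_time_after(created_dates, time_after):
    """Partition PR creation times into (kept, skipped) around a cutoff.

    Hypothetical helper: it mimics a "collect only items created on or
    after Time After" rule. The inclusive boundary is an assumption.
    """
    kept = [d for d in created_dates if d >= time_after]
    skipped = [d for d in created_dates if d < time_after]
    return kept, skipped


# Made-up cutoff and PR creation times, all in UTC.
cutoff = datetime(2024, 4, 1, tzinfo=timezone.utc)
prs = [
    datetime(2023, 11, 5, tzinfo=timezone.utc),
    datetime(2024, 3, 31, tzinfo=timezone.utc),
    datetime(2024, 4, 1, tzinfo=timezone.utc),
    datetime(2024, 9, 30, tzinfo=timezone.utc),
]
kept, skipped = split_by_time_after(prs, cutoff)
print(len(kept), len(skipped))
```

Under this rule, every PR older than the cutoff is skipped regardless of how many there are, so moving Time After earlier brings the older PRs back into scope.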

@BhoopalanI
Author

BhoopalanI commented Oct 9, 2024

@klesh It works after changing the Time After. Thank you. This can be considered closed.
