Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bigquery: Add missing response bodies data for 2020 November-2021 June (Desktop and Mobile) #592

Closed
gezmondo-cyber opened this issue May 8, 2022 · 4 comments

Comments

@gezmondo-cyber
Copy link

The httparchive/response_bodies tables on Bigquery should be monthly for 2020-2021.
Several tables for the period 2020.11-2021.06 seem to be missing.

Could you please upload them to Bigquery?

image

@rviscomi
Copy link
Member

rviscomi commented May 9, 2022

Thanks for filing this! Regenerating the response_bodies tables should be doable, but given the ongoing infrastructure work it may be lower priority.

@HTTPArchive HTTPArchive deleted a comment from Jayjay1488 Jul 3, 2023
@HTTPArchive HTTPArchive deleted a comment from Jayjay1488 Jul 3, 2023
@max-ostapenko
Copy link
Contributor

@rviscomi this is still relevant. Do we skip it in the backfill?
If yes, probably we should just close it after 2 years?

@max-ostapenko
Copy link
Contributor

And do you remember why did the number of rows explode there?
Screenshot 2024-10-19 at 02 19 30

@tunetheweb
Copy link
Member

I believe images were accidentally included for a period instead of just text response bodies. We shoudl exclude these in the port over to all.

See also: #225

@max-ostapenko max-ostapenko closed this as not planned Won't fix, can't repro, duplicate, stale Nov 3, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants