Skip to content
This repository has been archived by the owner on Nov 10, 2024. It is now read-only.

Feature suggestion: include a dataset with some tweet IDs #279

Closed
perrystephenson opened this issue Aug 12, 2018 · 1 comment
Closed

Feature suggestion: include a dataset with some tweet IDs #279

perrystephenson opened this issue Aug 12, 2018 · 1 comment

Comments

@perrystephenson
Copy link

I recently scraped about 400,000 tweets about R (including all tweets about #rstats) and it turned out to be a really cool dataset - lots of interesting things to find. https://perrystephenson.me/2018/08/11/analysing-the-r-twitterverse/

Lots of people have asked me to share the data - I'm almost certainly not allowed to, but I understand that Twitter allows people to share Tweet IDs, which you can then use to grab all the details through the API.

So, the suggestion is that you could include the following in the package:

  1. A dataset of about 400,000 tweet IDs (under 4MB so it should be fine with CRAN)
  2. A modifications to the lookup_statuses() function so that it automatically backs off and waits when it hits the API limit, so that users can retrieve the entire dataset in a single call

I'd be happy to do all the work and raise a PR, I just wanted to check whether this fits in with your vision for the package before I spend time doing it :)

@hadley
Copy link
Collaborator

hadley commented Feb 27, 2021

I think this would be a better fit for a separate package. That way rtweet can stay lean and mean and focussed on wrapping the twitter API.

@hadley hadley closed this as completed Feb 27, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

3 participants