Skip to content

📦 A curated list of JSON / BSON datasets from the web in order to practice / use in MongoDB

Notifications You must be signed in to change notification settings

Lavanya-Anbalagan/mongodb-json-files

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 

Repository files navigation

MongoDB JSON Data

A dedicated repository that collects collections to practice/use in MongoDB.

List of small datasets

Name Size Data type How to import

Tweets

610 Ko

zip → dump folder

mongorestore

Zips

3.1 Mo

JSON

mongoimport

Palbum

731 Ko

zip → JSON files

mongoimport

Grades

92 Ko

JSON

mongoimport

Students

35 Ko

JSON

mongoimport

Profiles

454 Ko

JSON

mongoimport

Products

2.8 Ko

JSON

mongoimport

Countries small

329 Ko

JSON

mongoimport

Countries big

2.3 Mo

JSON

mongoimport

Restaurants

666 Ko

JSON

mongoimport

Covers

470 Ko

JSON

mongoimport

Books

525 Ko

JSON

mongoimport

List of bigger datasets

Name Size Data type How to import

People

21 Mo

zip → dump gzip

mongorestore --gzip

City inspections

24 Mo

JSON

mongoimport

Companies

75 Mo

JSON

mongoimport

Stocks

85 Mo

zip → dump folder

mongorestore

Trades

232 Mo

JSON

mongoimport

Enron

55 Mo

RAR (named .zip for confusion) → dump folder

mongorestore

List of other dataset

Name Size Data type

Enron

423 Mo

Email server tarball (slow DL server)

Import in MongoDB

Use the import.sh script provided to insert the "small" and the "bigger" datasets. You can see the help and the options with import.sh --help.

Current features:

  • Docker support: starts a MongoDB automatically in Docker for you.

  • Only insert the smallest dataset for a quick data import with --small (cool for live demos).

Requirements:

  • Docker if you use the docker option.

  • MongoDB (mongoimport, mongorestore)

  • unzip

  • unrar (for the Enron dataset)

Contributing

Feel free to make a pull request to add your collection files into the list.

License

http://creativecommons.org/publicdomain/zero/1.0/

About

📦 A curated list of JSON / BSON datasets from the web in order to practice / use in MongoDB

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Shell 100.0%