Collection of datasets for HUB 22 – Visual Perspectives in Science
UFO sightings are from NUFORC, an organisation investigating UFO sightings in the US.
The Orange data frame has 35 rows and 3 columns of records of the growth of orange trees.
Taken from Movebank, which provides (as I write) data associated with around 50 published articles using location data of a range of different animals.
I've included one dataset focused on kestrels, which I accessed from this page in the Movebank Data Repository which should be cited so:
Hernández-Pliego J, Rodríguez C, Bustamante J (2015) Why do kestrels soar? PLOS ONE. 10(12): e0145402. doi:10.1371/journal.pone.0145402
Hernández-Pliego J, Rodriguez C, Bustamante J (2015) Data from: Why do kestrels soar? Movebank Data Repository. doi:10.5441/001/1.sj8t3r11
It's in folder "kestrels"
To be cited as
Bartlam-Brooks HLA, Beck PSA, Bohrer G, Harris S (2013) In search of greener pastures—using satellite images to predict the effects of environmental change on zebra migration. Journal of Geophysical Research: Biogeosciences v 188, p 1–11. doi:10.1002/jgrg.20096
Bartlam-Brooks HLA, Harris S (2013) Data from: In search of greener pastures: using satellite images to predict the effects of environmental change on zebra migration. Movebank Data Repository. doi:10.5441/001/1.f3550b4f
to be found in folder "Zebras"
This data, describing sitings of elasmobranch fish i.e. sharks and rays is in the file Reef_Life_Survey_Global_reef_fish_dataset_Elasmobranch.csv . This was obtained from the Reef Life Survey website, which we came across when reading the article describing this data in the Nature Publishing Group journal Scientific Data. This file is in the directory "Elasmobranches". It describes locations of sitings of such fish across the world.
In the GalaxyZoo folder, is a file containing information about the images (galaxyData.csv) in the images folder
Found via http://www.galaxyzoo.org/ which pointed me to http://www.sdss.org/dr12./ described in this article http://iopscience.iop.org/article/10.1088/0067-0049/219/1/12/meta;jsessionid=F71FE985792C4CE9913D141C0589FE4C.c2.iopscience.cld.iop.org
I looked through images via the galaxyzoo website classifier link, and pulled out and saved info on images that are relatively different from each other i.e. I wanted to find a small data set which showed lots of different kinds of galaxy morphologies.
This comes from http://www.londonmapper.org.uk, and is in the LondonBoroughs folder.
The map showing where the boroughs are (LondonmapperBasemap.jpg) I got from http://www.londonmapper.org.uk/maps/reference-map/#basemap and is with a CC BY-NC-ND 3.0 – Attribution-NonCommercial-NoDerivs 3.0 Unported license http://creativecommons.org/licenses/by-nc-nd/3.0/
Data tables provided in Excel and tab separated format are:
Carbon emissions per borough http://www.londonmapper.org.uk/maps/environment-and-travel/b-2011-carbonemissions/
Numbers of stag beetles sighted per borough http://www.londonmapper.org.uk/maps/environment-and-travel/b-2014-glnpstagbeetle/
Population per borough http://www.londonmapper.org.uk/maps/population/b-2011-population/
Wealthy households per borough http://www.londonmapper.org.uk/maps/poverty-and-wealth/b-2010-wealthy/
People making day trips to visit per borough http://www.londonmapper.org.uk/maps/environment-and-travel/b-2012-daytripvisitors/
[http://vis.oobrien.com/tube/#metric=total&year=2014&layers=TTTTT&zoom=12&lon=-0.1059&lat=51.5283](London Tube Data Map), an interactive online map that lets you query Tube usage data and display it in many different ways. Note - the London Underground train lines are collectively referred to often as "The Tube"
Transport for London (TfL) provides data on a range of different aspects of public transport usage in London. This page has links to counts of customer entry and exit at all the stations, separated into weekdays, Saturday, and Sunday. The file in the LondonUnderground folder is for a weekday from 2010 counts-entries-10-weekday-sample.csv
Elsewhere on the TfL site is a link to Entry and exit figures by year in Excel format: multi-year-station-entry-and-exit-figures.xls found at this link
The Tube Map is in the LondonUnderground folder in the file large-print-tube-map.pdf taken from this link found on this page.
Data DataRecord_1b_DLC_LH_Table_Analysis_06Jun14-1.csv was found after reading this article in Scientific Data i.e. Zehr SM, Roach RG, Haring D, Taylor J, Cameron FH, Yoder AD (2014) Life history profiles for 27 strepsirrhine primate taxa generated using captive data from the Duke Lemur Center. Scientific Data 1: 140019. http://dx.doi.org/10.1038/sdata.2014.19
The files we include are taken from this record in the DRYAD database and are storred together with a readme file describing its content. Also included is a PDF of the Scientific Data article describing the content sdata201419.pdf
This content is found in the direcory "Lemurs"
The World Bank provides a wealth of economic data. In the directory "WorldBankData" you find world-wide data on womens's fertility, high technology exports, intellectual property and income shares held by the lower and upper 20% of the population.
The data was extracted from the 1974 Motor Trend US magazine, and comprises fuel consumption and 10 aspects of automobile design and performance for 32 automobiles (1973-74 models).
This rather gruesome dataset is taken from the Serial Killer Information Center of Radford University.
Taken from R's Ecdat package. People are always interested in such things.
Source: Fair, R. (1977) “A note on the computation of the tobit estimator”, Econometrica, 45, 1723-1727.
The file "2db3.pdb.gz" in the directory 3DMacromolecularStructure is a single protein, with two domains, including also RNA and an ATP analog. You could download the file from the PDB here.
You could download and install a 3D macromolecular structure viewer such as Chimera to visualise the PDB file. Here is the Chimera download page
This structure is described in (Sengoku et al, 2006](http://www.ncbi.nlm.nih.gov/pubmed/16630817). The pdf of this paper is included in the directory 3DMacromolecularStructure.
Suggested visualisation: try to highlight the residues important for binding to RNA and ATP.
Smoking, Alcohol and (O)esophageal Cancer
Data from a case-control study of (o)esophageal cancer in Ille-et-Vilaine, France.
A data frame with records for 88 age/alcohol/tobacco combinations.