The following raw data files were downloaded from the U.S. Census Bureau and pre-processed in the "processing" notebook, 01_processing.ipynb, prior to analysis:
For the 2010-18 period (all counties downloaded)
For the 2000-10 period (only Maryland counties downloaded)
For the 1990-99 period (all counties downloaded)
The Baltimore Sun analyzed county-level population estimates by age, sex and race/ethnicity for a story published June 19, 2019, titled "Baltimore's white population swells with millennials, resembling D.C., Brooklyn".
The Sun's findings and analysis are available in the "analysis" notebook in this repository: 02_analysis.ipynb. The pre-processing code is in the "processing" notebook in this repository: 01_processing.ipynb.
The raw datasets are saved in the input folder. The cleaned files are saved in the output folder. Note that these files are compressed as input/inputdata.zip and output/outputdata.zip and will need to be unzipped prior to analysis.
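If you would rather unzip the archives from Python than by hand, the sketch below shows one way to do it with the standard library. It is not part of the Sun's notebooks: the helper name extract_archive is hypothetical, and extracting each archive into the folder it sits in is an assumption about the layout the notebooks expect.

```python
import zipfile
from pathlib import Path

# Archive paths as named in this README.
ARCHIVES = [Path("input/inputdata.zip"), Path("output/outputdata.zip")]


def extract_archive(archive, dest=None):
    """Extract one zip archive and return the names of its members.

    Hypothetical helper. By default the files are extracted next to the
    archive, which is an assumption about where the notebooks look for them.
    """
    archive = Path(archive)
    dest = Path(dest) if dest is not None else archive.parent
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(dest)
        return zf.namelist()


if __name__ == "__main__":
    # Run from the repository root so the relative paths resolve.
    for archive in ARCHIVES:
        if archive.exists():
            names = extract_archive(archive)
            print(f"{archive}: extracted {len(names)} entries")
        else:
            print(f"{archive}: not found, skipping")
```

If the notebooks expect a different layout, pass a dest argument to extract_archive rather than moving files afterward.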
There are many angles to explore with this data, beyond just the ones we looked into for our story.
Have a question or something to contribute? Send us a pull request or contact us on Twitter @baltsundata (https://twitter.com/baltsundata) or via email.
You can also fork a copy of this repo to your own account.
All code in this repository is available under the MIT License. The data files are available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.