django-cities provides you with place related models (eg. Country, Region, City) and data (from GeoNames) that can be used in your django projects.
This package officially supports all currently supported versions of Python/Django:
Python | 2.7 | 3.4 | 3.5 | 3.6 | 3.7 |
---|---|---|---|---|---|
Django 1.7 | ❌ | ❌ | ❌ | ❌ | ❌ |
Django 1.8 | ✅ | ✅ | ✅ | 🔵 | 🔵 |
Django 1.9 | ✅ | ✅ | ✅ | 🔵 | 🔵 |
Django 1.10 | ✅ | ✅ | ✅ | 🔵 | 🔵 |
Django 1.11 | ✅ | ✅ | ✅ | ✅ | ✅ |
Django 2.0 | ❌ | ✅ | ✅ | ✅ | ✅ |
Django 2.1 | ❌ | ❌ | ✅ | ✅ | ✅ |
Key | |
---|---|
✅ | Officially supported, tested, and passing |
🔵 | Tested and passing, but not officially supported |
❌ | Known incompatibilities |
Authored by Ben Dowling, and some great contributors.
See some of the data in action at city.io and country.io.
- Requirements
- Installation
- Configuration
- Writing Plugins
- Examples
- Third Party Apps/Extensions
- TODO
- Notes
- Running Tests
- Release Notes
Your database must support spatial queries, see the GeoDjango documentation for details and setup instructions.
Clone this repository into your project:
git clone https://github.com/coderholic/django-cities.git
Download the zip file and unpack it:
wget https://github.com/coderholic/django-cities/archive/master.zip
unzip master.zip
Install with pip:
pip install django-cities
You'll need to enable GeoDjango. See that documentation for guidance.
You'll need to add cities
to INSTALLED_APPS
in your projects settings.py
file:
INSTALLED_APPS = (
# ...
'cities',
# ...
)
These settings should be reviewed and set or modified BEFORE any migrations have been run.
Some users may wish to override some of the default models to add data, override default model methods, or add custom managers. This project supports swapping models out using the django-swappable-models project.
To swap models out, first define your own custom model in your custom cities app. You will need to subclass the appropriate base model from cities.models
:
Here's an example my_cities_app/models.py
:
from django.db import models
from cities.models import BaseCountry
class CustomCountryModel(BaseCountry, models.Model):
more_data = models.TextField()
class Meta(BaseCountry.Meta):
pass
Then you will need to configure your project by setting the appropriate option:
Model | Setting Name | Default Value |
---|---|---|
Continent | CITIES_CONTINENT_MODEL |
cities.Continent |
Country | CITIES_COUNTRY_MODEL |
cities.Country |
City | CITIES_CITY_MODEL |
cities.City |
So to use the CustomCountryModel
we defined above, we would add the dotted model string to our project's settings.py
:
# ...
CITIES_COUNTRY_MODEL = 'my_cities_app.CustomCountryModel'
# ...
The dotted model string is simply the dotted import path with the .models
substring removed, just <app_label>.<model_class_name>
.
Once you have set the option in your settings.py
, all appropriate foreign keys in django-cities will point to your custom model. So in the above example, the foreign keys Region.country
, City.country
, and PostalCode.country
will all automatically point to the CustomCountryModel
. This means that you do NOT need to customize any dependent models if you don't want to.
The Geonames data for alternative names contain additional information, such as links to external websites (mostly Wikipedia articles) and pronunciation guides (pinyin). However, django-cities only uses and imports a subset of those types. Since some users may wish to use them all, the CITIES_ALTERNATIVE_NAME_TYPES
and CITIES_AIRPORT_TYPES
settings can be used to define the alternative name types in the database.
These settings should be specified as a tuple of tuple choices:
CITIES_AIRPORT_TYPES = (
('iata', _("IATA (Airport) Code")),
('icao', _("ICAO (Airport) Code")),
('faac', _("FAAC (Airport) Code")),
)
CITIES_ALTERNATIVE_NAME_TYPES = (
('name', _("Name")),
('abbr', _("Abbreviation")),
('link', _("Link")),
)
If CITIES_INCLUDE_AIRPORT_CODES
is set to True
, the choices in CITIES_AIRPORT_TYPES
will be appended to the CITIES_ALTERNATIVE_NAME_TYPES
choices. Otherwise, no airport types are imported.
The Geonames data also contains alternative names that are purely numeric.
The CITIES_INCLUDE_NUMERIC_ALTERNATIVE_NAMES
setting controls whether or not purely numeric alternative names are imported. Set to True
to import them, and to False
to skip them.
Since continent data rarely (if ever) changes, the continent data is loaded directly from Python data structures included with the django-cities distribution. However, there are different continent models with different numbers of continents. Therefore, some users may wish to override the default settings by setting the CITIES_CONTINENT_DATA
to a Python dictionary where the keys are the continent code and the values are (name, geonameid) tuples.
For an overview of different continent models, please see the Wikipedia article on Continents:
https://en.wikipedia.org/wiki/Continent#Number
The following is the default continent data in cities/conf.py
:
CITIES_CONTINENT_DATA = {
'AF': ('Africa', 6255146),
'AS': ('Asia', 6255147),
'EU': ('Europe', 6255148),
'NA': ('North America', 6255149),
'OC': ('Oceania', 6255151),
'SA': ('South America', 6255150),
'AN': ('Antarctica', 6255152),
}
Note that if you do not use these default settings, you will need to register a plugin with a country_pre
method to adjust the continent ID for country models before countries are processed and saved to the database by the import script. Please contribute your plugin back upstream to this project so that others may benefit from your work by creating a pull request containing your plugin and any relevant documentation for it.
After you have configured all migration settings, run
python manage.py migrate cities
to create the required database tables and add the continent data to its table.
These settings should also be reviewed and set or modified before importing any data. Changing these settings after importing data may not have the intended effect.
Specify a download directory (used to specify a writable directory).
Default: cities/data
You may want to use this if you are on a cloud services provider, or if django-cities is installed on a read-only medium.
Note that this path must be an absolute path.
CITIES_DATA_DIR = '/var/data'
You can override the files the import command uses to process data:
CITIES_FILES = {
# ...
'city': {
'filename': 'cities1000.zip',
'urls': ['http://download.geonames.org/export/dump/'+'{filename}']
},
# ...
}
It is also possible to specify multiple filenames to process. Note that these files are processed in the order they are specified, so duplicate data in files specified later in the list will overwrite data from files specified earlier in the list.
CITIES_FILES = {
# ...
'city': {
'filenames': ["US.zip", "GB.zip", ],
'urls': ['http://download.geonames.org/export/dump/'+'{filename}']
},
# ...
}
Note that you do not need to specify all keys in the CITIES_FILES
dictionary. Any keys you do not specify will use their default values as defined in cities/conf.py
.
The Geonames data includes currency data, but it is limited to the currency code (example: "USD") and the currency name (example: "Dollar"). The django-cities package offers the ability to import currency symbols (example: "$") with the country model.
However, like the continent data, since this rarely changes, the currency symbols are loaded directly from Python data structures included with the django-cities distribution in the CITIES_CURRENCY_SYMBOLS
setting. Users can override this setting if they wish to add or modify the imported currency symbols.
For default values see the included cities/conf.py
file.
CITIES_CURRENCY_SYMBOLS = {
"AED": "د.إ", "AFN": "؋", "ALL": "L", "AMD": "դր.", "ANG": "ƒ", "AOA": "Kz",
"ARS": "$", "AUD": "$", "AWG": "ƒ", "AZN": "m",
"BAM": "KM", "BBD": "$", "BDT": "৳", "BGN": "лв", "BHD": "ب.د", "BIF": "Fr",
# ...
"UAH": "₴", "UGX": "Sh", "USD": "$", "UYU": "$", "UZS": "лв",
The Geonames data includes countries that no longer exist. At this time, those countries are the Dutch Antilles (AN
) and Serbia and Montenegro (CS
). If you wish to import those countries, set the CITIES_NO_LONGER_EXISTENT_COUNTRY_CODES
to an empty list ([]
).
Default: ['CS', 'AN']
CITIES_NO_LONGER_EXISTENT_COUNTRY_CODES = ['CS', 'AN']
The Geonames data contains country postal code formats and regular expressions, as well as postal codes. Some of these postal codes do not match the regular expression of their country. Users who wish to ignore invalid postal codes when importing data can set the CITIES_VALIDATE_POSTAL_CODES
setting to True
to skip importing postal codes that do not validate the country postal code regular expression.
If you have regional knowledge of postal codes that do not validate, please either update the postal code itself or the country postal codes regular expression on the Geonames website. Doing this will help all Geonames users (including this project but also every other Geonames user).
CITIES_VALIDATE_POSTAL_CODES = True
You may wish to customize the slugs generated by django-cities. To do so, you will need to write your own slugify()
function and specify its dotted import path in the CITIES_SLUGIFY_FUNCTION
:
CITIES_SLUGIFY_FUNCTION = 'cities.util.default_slugify'
Your customized slugify function should accept two arguments: the object itself and the slug generated by the object itself. It should return the final slug as a string.
Because the slugify function contains code that would be reused by multiple objects, there is only a single slugify function for all of the objects in django-cities. To generate different slugs for different types of objects, test against the object's class name (obj.__class__.__name__
).
Default slugify function (see cities/util.py
):
# SLUGIFY REGEXES
to_und_rgx = re.compile(r"[']", re.UNICODE)
slugify_rgx = re.compile(r'[^-\w._~]', re.UNICODE)
multi_dash_rgx = re.compile(r'-{2,}', re.UNICODE)
dash_und_rgx = re.compile(r'[-_]_', re.UNICODE)
und_dash_rgx = re.compile(r'[-_]-', re.UNICODE)
starting_chars_rgx = re.compile(r'^[-._]*', re.UNICODE)
ending_chars_rgx = re.compile(r'[-._]*$', re.UNICODE)
def default_slugify(obj, value):
if value is None:
return None
value = force_text(unicode_func(value))
value = unicodedata.normalize('NFKC', value.strip())
value = re.sub(to_und_rgx, '_', value)
value = re.sub(slugify_rgx, '-', value)
value = re.sub(multi_dash_rgx, '-', value)
value = re.sub(dash_und_rgx, '_', value)
value = re.sub(und_dash_rgx, '_', value)
value = re.sub(starting_chars_rgx, '', value)
value = re.sub(ending_chars_rgx, '', value)
return mark_safe(value)
Note: This used to be CITIES_IGNORE_EMPTY_REGIONS
.
Some cities in the Geonames data files do not have region information. By default, these cities are imported as normal (they still have foreign keys to their country), but if you wish to avoid importing these cities, set CITIES_SKIP_CITIES_WITH_EMPTY_REGIONS
to True
:
# Import cities without region (default False)
CITIES_SKIP_CITIES_WITH_EMPTY_REGIONS = True
Limit imported alternative names by languages/locales
Note that many alternative names in the Geonames data do not specify a language code, so if you manually specify language codes and do not include und
, you may not import as many alternative names as you want.
Special values:
ALL
- import all alternative namesund
- alternative names that do not specify a language code. When imported, these alternative names will be assigned a language code ofund
. If this language code is not specified, alternative names that do not specify a language code are not imported.LANGUAGES
- a "shortcut" to import all alternative names specified in theLANGUAGES
setting in your Django project'ssettings.py
For a full list of ISO639-1 language codes, see the iso-languagecodes.txt file on Geonames.
CITIES_LOCALES = ['en', 'und', 'LANGUAGES']
Limit the imported postal codes to specific countries
Special value:
ALL
- import all postal codes
CITIES_POSTAL_CODES = ['US', 'CA']
You can write your own plugins to process data before and after it is written to the database. See the section on Writing Plugins for details.
To activate plugins, you need to add their dotted import strings to the CITIES_PLUGINS
option. This example activates the postal_code_ca
and reset_queries
plugins that come with django-cities:
CITIES_PLUGINS = [
# Canadian postal codes need region codes remapped to match geonames
'cities.plugin.postal_code_ca.Plugin',
# Reduce memory usage when importing large datasets (e.g. "allCountries.zip")
'cities.plugin.reset_queries.Plugin',
]
Note that some plugins may use their own configuration options:
# This setting may be specified if you use 'cities.plugin.reset_queries.Plugin'
CITIES_PLUGINS_RESET_QUERIES_CHANCE = 1.0 / 1000000
After you have configured all import settings, run
python manage.py cities --import=all
to import all of the place data.
You may also import specific object types:
python manage.py cities --import=country
python manage.py cities --import=city
NOTE: This can take a long time, although there are progress bars drawn in the terminal.
Specifically, importing postal codes can take one or two orders of magnitude more time than importing other objects.
You can write plugins that modify data before and after it is processed by the import script. For example, you can use this to adjust the continent a country belongs to, or you can use it to add or modify any additional data if you customize and override any django-cities models.
A plugin is simply a Python class that has implemented one or more hook functions as members. Hooks can either modify data before it is processed by the import script, or modify the database after the object has been saved to the database by the import script. By raising cities.conf.HookException
, plugins can skip one piece of data.
Here's a table of all available hooks:
Model | Pre Hook Name | Post Hook Name |
---|---|---|
Country |
country_pre |
country_post |
Region |
region_pre |
region_post |
Subregion |
subregion_pre |
subregion_post |
City |
city_pre |
city_post |
District |
district_pre |
district_post |
PostalCode |
postal_code_pre |
postal_code_post |
AlternativeName |
alt_name_pre |
alt_name_post |
The argument signatures for _pre
hooks and _post
hooks differ. All _pre
hooks have the following argument signature:
class ...Plugin(object):
model_pre(self, parser, item)
whereas all _post
hooks also have the saved model instance available to them:
class ...Plugin(object):
model_post(self, parser, <model>_instance, item)
Arguments passed to hooks:
self
- the plugin object itselfparser
- the instance of thecities.Command
management command<model>_instance
- instance of model that was created based onitem
item
- Python dictionary with data for row being processed
Note that the argument names are simply conventions, you are free to rename them to whatever you wish as long as you keep their order.
Here is a complete skeleton plugin class example:
class CompleteSkeletonPlugin(object):
"""
Skeleton plugin for django-cities that has hooks for all object types, and
does not modify any import data or existing objects in the database.
"""
# Note: Only ONE of these methods needs to be defined. If a method is not
# defined, the import command will avoid calling the undefined method.
def country_pre(self, parser, imported_data_dict):
pass
def country_post(self, parser, country_instance, imported_data_dict):
pass
def region_pre(self, parser, imported_data_dict):
pass
def region_post(self, parser, region_instance, imported_data_dict):
pass
def subregion_pre(self, parser, imported_data_dict):
pass
def subregion_post(self, parser, subregion_instance, imported_data_dict):
pass
def city_pre(self, parser, imported_data_dict):
pass
def city_post(self, parser, city_instance, imported_data_dict):
pass
def district_pre(self, parser, imported_data_dict):
pass
def district_post(self, parser, district_instance, imported_data_dict):
pass
def alt_name_pre(self, parser, imported_data_dict):
pass
def alt_name_post(self, parser, alt_name_instance, imported_data_dict):
pass
def postal_code_pre(self, parser, imported_data_dict):
pass
def postal_code_post(self, parser, postal_code_instance, imported_data_dict):
pass
Silly example:
from cities.conf import HookException
class DorothyPlugin(object):
"""
This plugin skips importing cities that are not in Kansas, USA.
There's no place like home.
"""
def city_pre(self, parser, import_dict):
if import_dict['cc2'] == 'US' and import_dict['admin1Code'] != 'KS':
raise HookException("Ignoring cities not in Kansas, USA") # Raising a HookException skips importing the item
else:
# Modify the value of the data before it is written to the database
import_dict['admin1Code'] = 'KS'
def city_post(self, parser, city, import_data):
# Checks if the region foreign key for the city database row is NULL
if city.region is None:
# Set it to Kansas
city.region = Region.objects.get(country__code='US', code='KS')
# Re-save any existing items that aren't in Kansas
city.save()
Once you have written a plugin, you will need to activate it by specifying its dotted import string in the CITIES_PLUGINS
setting. See the Plugins section for details.
This repository contains an example project which lets you browse the place hierarchy. See the example directory
. Below are some small snippets to show you the kind of queries that are possible once you have imported data:
# Find the 5 most populated countries in the World
>>> Country.objects.order_by('-population')[:5]
[<Country: China>, <Country: India>, <Country: United States>,
<Country: Indonesia>, <Country: Brazil>]
# Find what country the .ly TLD belongs to
>>> Country.objects.get(tld='ly')
<Country: Libya>
# 5 Nearest cities to London
>>> london = City.objects.filter(country__name='United Kingdom').get(name='London')
>>> nearest = City.objects.distance(london.location).exclude(id=london.id).order_by('distance')[:5]
# All cities in a state or county
>>> City.objects.filter(country__code="US", region__code="TX")
>>> City.objects.filter(country__name="United States", subregion__name="Orange County")
# Get all countries in Japanese preferring official names if available,
# fallback on ASCII names:
>>> [country.alt_names_ja.get_preferred(default=country.name) for country in Country.objects.all()]
# Alternate names for the US in English, Spanish and German
>>> [x.name for x in Country.objects.get(code='US').alt_names.filter(language_code='de')]
[u'USA', u'Vereinigte Staaten']
>>> [x.name for x in Country.objects.get(code='US').alt_names.filter(language_code='es')]
[u'Estados Unidos']
>>> [x.name for x in Country.objects.get(code='US').alt_names.filter(language_code='en')]
[u'United States of America', u'America', u'United States']
# Alternative names for Vancouver, Canada
>>> City.objects.get(name='Vancouver', country__code='CA').alt_names.all()
[<AlternativeName: 溫哥華 (yue)>, <AlternativeName: Vankuver (uz)>,
<AlternativeName: Ванкувер (ce)>, <AlternativeName: 溫哥華 (zh)>,
<AlternativeName: वैंकूवर (hi)>, <AlternativeName: Ванкувер (tt)>,
<AlternativeName: Vankuveris (lt)>, <AlternativeName: Fankoever (fy)>,
<AlternativeName: فانكوفر (arz)>, <AlternativeName: Ванкувер (mn)>,
<AlternativeName: ဗန်ကူးဗားမ_ (my)>, <AlternativeName: व्हँकूव्हर (mr)>,
<AlternternativeName: வான்கூவர் (ta)>, <AlternativeName: فانكوفر (ar)>,
<AlternativeName: Vankuver (az)>, <AlternativeName: Горад Ванкувер (be)>,
<AlternativeName: ভ্যানকুভার (bn)>, <AlternativeName: แวนคูเวอร์ (th)>,
<Al <AlternativeName: Ванкувер (uk)>, <AlternativeName: ਵੈਨਕੂਵਰ (pa)>,
'...(remaining elements truncated)...']
# Get zip codes near Mountain View, CA
>>> PostalCode.objects.distance(City.objects.get(name='Mountain View', region__name='California').location).order_by('distance')[:5]
[<PostalCode: 94040>, <PostalCode: 94041>, <PostalCode: 94043>,
<PostalCode: 94024>, <PostalCode: 94022>]
These are apps that build on top of the django-cities
. Useful for essentially extending what django-cities
can do.
- django-airports provides you with airport related model and data (from OpenFlights) that can be used in your Django projects.
In increasing order of difficulty:
- Add tests for the plugins we ship with
- Minimize number of attributes on abstract base models and adjust import script accordingly
- Steal/modify all of the contrib apps from django-contrib-light (Django REST Framework integration, chained selects, and autocomplete)
- Integrate libpostal to extract Country/City/District/Postal Code from an address string
Some datasets are very large (> 100 MB) and take time to download/import.
Data will only be downloaded/imported if it is newer than your data, and only matching rows will be overwritten.
The cities manage command has options, see --help
. Verbosity is controlled through the LOGGING
setting.
-
Install postgres, postgis and libgdal-dev
-
Create
django_cities
database:sudo su -l postgres # Enter your password createuser -d -s -P some_username # Enter password createdb -T template0 -E utf-8 -l en_US.UTF-8 -O multitest django_cities psql -c 'create extension postgis;' -d django_cities
-
Run tests:
POSTGRES_USER=some_username POSTGRES_PASSWORD='password from createuser step' tox # If you have changed example data files then you should push your # changes to github and specify commit and repo variables: TRAVIS_COMMIT=`git rev-parse HEAD` TRAVIS_REPO_SLUG='github-username/django-cities' POSTGRES_USER=some_username POSTGRES_PASSWORD='password from createuser ste' tox
As an alternative to installing and running PostgreSQL system-wide, you can run the tests against a transient Docker instance:
docker run --rm -p 127.0.0.1:5432:5432 mdillon/postgis
TRAVIS_LOG_LEVEL
- defaults toINFO
, but set toDEBUG
to see a (very) large and (very) complete log of the import scriptCITIES_FILES
- set the base urls to afile://
path to use local files without modifying any other settings
Use Django's native migrations
Upgrading from 0.4.1 is likely to cause problems trying to apply a migration when the tables already exist. In this case a fake migration needs to be applied:
python manage.py migrate cities 0001 --fake
** This release of django-cities is not backwards compatible with previous versions **
The country model has some new fields:
- elevation
- area
- currency
- currency_name
- languages
- neighbours
- capital
- phone
Alternative name support has been completely overhauled. The code and usage should now be much simpler. See the updated examples below.
The code field no longer contains the parent code. Eg. the code for California, US is now "CA". In the previous release it was "US.CA".
These changes mean that upgrading from a previous version isn't simple. All of the place IDs are the same though, so if you do want to upgrade it should be possible.