-
-
Notifications
You must be signed in to change notification settings - Fork 3.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add schema for configuration file with yamale #4084
Changes from 25 commits
c5aa08a
f38259d
4c02207
a63d39e
7fc52b9
ab7d8fa
602fcd4
b6dd2a2
584ce5e
e4f5a3a
cd5b5dc
07350b9
00e69d9
f4ddb42
89badbf
7b30c8c
e038046
b2e5dbc
40c35a2
ace5f8e
25e962d
1c3a2fa
b63b016
18a6485
dc60f11
79ebf87
6286a34
31fc800
84d1326
9ccd94b
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,5 @@ | ||
*.db | ||
*.rdb | ||
*.egg-info | ||
*.log | ||
*.pyc | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,180 @@ | ||
"""Validator for the RTD configuration file.""" | ||
|
||
from __future__ import division, print_function, unicode_literals | ||
|
||
from os import path | ||
|
||
import six | ||
import yamale | ||
from yamale.validators import DefaultValidators, Validator | ||
|
||
V2_SCHEMA = path.join( | ||
path.dirname(__file__), | ||
'../rtd_tests/fixtures/spec/v2/schema.yml' | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. We shouldn't have code outside tests that references tests fixtures. This module should probably live under |
||
) | ||
|
||
ALL = 'all' | ||
|
||
DEFAULT_VALUES = { | ||
'formats': [], | ||
'conda': None, | ||
'build': { | ||
'image': '2.0', | ||
}, | ||
'python': { | ||
'version': '3.6', | ||
'requirements': None, | ||
'install': None, | ||
'extra_requirements': [], | ||
'system_packages': False, | ||
}, | ||
'sphinx': { | ||
'configuration': None, | ||
'fail_on_warning': False, | ||
}, | ||
'mkdocs': { | ||
'configuration': None, | ||
'fail_on_warning': False, | ||
}, | ||
'submodules': { | ||
'include': [], | ||
'exclude': [], | ||
'recursive': False, | ||
}, | ||
'redirects': None, | ||
} | ||
|
||
|
||
def flatten(dic, keep_key=False, position=None): | ||
""" | ||
Returns a flattened dictionary | ||
|
||
Given a dictionary of nested dictionaries, it returns | ||
a dictionary with all nested keys at the top level. | ||
|
||
For example | ||
|
||
{ | ||
'key': 'value', | ||
'nested': { | ||
'subkey': 'value' | ||
}, | ||
} | ||
|
||
Becomes | ||
|
||
{ | ||
'key': value, | ||
'nested': ..., | ||
'nested.subkey': 'value' | ||
} | ||
""" | ||
child = {} | ||
|
||
for key, value in dic.items(): | ||
if isinstance(key, six.string_types): | ||
key = key.replace('.', '_') | ||
if position: | ||
item_position = '{}.{}'.format(position, key) | ||
else: | ||
item_position = key | ||
|
||
if isinstance(value, dict): | ||
child.update( | ||
flatten(dic[key], keep_key, item_position) | ||
) | ||
if keep_key: | ||
child[item_position] = value | ||
else: | ||
child[item_position] = value | ||
|
||
return child | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. What is the underlying need for flattening the dictionary? This seems like it could be an unnecessary step. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It makes it easy to manipulate and check the keys, also is better to describe constraints, for example We don't have to use recursion or several nested bucles to do things like that There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It's not clear if this is still required. It seems like a confusing way to do a nested dict merge. I think if we are implementing defaults through yamale there is a tighter integration rather than keeping this data in a dictionary. |
||
|
||
|
||
class PathValidator(Validator): | ||
|
||
""" | ||
Path validator | ||
|
||
Checks if the given value is a string and a existing | ||
file. | ||
""" | ||
|
||
tag = 'path' | ||
constraints = [] | ||
configuration_file = '.' | ||
|
||
def _is_valid(self, value): | ||
if isinstance(value, six.string_types): | ||
file_ = path.join( | ||
path.dirname(self.configuration_file), | ||
value | ||
) | ||
return path.exists(file_) | ||
return False | ||
|
||
|
||
class BuildConfig(object): | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I haven't played with yamale, it's not clear how much of this class is needed to validate our specification in unit tests. If this is more for implementing yamale in our application, we should break this out and save it. |
||
|
||
"""Wrapper object to validate to configuration file.""" | ||
|
||
def __init__(self, configuration_file): | ||
self.configuration_file = configuration_file | ||
self.data = yamale.make_data(self.configuration_file) | ||
self.defaults = flatten(DEFAULT_VALUES, keep_key=True) | ||
self.schema = yamale.make_schema( | ||
V2_SCHEMA, | ||
validators=self._get_validators() | ||
) | ||
|
||
def _get_validators(self): | ||
validators = DefaultValidators.copy() | ||
PathValidator.configuration_file = self.configuration_file | ||
validators[PathValidator.tag] = PathValidator | ||
return validators | ||
|
||
def check_constraints(self): | ||
""" | ||
Check constraints between keys. | ||
|
||
Such as relations of uniquiness and set default values for those. | ||
""" | ||
constraints = { | ||
'Documentation type': { | ||
'unique': ['sphinx', 'mkdocs'], | ||
'default': 'sphinx', | ||
}, | ||
'Submodules': { | ||
'unique': ['submodules.include', 'submodules.exclude'], | ||
'default': 'submodules.include', | ||
} | ||
} | ||
|
||
for subject, constraint in constraints.items(): | ||
present_keys = [ | ||
key | ||
for key in constraint['unique'] | ||
if key in self.data[0] | ||
] | ||
default_key = constraint.get('default') | ||
if not present_keys and default_key: | ||
self.data[0][default_key] = self.defaults[default_key] | ||
elif len(present_keys) > 1: | ||
raise ValueError( | ||
'{subject} can not have {keys} at the same time'.format( | ||
subject=subject, | ||
keys=constraint['unique'], | ||
) | ||
) | ||
|
||
def set_defaults(self): | ||
"""Set default values to the currently processed data.""" | ||
for key, value in self.defaults.items(): | ||
if key not in self.data[0]: | ||
self.data[0][key] = value | ||
|
||
def validate(self): | ||
"""Validate the current configuration file.""" | ||
self.check_constraints() | ||
self.set_defaults() | ||
return yamale.validate(self.schema, self.data) |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,96 @@ | ||
# Read the Docs configuration file | ||
|
||
# The version of the spec to be use | ||
version: enum('2') | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I assume this spec will be used only on the version 2, if there is another version, we will need to create another schema. Also, I was thinking that we can use this to validate the current v1 API. But I'm not sure if that's worth it. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Yea, I think we can keep v1 as it is. |
||
|
||
# Formats of the documentation to be built | ||
# Default: [] | ||
formats: any(list(enum('htmlzip', 'pdf', 'epub')), enum('all'), required=False) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This is only valid for sphinx, I wonder if we can move this to the sphinx key, or maybe we want to support this #1939? (still no pdf for mkdocs) There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It's only valid for sphinx for now. We might have other engines that produce a subset of these later. I'm fine with this being top level |
||
|
||
# Configuration for Conda support | ||
conda: include('conda', required=False) | ||
|
||
# Configuration for the documentation build process | ||
build: include('build', required=False) | ||
|
||
# Configuration of the Python environment to be used | ||
python: include('python', required=False) | ||
|
||
# Configuration for sphinx documentation | ||
sphinx: include('sphinx', required=False) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Instead of the There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. We could use this to extend the build and support things like #1139 There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Not sure. It does seem reasonable to have a There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Having two different sections could be useful be extended in the future. In case this works as Eric mentioned, we will need a check to to accept one or the other, but not both. |
||
|
||
# Configuration for mkdocs documentation | ||
mkdocs: include('mkdocs', required=False) | ||
|
||
# Submodules configuration | ||
submodules: include('submodules', required=False) | ||
|
||
# Redirects for the current version to be built | ||
# Key/value list, represent redirects of type `type` | ||
# from url -> to url | ||
# Default: null | ||
redirects: map(enum('page'), map(str(), str()), required=False) | ||
|
||
--- | ||
|
||
conda: | ||
# The path to the Conda environment file from the root of the project | ||
environment: path() | ||
|
||
build: | ||
# The build docker image to be used | ||
# Default: '2.0' | ||
image: enum('1.0', '2.0', 'latest', required=False) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. As per readthedocs/readthedocs-docker-images#62, this will be changing:
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think we just need to update the spec without increasing the version number. If there is a v3 spec we can stop updating the images there and add new ones in the most recent spec.
But it is is our docs https://docs.readthedocs.io/en/latest/yaml-config.html#build-image There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Yup, that's changing. We don't want to support numeric versions directly anymore. See the discussion in readthedocs/readthedocs-docker-images#62 for more background.
Yup, agreed. I think this just needs our current images then. |
||
|
||
python: | ||
# The Python version | ||
# Default: '3.6' | ||
version: enum('2', '2.7', '3', '3.5', '3.6', required=False) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'm not sure what to do here. This is dependent on the docker image. Our current I think verifying the python version list is a config parsing level validation, we don't necessarily need to worry about validating across fields as part of this exercise. Perhaps here, for our spec discussion, we just put all possible python versions? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
I'd say yes |
||
|
||
# The path to the requirements file from the root of the project | ||
# Default: null | ||
requirements: path(required=False) | ||
|
||
# Install the project using python setup.py install or pip | ||
# Default: null | ||
install: enum('pip', 'setup.py', required=False) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This is the first breaking change :), I think is more simple to do this, instead of having a boolean field for each There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Also, the |
||
|
||
# Extra requirements sections to install in addition to the package dependencies | ||
# Default: [] | ||
extra_requirements: list(str(), required=False) | ||
|
||
# Give the virtual environment access to the global site-packages dir | ||
# Default: false | ||
system_packages: bool(required=False) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I wasn't 100% sure if this should belong in the There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I mostly want this option to go away, but I don't know if we can do it quite yet. I think it can go in |
||
|
||
sphinx: | ||
# The path to the conf.py file | ||
# Default: rtd will try to find it | ||
configuration: path(required=False) | ||
|
||
# Add the -W option to sphinx-build | ||
# Default: false | ||
fail_on_warning: bool(required=False) | ||
|
||
mkdocs: | ||
# The path to the mkdocs.yml file | ||
# Default: rtd will try to find it | ||
configuration: path(required=False) | ||
|
||
# Add the --strict optio to mkdocs build | ||
# Default: false | ||
fail_on_warning: bool(required=False) | ||
|
||
|
||
submodules: | ||
# List of submodules to be included | ||
# Default: [] | ||
include: any(list(str()), enum('all'), required=False) | ||
|
||
# List of submodules to be ignored | ||
# Default: [] | ||
exclude: any(list(str()), enum('all'), required=False) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. is it possible to mark these There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Nop, there isn't :/, I'll see if there is a way to implement a custom validator to do this. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I imagine we'll probably need some kind double validation:
|
||
|
||
# Do a recursive clone? | ||
# Default: false | ||
recursive: bool(required=False) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As general feedback, I'm not sure this needs to be broken out into a separate django application path. I'd implement this next to similar code so that it was easy to find. I believe
doc_builder/
already has similar code around our config file.See note below though. I don't think this should be application level yet (pending a design discussion around usage of yamale)