-
Notifications
You must be signed in to change notification settings - Fork 913
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OmegaConfigLoader returns Dict
instead of DictConfig
, resolves runtime_params properly
#2467
Changes from 15 commits
4565684
67cff2d
055124e
842e83c
d6d224a
78c1ae0
5b48003
5b0bff8
c864783
2c9da40
f578c39
de98e15
4c0571d
f1c1ec2
acbde3e
87a51b6
a1b5f0c
29f62ff
f610010
5799792
78c2371
e24fbfe
0f7911f
e7d6f62
a4d9893
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,6 +11,9 @@ | |
# Upcoming Release 0.18.9 | ||
|
||
## Major features and improvements | ||
* `OmegaConfigLoader` will return a `dict` instead of `DictConfig`. | ||
* `kedro run --params` now update interpolated parameters correctly. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. That's also just for the There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. That's correct, I will make changes to this There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I change I have changed the |
||
|
||
## Bug fixes and other changes | ||
## Breaking changes to the API | ||
## Upcoming deprecations for Kedro 0.19.0 | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -8,7 +8,7 @@ | |
from typing import Any, Dict, Iterable, List, Optional, Set # noqa | ||
|
||
import fsspec | ||
from omegaconf import OmegaConf | ||
from omegaconf import DictConfig, OmegaConf | ||
from omegaconf.resolvers import oc | ||
from yaml.parser import ParserError | ||
from yaml.scanner import ScannerError | ||
|
@@ -166,7 +166,7 @@ def __getitem__(self, key) -> Dict[str, Any]: | |
else: | ||
base_path = str(Path(self._fs.ls("", detail=False)[-1]) / self.base_env) | ||
base_config = self.load_and_merge_dir_config( | ||
base_path, patterns, read_environment_variables | ||
base_path, patterns, key, read_environment_variables | ||
) | ||
config = base_config | ||
|
||
|
@@ -177,7 +177,7 @@ def __getitem__(self, key) -> Dict[str, Any]: | |
else: | ||
env_path = str(Path(self._fs.ls("", detail=False)[-1]) / run_env) | ||
env_config = self.load_and_merge_dir_config( | ||
env_path, patterns, read_environment_variables | ||
env_path, patterns, key, read_environment_variables | ||
) | ||
|
||
# Destructively merge the two env dirs. The chosen env will override base. | ||
|
@@ -209,6 +209,7 @@ def load_and_merge_dir_config( | |
self, | ||
conf_path: str, | ||
patterns: Iterable[str], | ||
key: str, | ||
read_environment_variables: Optional[bool] = False, | ||
) -> Dict[str, Any]: | ||
"""Recursively load and merge all configuration files in a directory using OmegaConf, | ||
|
@@ -217,6 +218,7 @@ def load_and_merge_dir_config( | |
Args: | ||
conf_path: Path to configuration directory. | ||
patterns: List of glob patterns to match the filenames against. | ||
key: Str of the dictionary keys to access. | ||
noklam marked this conversation as resolved.
Show resolved
Hide resolved
|
||
read_environment_variables: Whether to resolve environment variables. | ||
|
||
Raises: | ||
|
@@ -275,7 +277,13 @@ def load_and_merge_dir_config( | |
return {} | ||
if len(aggregate_config) == 1: | ||
return list(aggregate_config)[0] | ||
return dict(OmegaConf.merge(*aggregate_config)) | ||
|
||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think the flow here is a little unclear (too many different This would be solved if we removed lines 276-279. I know @merelcht said there was some reason to pick out the There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I would be keen on removing it, unless we have a specific failing case. I try remove line 278-279 and all test case passed. Should we separate this into another issue? Removing Line 276-277 will cause a few tests fail, I suspect it's also link to #2556 and some refactoring may be needed. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'm happy for those lines to be removed if everything is still working fine. I don't remember why I added it, so really should have left a comment 😅 There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I think we should remove line 278-279 here since as I understand it at the moment they are not just unnecessary but following this change will actually be causing a bug (would be good if you could check this): if you just have a single parameters.yml file, the Lines 276-277 don't actually cause a bug I think, so if removing them makes some tests fail and it seems related to #2556 then let's leave those here for now and try to remove them in that PR. |
||
if key == "parameters": | ||
# Merge with runtime parameters only for "parameters" | ||
return OmegaConf.to_container( | ||
OmegaConf.merge(*aggregate_config, self.runtime_params), resolve=True | ||
) | ||
return OmegaConf.to_container(OmegaConf.merge(*aggregate_config), resolve=True) | ||
|
||
def _is_valid_config_path(self, path): | ||
"""Check if given path is a file path and file type is yaml or json.""" | ||
|
@@ -311,7 +319,7 @@ def _check_duplicates(seen_files_to_keys: Dict[Path, Set[Any]]): | |
raise ValueError(f"{dup_str}") | ||
|
||
@staticmethod | ||
def _resolve_environment_variables(config: Dict[str, Any]) -> None: | ||
def _resolve_environment_variables(config: DictConfig) -> None: | ||
"""Use the ``oc.env`` resolver to read environment variables and replace | ||
them in-place, clearing the resolver after the operation is complete if | ||
it was not registered beforehand. | ||
|
Original file line number | Diff line number | Diff line change | ||||
---|---|---|---|---|---|---|
|
@@ -69,14 +69,20 @@ def local_config(tmp_path): | |||||
|
||||||
@pytest.fixture | ||||||
def create_config_dir(tmp_path, base_config, local_config): | ||||||
proj_catalog = tmp_path / _BASE_ENV / "catalog.yml" | ||||||
base_catalog = tmp_path / _BASE_ENV / "catalog.yml" | ||||||
local_catalog = tmp_path / _DEFAULT_RUN_ENV / "catalog.yml" | ||||||
parameters = tmp_path / _BASE_ENV / "parameters.json" | ||||||
project_parameters = {"param1": 1, "param2": 2} | ||||||
base_parameters = {"param1": 1, "param2": 2, "templated_param": "${global}"} | ||||||
base_global_parameters = {"global": "base"} | ||||||
local_global_parameters = {"global": "local"} | ||||||
|
||||||
_write_yaml(proj_catalog, base_config) | ||||||
_write_yaml(base_catalog, base_config) | ||||||
_write_yaml(local_catalog, local_config) | ||||||
_write_json(parameters, project_parameters) | ||||||
_write_json(parameters, base_parameters) | ||||||
_write_json(tmp_path / _BASE_ENV / "parameters_global.json", base_global_parameters) | ||||||
_write_json( | ||||||
tmp_path / _DEFAULT_RUN_ENV / "parameters_global.json", local_global_parameters | ||||||
) | ||||||
|
||||||
|
||||||
@pytest.fixture | ||||||
|
@@ -530,3 +536,30 @@ def zipdir(path, ziph): | |||||
conf = OmegaConfigLoader(conf_source=f"{tmp_path}/Python.zip") | ||||||
catalog = conf["catalog"] | ||||||
assert catalog["trains"]["type"] == "MemoryDataSet" | ||||||
|
||||||
@use_config_dir | ||||||
def test_variable_interpolation_with_correct_env(self, tmp_path): | ||||||
"""Make sure the parameters is interpolated with the correct environment""" | ||||||
conf = OmegaConfigLoader(str(tmp_path)) | ||||||
params = conf["parameters"] | ||||||
# Making sure it is not override by local/parameters_global.yml | ||||||
assert params["templated_param"] == "base" | ||||||
|
||||||
@use_config_dir | ||||||
def test_runtime_params_override_interpolated_value(self, tmp_path): | ||||||
"""Make sure interpolated value is updated correctly with runtime_params""" | ||||||
conf = OmegaConfigLoader(str(tmp_path), runtime_params={"global": "global"}) | ||||||
params = conf["parameters"] | ||||||
assert params["templated_param"] == "global" | ||||||
|
||||||
@use_config_dir | ||||||
def test_runtime_params_not_propogate_non_parameters_config(self, tmp_path): | ||||||
"""Make sure `catalog`, `credetials` won't get updated by runtime_params""" | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
|
||||||
# https://github.com/kedro-org/kedro/pull/2467 | ||||||
conf = OmegaConfigLoader(str(tmp_path), runtime_params={"global": "global"}) | ||||||
cred = conf["credentials"] | ||||||
catalog = conf["catalog"] | ||||||
logging = conf["logging"] | ||||||
assert "global" not in cred | ||||||
assert "global" not in catalog | ||||||
assert "global" not in logging |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Personally I'd put all this as bug fix rather than major improvement.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's unclear to me if this is part of the official support feature. My reasoning go as below:
We never fixed the
TemplatedConfigLoader
as we promised we need some more thinking for 0.19. If this is considered a bug fix, I don't see why we can't fix this forTemplatedConfigLoader
in 0.18.x and requires users to sub-class it instead.In fact,
TemplatedConfigLoader
doesn't suffer from the residual problem so it can be safely used for all configurations. There is nothing wrong with merging theruntime_params
as this is exactly what we are doing now.#1782 (comment)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a fair comment I think for the "interpolated parameters" point, but I still think the
dict
vs.DictConfig
point should be under bug fix.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@antonymilne I agree with this and I will move it to bug fix.