Validation for illegal column names in `data` #734

danielhuppmann · 2023-02-28T15:36:01Z

Please confirm that this PR has done the following:

Tests Added
~~Documentation Added~~
~~Name of contributors Added to AUTHORS.rst~~
Description in RELEASE_NOTES.md Added

Description of PR

During review of #729, I noticed that there may be issues when initializing an IamDataFrame with an unnamed column (None or empty string) - both could give unexpected behavior. This PR adds these two corner cases to the validation routine.

…lumn-names

coroa

LGTM. Only minor comments from my side. Feel free to merge.

coroa · 2023-03-03T08:31:13Z

pyam/utils.py

@@ -262,12 +262,18 @@ def _intuit_column_groups(df, index, include_index=False):
        existing_cols = existing_cols.union(df.columns)

    # check that there is no column in the timeseries data with reserved names
+    if None in existing_cols:
+        raise ValueError("Unnamed column in `data`: None")


This could also be part of the ILLEGAL_COLS, i guess, but maybe the message is just clearer this way.

Right, the reason why this is not part ILLEGAL_COLS is that we have an easy solution for string-column-names, which is part of the error message - but this doesn't work (easily) for None.

pyam/utils.py

codecov · 2023-03-03T09:14:40Z

Codecov Report

Merging #734 (7318209) into main (e07d3b9) will increase coverage by 0.0%.
The diff coverage is 98.2%.

@@          Coverage Diff          @@
##            main    #734   +/-   ##
=====================================
  Coverage   95.0%   95.1%           
=====================================
  Files         59      59           
  Lines       6020    6044   +24     
=====================================
+ Hits        5725    5748   +23     
- Misses       295     296    +1

Impacted Files	Coverage Δ
pyam/utils.py	`92.7% <97.8%> (+0.1%)`	⬆️
tests/test_core.py	`100.0% <100.0%> (ø)`

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

danielhuppmann added 3 commits February 28, 2023 16:31

Add empty string to list of illegal column names

84b85fa

Add check for unnamed columns

48cb1ef

Extend the tests

b2a3c1d

danielhuppmann requested a review from coroa February 28, 2023 15:36

danielhuppmann self-assigned this Feb 28, 2023

danielhuppmann added 4 commits February 28, 2023 16:38

Merge remote-tracking branch 'origin/main' into validation/illegal-co…

9a14cbc

…lumn-names

Fix merge conflicts

854559f

Add to release notes

6e61f79

Fix the merge conflict resolution conflict

b958264

danielhuppmann marked this pull request as ready for review February 28, 2023 15:51

coroa approved these changes Mar 3, 2023

View reviewed changes

Fix error message formatting per suggestion by @coroa

7318209

danielhuppmann merged commit 1ca10ca into IAMconsortium:main Mar 3, 2023

danielhuppmann deleted the validation/illegal-column-names branch March 3, 2023 09:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Validation for illegal column names in `data` #734

Validation for illegal column names in `data` #734

danielhuppmann commented Feb 28, 2023 •

edited

Loading

coroa left a comment

coroa Mar 3, 2023

danielhuppmann Mar 3, 2023

codecov bot commented Mar 3, 2023

Validation for illegal column names in data #734

Validation for illegal column names in data #734

Conversation

danielhuppmann commented Feb 28, 2023 • edited Loading

Please confirm that this PR has done the following:

Description of PR

coroa left a comment

Choose a reason for hiding this comment

coroa Mar 3, 2023

Choose a reason for hiding this comment

danielhuppmann Mar 3, 2023

Choose a reason for hiding this comment

codecov bot commented Mar 3, 2023

Codecov Report

Validation for illegal column names in `data` #734

Validation for illegal column names in `data` #734

danielhuppmann commented Feb 28, 2023 •

edited

Loading