Add strong validation on tftypes.NewValue. #67

paddycarver · 2021-03-24T09:48:28Z

Previously, we were validating that the value passed to
tftypes.NewValue was a valid value for any tftypes.Type, but not
for the specific tftypes.Type we passed to tftypes.NewValue. This
meant, for example, tftypes.NewValue(tftypes.Bool, "hello, world")
would return a tftypes.Value that was invalid. The invalid would
eventually cause an error when we try to marshal it into msgpack to send
back to Terraform, but by that point, it's hard to track down where the
invalid tftypes.Value even came from.

This PR changes to a much stricter validation in tftypes.NewValue,
checking that the value passed can be used for the specific
tftypes.Type passed. This goes so far as to as validate that the types
of the tftypes.Values used as attribute or element values on aggregate
types are valid.

I took the opportunity to refactor a bit, as well, splitting the
conversions in tftypes.NewValue out into type-specific helper
functions, and to test the conversions done for tftypes.Number, the
only type we currently convert for.

Testing tftypes.Number surfaced that the conversion of float32
values to float64s that can be used in big.NewFloat was lossy in
precision. I tried a couple of different ways to make the conversion
lossless, but wasn't able to. I therefore removed support for float32
values from tftypes.NewValue--provider developers can convert them
themselves using whatever lossy strategy they like. If we can find a
lossless strategy for turning a float32 into a *big.Float, I'm all
for adding support back, but I'd like our conversions to be lossless to
prevent future situations like hashicorp/terraform-plugin-sdk#655.

I took this opportunity to also properly support uint, uint8, uint16,
uint32, uint64, int, int8, int16, int32, int64, and float64 in
tftypes.NewValue. #46 and #47 meant to do this, but forgot to include
them, only include their pointer variations instead, on accident.

I noticed our cmp.Diff and cmp.Equals calls were still using the
tftypes.ValueComparer() option, which is no longer needed because
tftypes.Value now has an Equal method, which cmp will call. So I
removed that option to clean up.

The tftypes.Type interface also got a supportedGoTypes method, which
returns a slice of strings, indicating which Go types a tftypes.Type
supports. This was helpful in crafting error messages and keeping the
amount of copying and pasting I was doing minimal. It also lets us
define the list of Go types we tell people a tftypes.Type supports
right next to the code that handles those Go types, making it more
likely that the list will stay in sync with the code.

tftypes.DynamicPseudoType proved to be a problem when validating
values in tftypes.NewValue. In theory, provider developers shouldn't
be using it at all, as tftypes.DynamicPseudoType means the type and
value are both unknown. But we need it in our msgpack and JSON
marshaling tests, and those live outside the tftypes package, so they
can't just skip the validation and create a tftypes.Value by hand. My
solution was to allow tftypes.DynamicPseudoType to contain any Go type
that tftypes.String, tftypes.Bool, tftypes.Number,
tftypes.Tuple, or tftypes.Object could. This may need future
revisions and rethinking based on conversations with core about how they
expected tftypes.DynamicPseudoType to be used.

Because the stricter validation will lead to more panics, I added a
tftypes.ValidateValue function that returns an error if
tftypes.NewValue would panic for that combination of tftypes.Type
and Go value. I think panicking is the right answer, usually, as it
represents a programming error that can never be gracefully handled, the
most rudimentary of testing will find it, and returning an error creates
a prohibitive number of branches in provider code and makes working with
tftypes much, much harder. For example, constructing a map, object,
list, set, or tuple would now require provider developers to construct
the tftypes.Value for each element and attribute individually, and
handle the errors from each. By panicking, we allow the inline
definition of these values, reducing them to a one-liner that doesn't
pollute the scope with a bunch of variables. However, the above isn't
true in one case that I can think of: if the provider is constructing a
tftypes.Value out of a user- or API-defined type that can change at
runtime, that is no longer a programming error and is now an
environmental error--i.e., it can be prompted by changes in the context
the code is running in. This makes testing to avoid panics much, much
harder. In that case, panicking is an unfortunate situation. I think
that's a rare case, however, and providers that find themselves in that
case can use tftypes.ValidateValue to handle the error as they see
fit.

I removed a number of test cases from the TestValueAs test function,
as they were really testing tftypes.NewValue instead of
tftypes.Value.As, which was confusing. These tests are replicated and
expanded upon in the new tests for tftypes.NewValue.

Finally, the stricter validation in tftypes.NewValue found an error in
the TestTransform tests, where an object didn't have values for all
the attributes listed in its type. The extra attributes listed in the
type were removed.

kmoe

Awesome work 🎉

Previously, we were validating that the value passed to `tftypes.NewValue` was a valid value for _any_ `tftypes.Type`, but not for the specific `tftypes.Type` we passed to `tftypes.NewValue`. This meant, for example, `tftypes.NewValue(tftypes.Bool, "hello, world")` would return a `tftypes.Value` that was invalid. The invalid would eventually cause an error when we try to marshal it into msgpack to send back to Terraform, but by that point, it's hard to track down where the invalid `tftypes.Value` even came from. This PR changes to a much stricter validation in `tftypes.NewValue`, checking that the value passed can be used for the specific `tftypes.Type` passed. This goes so far as to as validate that the types of the `tftypes.Value`s used as attribute or element values on aggregate types are valid. I took the opportunity to refactor a bit, as well, splitting the conversions in `tftypes.NewValue` out into type-specific helper functions, and to test the conversions done for `tftypes.Number`, the only type we currently convert for. Testing `tftypes.Number` surfaced that the conversion of `float32` values to `float64`s that can be used in `big.NewFloat` was lossy in precision. I tried a couple of different ways to make the conversion lossless, but wasn't able to. I therefore removed support for `float32` values from `tftypes.NewValue`--provider developers can convert them themselves using whatever lossy strategy they like. If we can find a lossless strategy for turning a `float32` into a `*big.Float`, I'm all for adding support back, but I'd like our conversions to be lossless to prevent future situations like hashicorp/terraform-plugin-sdk#655. I took this opportunity to also properly support uint, uint8, uint16, uint32, uint64, int, int8, int16, int32, int64, and float64 in `tftypes.NewValue`. #46 and #47 meant to do this, but forgot to include them, only include their pointer variations instead, on accident. I noticed our `cmp.Diff` and `cmp.Equals` calls were still using the `tftypes.ValueComparer()` option, which is no longer needed because `tftypes.Value` now has an `Equal` method, which `cmp` will call. So I removed that option to clean up. The `tftypes.Type` interface also got a `supportedGoTypes` method, which returns a slice of strings, indicating which Go types a `tftypes.Type` supports. This was helpful in crafting error messages and keeping the amount of copying and pasting I was doing minimal. It also lets us define the list of Go types we tell people a `tftypes.Type` supports right next to the code that handles those Go types, making it more likely that the list will stay in sync with the code. `tftypes.DynamicPseudoType` proved to be a problem when validating values in `tftypes.NewValue`. In theory, provider developers shouldn't be using it at all, as `tftypes.DynamicPseudoType` means the type _and_ value are both unknown. But we need it in our msgpack and JSON marshaling tests, and those live outside the `tftypes` package, so they can't just skip the validation and create a `tftypes.Value` by hand. My solution was to allow `tftypes.DynamicPseudoType` to contain any Go type that `tftypes.String`, `tftypes.Bool`, `tftypes.Number`, `tftypes.Tuple`, or `tftypes.Object` could. This may need future revisions and rethinking based on conversations with core about how they expected `tftypes.DynamicPseudoType` to be used. Because the stricter validation will lead to more panics, I added a `tftypes.ValidateValue` function that returns an error if `tftypes.NewValue` would panic for that combination of `tftypes.Type` and Go value. I think panicking is the right answer, usually, as it represents a programming error that can never be gracefully handled, the most rudimentary of testing will find it, and returning an error creates a prohibitive number of branches in provider code and makes working with `tftypes` much, much harder. For example, constructing a map, object, list, set, or tuple would now require provider developers to construct the `tftypes.Value` for each element and attribute individually, and handle the errors from each. By panicking, we allow the inline definition of these values, reducing them to a one-liner that doesn't pollute the scope with a bunch of variables. However, the above isn't true in one case that I can think of: if the provider is constructing a `tftypes.Value` out of a user- or API-defined type that can change at runtime, that is no longer a programming error and is now an environmental error--i.e., it can be prompted by changes in the context the code is running in. This makes testing to avoid panics much, much harder. In that case, panicking is an unfortunate situation. I think that's a rare case, however, and providers that find themselves in that case can use `tftypes.ValidateValue` to handle the error as they see fit. I removed a number of test cases from the `TestValueAs` test function, as they were really testing `tftypes.NewValue` instead of `tftypes.Value.As`, which was confusing. These tests are replicated and expanded upon in the new tests for `tftypes.NewValue`. Finally, the stricter validation in `tftypes.NewValue` found an error in the `TestTransform` tests, where an object didn't have values for all the attributes listed in its type. The extra attributes listed in the type were removed.

github-actions · 2024-05-28T05:06:01Z

I'm going to lock this pull request because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active contributions.
If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

paddycarver added bug Something isn't working enhancement New feature or request breaking-change This will impact or improve our compatibility posture labels Mar 24, 2021

paddycarver requested a review from a team March 24, 2021 09:48

kmoe approved these changes Mar 31, 2021

View reviewed changes

paddycarver added 3 commits March 31, 2021 08:47

Add changelog entry.

ded5941

Remove unused function from tests.

5995f10

paddycarver force-pushed the paddy_value_validation branch from fae0ded to 5995f10 Compare March 31, 2021 15:47

paddycarver merged commit bbc1a4d into main Mar 31, 2021

paddycarver deleted the paddy_value_validation branch March 31, 2021 15:49

paddycarver mentioned this pull request Mar 31, 2021

Better error messages when passing an incompatible type to NewDynamicValue #44

Closed

github-actions bot locked as resolved and limited conversation to collaborators May 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add strong validation on tftypes.NewValue. #67

Add strong validation on tftypes.NewValue. #67

paddycarver commented Mar 24, 2021

kmoe left a comment

github-actions bot commented May 28, 2024

Add strong validation on tftypes.NewValue. #67

Add strong validation on tftypes.NewValue. #67

Conversation

paddycarver commented Mar 24, 2021

kmoe left a comment

Choose a reason for hiding this comment

github-actions bot commented May 28, 2024