-
Notifications
You must be signed in to change notification settings - Fork 17.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
reflect: add a function for treating C memory as a Go slice #13656
Comments
We already have C.GoBytes that converts a unsafe.Pointer
and a length to []byte, why do we need another?
After converting the pointer to a byte slice, it's very easy
to convert to []T for arbitrary T.
|
Not without copying the underlying array. Or am I missing something? |
You can convert a unsafe.Pointer p to any []T without copying
as follows:
(*[(1 << 31) / unsafe.Sizeof(T{})]T)(p)[:len:len]
|
That isn't portable: on 64-bit platforms it only expresses sizes up to 2GB. (And you can't work around it by increasing the constant because then it doesn't work on 32-bit platforms.) It also seems likely to interact poorly with analysis tools: it temporarily produces a pointer to an "array" with elements in invalid memory. |
There is way to use correct cap on all platforms:
(*[1<<(^uint(0)>>27&63)/unsafe.Sizeof(T{}) - 1]T)(p)[:len:len]
If the code breaks some analysis tool, then it's a
bug of the analysis tool. The code is fully conforming
and portable. If p is not pointing to Go heap, it should
also work with future GC enhancements.
|
|
Also, those constants aren't even correct on amd64:
package main
import (
"fmt"
"reflect"
"unsafe"
)
func main() {
s := []byte("Hello, ⚛")
n := len(s)
p := reflect.ValueOf(s).Pointer()
s2 := (*[1<<(^uint(0)>>27&63)/unsafe.Sizeof([]byte{}) - 1]byte)(p)[:n:n]
fmt.Println(string(s2))
}
That comes from here: To me, the fact that even you - someone deeply familiar with the Go language and its implementations - couldn't get those constants right strongly suggests that we should not expect more casual cgo users to do so. |
I agree that we should probably find a solution to this problem. We want to make cgo easier & safer. Using SliceHeader is ok, that "cannot be used safely" warning is aimed at people trying to store Go pointers in its Data field. Storing C pointers in the Data field should be ok. The rest of the warning is basically the same as the one for unsafe.Pointer.
(In an ideal world, SliceHeader.Data should be an unsafe.Pointer, not a uintptr. Too late now.) I'm hesitant to put the fix in reflect, it is too heavyweight. Maybe just something in package unsafe?
|
If that's the case, then just a doc fix might be sufficient. (I suggested a similar approach in some code I was reviewing and @ianlancetaylor said not to do it.)
Maybe? But unsafe doesn't otherwise have anything to do with built-in data structures like slices, and it's not entirely clear to me how you'd get from that to a value of slice type. Would you convert "s" to a pointer-to-slice, or to some other type (e.g. the slice itself)? If it's a pointer-to-slice, is the slice header allocated on the Go heap? reflect.Value.Interface() at least makes the answers to those questions consistent, which is why it seems like a more natural fit to me - and |
In particular, this snippet of existing func SliceAt(typ reflect.Type, len, cap int, p unsafe.Pointer) reflect.Value {
return reflect.NewAt(reflect.ArrayOf(len, typ.Elem()), p).Elem().Slice(0, len)
} |
You'd do something like this:
Doing it this way gets around the polymorphic return type you would need if MakeSlice returned a slice, or the allocation you would need if MakeSlice returned an unsafe.Pointer to a slice. I agree that this is functionality that unsafe hasn't had historically. But I think it would be the right place for this operation. On the other hand, reflect.NewAt has a similar feel, and maybe both it and MakeSlice should be together. |
I don't see how to use reflect.SliceHeader safely unless we exempt it from the Go 1 guarantee or promise that we will never change the representation of slices. Since in the past we've discussed changing the representation of slices, it seems unwise to commit to the current representation for all time and for all implementations. Same for reflect.StringHeader, of course. |
To use reflect.SliceHeader you need to use unsafe, and unsafe is exempt from the Go 1 guarantee. |
FWIW, this is not just a cgo related issue, it already occurs in some of the syscal-wrappers. For example, an implementation of syscall.Mmap needs to do basically the same construction. It would be helpful if there is an API - even under unsafe - available that hides the implementation details and memory-layout for slices away. (Assuming the API doesn't need to change or can be augmented with an refined version). |
This doesn't seem exactly what reflect is for. And it's not specific to cgo either: this comes up in system calls like mmap. I could see possibly adding something like unsafe.Slice(p, n, m) with the right types (p is pointer to T, result is []T). That would be a language change. I guess we could add reflect.UnsafeSlice(t, p, n, m) to return an interface{} or a Value holding a slice of type t with pointer p, len n, cap m. It's not really reflect but maybe it's fine. I guess it's more reflect than Swapper, although that's not a great argument. Personally, I don't mind writing |
I think it's too late to do this for Go 1.8 at least. |
@bcmills, are you still interested in this? |
Yes. (But I'm not sure whether I'll have the bandwidth to implement it for 1.9.) |
It's #19367 that I'm thinking of. That's on hold for Go 2, so probably this should be too. (That would address this exact need.) |
FWIW, I have implemented a library locally using My solution for // SliceAt returns a view of the memory at p as a slice of elem.
// The elem parameter is the element type of the slice, not the complete slice type.
func SliceAt(elem Type, p unsafe.Pointer, n int) Value {
if p == nil && n == 0 {
return Zero(SliceOf(elem))
}
return NewAt(bigArrayOf(elem), p).Elem().Slice3(0, n, n)
}
// bigArrayOf returns the type of a maximally-sized array of t.
//
// This works around the memory-stranding issue described in
// https://golang.org/issue/13656 by producing only one array type per element
// type (instead of one array type per length).
func bigArrayOf(t Type) Type {
n := ^uintptr(0) / uintptr(t.Size())
const maxInt = uintptr(^uint(0) >> 1)
if n > maxInt {
n = maxInt
}
return ArrayOf(int(n), t)
} [Edit: based on #19367 (comment), I would recommend a somewhat different approach, available as |
The problem is having to write A simpler way to write that entire expression would be ideal. If that can't be fixed before Go2, perhaps an easier problem could be. A lot of people much smarter than I am have written the constant If cgo (or runtime or somewhere appropriate) exported a constant with the correct value of I would much rather a solution to the whole problem be found and am only suggesting this if there is no better solution that can be implemented before Go2. |
@jimmyfrasche It's an interesting idea but the maximum value for |
@ianlancetaylor if it's the maximum for the platform assuming a size of 1 you could use it to compute the correct value for arbitrary T. Still not pretty and far from ideal, but its use could be documented and explained on the wiki (which currently just has 1 << 30 without explanation now https://github.com/golang/go/wiki/cgo#turning-c-arrays-into-go-slices ) |
Going to close this since it seems clear that anything we do in reflect is going to be awful. Better to wait for #19367. |
Hi @rsc @ianlancetaylor, I am also curious about the specification of the GC behavior when it does not interacts with slices/strings which I have only found this paragraph from the cgo wiki. This question relates this comment by @bcmills :
Thanks! |
@gobwas The issue tracker is not the right place for this kind of discussion, especially not on a closed issue. Please use a forum, probably golang-nuts. See https://golang.org/wiki/Questions. Thanks. |
Now that cgo supports exporting Go functions to C, it would be useful for Go functions to be able to read C memory without making unnecessary copies. In particular, we would like to be able to read (and write) a C
char*
using functions that operate on a Go[]byte
(e.g.bytes.Reader
orio.Reader.Read
).reflect.NewAt
andreflect.ArrayOf
get us partway there: we can treat the pointer to C memory as a Go pointer to an array of the correct underlying length, then usereflect.Value.Slice
to convert that to a slice. Unfortunately,reflect.ArrayOf
strands a small amount of memory for a uniquereflect.Type
for every distinct length used: http://play.golang.org/p/lc3QnMnKP_We could consider using
reflect.SliceHeader
instead, but it "cannot be used safely or portably" (https://golang.org/pkg/reflect/#SliceHeader).So it seems that we don't have an efficient solution for this conversion.
The simplest solution seems like it would be to add a function to the
reflect
package to construct the slice directly. Following the example ofreflect.MakeSlice
andreflect.NewAt
, it could be something like:The text was updated successfully, but these errors were encountered: