mem: skip defer for performance #48

abraithwaite · 2016-08-25T22:00:51Z

We're looking to move to go-capnproto2 from the original and we've been comparing the performance of the two using alecthomas' go serialization benchmark suite.

This is a minor patch which improves the speed moderately. We're still looking at options for increasing the speed. Most of them probably involve changing the go structs to pointers, if that's amenable to you.

Here's the performance comparison with and without this changeset:

https://gist.github.com/abraithwaite/a8876d351edf16bc052cd6b5acdaa330

I'm not sure why the svgs don't render nicely on gist though :-(

zombiezen · 2016-08-26T18:49:36Z

How I wish that defer was faster. This seems fine.

I remember Jason using that benchmark suite to compare perf numbers before. You may also be interested in running go test -bench='BenchmarkUnmarshal_Reuse', which makes even less allocations with an increase in potential complexity. I haven't gotten around to upstreaming those.

I'm curious: why will converting the structs to pointers make perf improvements? I recently made a largeish API change to remove allocations in the hot paths because it was adversely impacting perf.

abraithwaite · 2016-08-26T19:01:35Z

My mistake. My expectation is that by not allocating the metadata structs it'll speed it up a bit.

zombiezen · 2016-08-26T19:47:16Z

mem.go

@@ -168,21 +168,27 @@ func (m *Message) NumSegments() int64 {
 // Segment returns the segment with the given ID.
 func (m *Message) Segment(id SegmentID) (*Segment, error) {
 	m.mu.Lock()
-	defer m.mu.Unlock()
+	var seg *Segment
 	if isInt32Bit() && id > maxInt32 {


nit: this case doesn't require the mutex

You mean we can move m.mu.Lock() below this block correct?

abraithwaite · 2016-08-29T21:26:02Z

Coming back to this:

why will converting the structs to pointers make perf improvements?

You may also be interested in running go test -bench='BenchmarkUnmarshal_Reuse'

That's exactly what I was looking for. Only just got around to reading that code. I also made a patch for the the serialization benchmarks repo. Here are the results: https://gist.github.com/abraithwaite/905555e81b35eca8d30987690dfc1220

Very nice! 👏

Edit: here's the patch, if you want to upstream. Otherwise I'd be happy to, but it's your work :-)

https://gist.github.com/abraithwaite/93d6f0b6ed505a78d6686bff1850eb4a

mem.go: skip defer for performance

zombiezen · 2016-09-04T16:04:27Z

Merged with some tweaks to critical sections.

You should go ahead and upstream the benchmark changes.

abraithwaite closed this Aug 26, 2016

abraithwaite reopened this Aug 26, 2016

zombiezen reviewed Aug 26, 2016
View reviewed changes

mem: skip defer for performance

f8b915c

abraithwaite force-pushed the performance branch from cb4dff6 to f8b915c Compare August 29, 2016 22:04

zombiezen merged commit f8b915c into capnproto:master Sep 4, 2016

zombiezen added a commit that referenced this pull request Sep 4, 2016

Merge pull request #48 from abraithwaite/performance

3207911

mem.go: skip defer for performance

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mem: skip defer for performance #48

mem: skip defer for performance #48

abraithwaite commented Aug 25, 2016

zombiezen commented Aug 26, 2016

abraithwaite commented Aug 26, 2016

zombiezen Aug 26, 2016

abraithwaite Aug 26, 2016

zombiezen Aug 26, 2016

abraithwaite commented Aug 29, 2016 •

edited

Loading

zombiezen commented Sep 4, 2016

mem: skip defer for performance #48

mem: skip defer for performance #48

Conversation

abraithwaite commented Aug 25, 2016

zombiezen commented Aug 26, 2016

abraithwaite commented Aug 26, 2016

zombiezen Aug 26, 2016

Choose a reason for hiding this comment

abraithwaite Aug 26, 2016

Choose a reason for hiding this comment

zombiezen Aug 26, 2016

Choose a reason for hiding this comment

abraithwaite commented Aug 29, 2016 • edited Loading

zombiezen commented Sep 4, 2016

abraithwaite commented Aug 29, 2016 •

edited

Loading