cmd/compile, cmd/link, runtime: make defers low-cost through inline code and extra funcdata
Generate inline code at defer time to save the args of defer calls to unique
(autotmp) stack slots, and generate inline code at exit time to check which defer
calls were made and make the associated function/method/interface calls. We
remember that a particular defer statement was reached by storing in the deferBits
variable (always stored on the stack). At exit time, we check the bits of the
deferBits variable to determine which defer function calls to make (in reverse
order). These low-cost defers are only used for functions where no defers
appear in loops. In addition, we don't do these low-cost defers if there are too
many defer statements or too many exits in a function (to limit code increase).
When a function uses open-coded defers, we produce extra
FUNCDATA_OpenCodedDeferInfo information that specifies the number of defers, and
for each defer, the stack slots where the closure and associated args have been
stored. The funcdata also includes the location of the deferBits variable.
Therefore, for panics, we can use this funcdata to determine exactly which defers
are active, and call the appropriate functions/methods/closures with the correct
arguments for each active defer.
In order to unwind the stack correctly after a recover(), we need to add an extra
code segment to functions with open-coded defers that simply calls deferreturn()
and returns. This segment is not reachable by the normal function, but is returned
to by the runtime during recovery. We set the liveness information of this
deferreturn() to be the same as the liveness at the first function call during the
last defer exit code (so all return values and all stack slots needed by the defer
calls will be live).
I needed to increase the stackguard constant from 880 to 896, because of a small
amount of new code in deferreturn().
The -N flag disables open-coded defers. '-d defer' prints out the kind of defer
being used at each defer statement (heap-allocated, stack-allocated, or
open-coded).
Cost of defer statement [ go test -run NONE -bench BenchmarkDefer$ runtime ]
With normal (stack-allocated) defers only: 35.4 ns/op
With open-coded defers: 5.6 ns/op
Cost of function call alone (remove defer keyword): 4.4 ns/op
Text size increase (including funcdata) for go binary without/with open-coded defers: 0.09%
The average size increase (including funcdata) for only the functions that use
open-coded defers is 1.1%.
The cost of a panic followed by a recover got noticeably slower, since panic
processing now requires a scan of the stack for open-coded defer frames. This scan
is required, even if no frames are using open-coded defers:
Cost of panic and recover [ go test -run NONE -bench BenchmarkPanicRecover runtime ]
Without open-coded defers: 62.0 ns/op
With open-coded defers: 255 ns/op
A CGO Go-to-C-to-Go benchmark got noticeably faster because of open-coded defers:
CGO Go-to-C-to-Go benchmark [cd misc/cgo/test; go test -run NONE -bench BenchmarkCGoCallback ]
Without open-coded defers: 443 ns/op
With open-coded defers: 347 ns/op
Updates #14939 (defer performance)
Updates #34481 (design doc)
Change-Id: I63b1a60d1ebf28126f55ee9fd7ecffe9cb23d1ff
Reviewed-on: https://go-review.googlesource.com/c/go/+/202340
Reviewed-by: Austin Clements <austin@google.com>
2019-06-24 13:59:22 -06:00
|
|
|
// errorcheck -0 -l -d=defer
|
|
|
|
|
|
|
|
// Copyright 2019 The Go Authors. All rights reserved.
|
|
|
|
// Use of this source code is governed by a BSD-style
|
|
|
|
// license that can be found in the LICENSE file.
|
|
|
|
|
|
|
|
// check that open-coded defers are used in expected situations
|
|
|
|
|
|
|
|
package main
|
|
|
|
|
|
|
|
import "fmt"
|
|
|
|
|
|
|
|
var glob = 3
|
|
|
|
|
|
|
|
func f1() {
|
|
|
|
|
|
|
|
for i := 0; i < 10; i++ {
|
|
|
|
fmt.Println("loop")
|
|
|
|
}
|
|
|
|
defer func() { // ERROR "open-coded defer"
|
|
|
|
fmt.Println("defer")
|
|
|
|
}()
|
|
|
|
}
|
|
|
|
|
|
|
|
func f2() {
|
|
|
|
for {
|
|
|
|
defer func() { // ERROR "heap-allocated defer"
|
|
|
|
fmt.Println("defer1")
|
|
|
|
}()
|
|
|
|
if glob > 2 {
|
|
|
|
break
|
|
|
|
}
|
|
|
|
}
|
|
|
|
defer func() { // ERROR "stack-allocated defer"
|
|
|
|
fmt.Println("defer2")
|
|
|
|
}()
|
|
|
|
}
|
|
|
|
|
|
|
|
func f3() {
|
|
|
|
defer func() { // ERROR "stack-allocated defer"
|
|
|
|
fmt.Println("defer2")
|
|
|
|
}()
|
|
|
|
for {
|
|
|
|
defer func() { // ERROR "heap-allocated defer"
|
|
|
|
fmt.Println("defer1")
|
|
|
|
}()
|
|
|
|
if glob > 2 {
|
|
|
|
break
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
func f4() {
|
|
|
|
defer func() { // ERROR "open-coded defer"
|
|
|
|
fmt.Println("defer")
|
|
|
|
}()
|
|
|
|
label:
|
|
|
|
fmt.Println("goto loop")
|
|
|
|
if glob > 2 {
|
|
|
|
goto label
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
func f5() {
|
|
|
|
label:
|
|
|
|
fmt.Println("goto loop")
|
|
|
|
defer func() { // ERROR "heap-allocated defer"
|
|
|
|
fmt.Println("defer")
|
|
|
|
}()
|
|
|
|
if glob > 2 {
|
|
|
|
goto label
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
func f6() {
|
|
|
|
label:
|
|
|
|
fmt.Println("goto loop")
|
|
|
|
if glob > 2 {
|
|
|
|
goto label
|
|
|
|
}
|
|
|
|
// The current analysis doesn't end a backward goto loop, so this defer is
|
|
|
|
// considered to be inside a loop
|
|
|
|
defer func() { // ERROR "heap-allocated defer"
|
|
|
|
fmt.Println("defer")
|
|
|
|
}()
|
|
|
|
}
|
2024-07-28 07:12:59 -06:00
|
|
|
|
|
|
|
// Test for function with too many exits, which will disable open-coded defer
|
|
|
|
// even though the number of defer statements is not greater than 8.
|
|
|
|
func f7() {
|
|
|
|
defer println(1) // ERROR "open-coded defer"
|
|
|
|
defer println(1) // ERROR "open-coded defer"
|
|
|
|
defer println(1) // ERROR "open-coded defer"
|
|
|
|
defer println(1) // ERROR "open-coded defer"
|
|
|
|
|
|
|
|
switch glob {
|
|
|
|
case 1:
|
|
|
|
return
|
|
|
|
case 2:
|
|
|
|
return
|
|
|
|
case 3:
|
|
|
|
return
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
func f8() {
|
|
|
|
defer println(1) // ERROR "stack-allocated defer"
|
|
|
|
defer println(1) // ERROR "stack-allocated defer"
|
|
|
|
defer println(1) // ERROR "stack-allocated defer"
|
|
|
|
defer println(1) // ERROR "stack-allocated defer"
|
|
|
|
|
|
|
|
switch glob {
|
|
|
|
case 1:
|
|
|
|
return
|
|
|
|
case 2:
|
|
|
|
return
|
|
|
|
case 3:
|
|
|
|
return
|
|
|
|
case 4:
|
|
|
|
return
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
func f9() {
|
|
|
|
defer println(1) // ERROR "open-coded defer"
|
|
|
|
defer println(1) // ERROR "open-coded defer"
|
|
|
|
defer println(1) // ERROR "open-coded defer"
|
|
|
|
defer println(1) // ERROR "open-coded defer"
|
|
|
|
|
|
|
|
switch glob {
|
|
|
|
case 1:
|
|
|
|
return
|
|
|
|
case 2:
|
|
|
|
return
|
|
|
|
case 3:
|
|
|
|
return
|
|
|
|
case 4:
|
|
|
|
panic("")
|
|
|
|
}
|
|
|
|
}
|