1
0
mirror of https://github.com/golang/go synced 2024-11-25 22:37:59 -07:00
go/src
Joel Sing 9abd11440c math/big: implement addVV in riscv64 assembly
This provides an assembly implementation of addVV for riscv64,
processing up to four words per loop, resulting in a significant
performance gain.

On a StarFive VisionFive 2:

               │   addvv.1    │               addvv.2               │
               │    sec/op    │   sec/op     vs base                │
AddVV/1-4         73.45n ± 0%   48.08n ± 0%  -34.54% (p=0.000 n=10)
AddVV/2-4         88.14n ± 0%   58.76n ± 0%  -33.33% (p=0.000 n=10)
AddVV/3-4        102.80n ± 0%   69.44n ± 0%  -32.45% (p=0.000 n=10)
AddVV/4-4        117.50n ± 0%   72.18n ± 0%  -38.57% (p=0.000 n=10)
AddVV/5-4        132.20n ± 0%   82.79n ± 0%  -37.38% (p=0.000 n=10)
AddVV/10-4        216.3n ± 0%   126.8n ± 0%  -41.35% (p=0.000 n=10)
AddVV/100-4      1659.0n ± 0%   885.2n ± 0%  -46.64% (p=0.000 n=10)
AddVV/1000-4     16.089µ ± 0%   8.400µ ± 0%  -47.79% (p=0.000 n=10)
AddVV/10000-4     245.3µ ± 0%   176.9µ ± 0%  -27.88% (p=0.000 n=10)
AddVV/100000-4    2.537m ± 0%   1.873m ± 0%  -26.17% (p=0.000 n=10)
geomean           1.435µ        904.5n       -36.99%

               │   addvv.1    │                addvv.2                │
               │     B/s      │      B/s       vs base                │
AddVV/1-4        830.9Mi ± 0%   1269.5Mi ± 0%  +52.78% (p=0.000 n=10)
AddVV/2-4        1.353Gi ± 0%    2.029Gi ± 0%  +50.00% (p=0.000 n=10)
AddVV/3-4        1.739Gi ± 0%    2.575Gi ± 0%  +48.09% (p=0.000 n=10)
AddVV/4-4        2.029Gi ± 0%    3.303Gi ± 0%  +62.82% (p=0.000 n=10)
AddVV/5-4        2.254Gi ± 0%    3.600Gi ± 0%  +59.69% (p=0.000 n=10)
AddVV/10-4       2.755Gi ± 0%    4.699Gi ± 0%  +70.54% (p=0.000 n=10)
AddVV/100-4      3.594Gi ± 0%    6.734Gi ± 0%  +87.37% (p=0.000 n=10)
AddVV/1000-4     3.705Gi ± 0%    7.096Gi ± 0%  +91.54% (p=0.000 n=10)
AddVV/10000-4    2.430Gi ± 0%    3.369Gi ± 0%  +38.65% (p=0.000 n=10)
AddVV/100000-4   2.350Gi ± 0%    3.183Gi ± 0%  +35.44% (p=0.000 n=10)
geomean          2.119Gi         3.364Gi       +58.71%

Change-Id: I727b3d9f8ab01eada7270046480b1430d56d0a96
Reviewed-on: https://go-review.googlesource.com/c/go/+/595395
Reviewed-by: Cherry Mui <cherryyz@google.com>
Reviewed-by: David Chase <drchase@google.com>
Reviewed-by: M Zhuo <mengzhuo1203@gmail.com>
LUCI-TryBot-Result: Go LUCI <golang-scoped@luci-project-accounts.iam.gserviceaccount.com>
Reviewed-by: Than McIntosh <thanm@google.com>
2024-08-02 14:14:14 +00:00
..
archive archive: use slices and maps to clean up tests 2024-07-25 00:25:45 +00:00
arena
bufio
builtin
bytes bytes,slices,strings: optimize Repeat a bit 2024-08-01 21:32:50 +00:00
cmd cmd/internal/obj/loong64: add support for MOV{GR2FCSR/FCSR2GR/FR2CF/CF2FR} instructions 2024-08-02 00:29:24 +00:00
cmp
compress
container
context context: handle nil values for valueCtx.String() 2024-07-10 02:44:20 +00:00
crypto crypto: implement encoding.BinaryAppender for all crypto hashes 2024-08-01 14:57:46 +00:00
database/sql database/sql/driver: fix name in comment 2024-07-11 15:01:00 +00:00
debug debug/buildid: treat too large string as "not a Go executable" 2024-08-01 15:02:27 +00:00
embed crypto/x509,embed: use slices to clean up tests 2024-07-24 16:44:15 +00:00
encoding encoding: add TextAppender and BinaryAppender 2024-07-30 14:22:50 +00:00
errors errors: change interface{} to any in comment 2024-05-24 17:13:04 +00:00
expvar
flag flag: handle nil os.Args when setting CommandLine at package level 2024-07-22 20:58:27 +00:00
fmt
go go/types: fix typo in comment 2024-08-01 14:56:12 +00:00
hash all: make function comments match function names 2024-06-03 14:56:25 +00:00
html
image all: make function comments match function names 2024-06-03 14:56:25 +00:00
index/suffixarray
internal cmd/compiler,internal/runtime/atomic: optimize Load{64,32,8} on loong64 2024-08-01 02:17:13 +00:00
io go,internal,io,mime: use slices and maps to clean tests 2024-07-25 00:22:14 +00:00
iter iter: minor doc comment updates 2024-06-21 19:12:59 +00:00
log cmd,log,net,runtime: simplify string prefix and suffix processing 2024-07-29 21:29:17 +00:00
maps maps: document handling of non-reflexive keys 2024-07-17 23:08:52 +00:00
math math/big: implement addVV in riscv64 assembly 2024-08-02 14:14:14 +00:00
mime go,internal,io,mime: use slices and maps to clean tests 2024-07-25 00:22:14 +00:00
net net: replace sort with slices for address and DNS record sorting 2024-07-31 22:06:36 +00:00
os os: rm unused code 2024-07-27 00:57:42 +00:00
path os,path/filepath,testing: use slices to clean up tests 2024-07-25 00:23:06 +00:00
plugin
reflect reflect: add flag tests for MapOf 2024-07-31 20:45:55 +00:00
regexp regexp: allow patterns with no alternates to be one-pass 2024-07-24 01:01:48 +00:00
runtime runtime: avoid futile mark worker acquisition 2024-08-01 21:11:15 +00:00
slices bytes,slices,strings: optimize Repeat a bit 2024-08-01 21:32:50 +00:00
sort sort: add example for Find 2024-07-16 17:55:15 +00:00
strconv strconv: document that Unquote("''") returns an empty string 2024-07-22 18:35:09 +00:00
strings bytes,slices,strings: optimize Repeat a bit 2024-08-01 21:32:50 +00:00
structs
sync all: make struct comments match struct names 2024-07-11 17:23:45 +00:00
syscall syscall: selectively update zerrors_* on openbsd/386, openbsd/arm and openbsd/amd64 2024-06-16 23:08:08 +00:00
testdata
testing os,path/filepath,testing: use slices to clean up tests 2024-07-25 00:23:06 +00:00
text text/template: fix doc spacing 2024-07-22 20:58:50 +00:00
time time: optimize time <-> date conversions 2024-07-31 21:29:46 +00:00
unicode bytes,strings,unicode/utf16: use slices to clean up tests 2024-07-24 18:45:08 +00:00
unique
unsafe unsafe: say "functions like syscall.Syscall", not only Syscall 2024-07-11 23:38:31 +00:00
vendor all: update vendored dependencies 2024-07-23 20:29:12 +00:00
all.bash
all.bat
all.rc
bootstrap.bash
buildall.bash
clean.bash
clean.bat
clean.rc
cmp.bash
go.mod all: update vendored dependencies 2024-07-23 20:29:12 +00:00
go.sum all: update vendored dependencies 2024-07-23 20:29:12 +00:00
make.bash make.bash: drop GNU/kFreeBSD handling 2024-07-22 21:24:34 +00:00
make.bat
Make.dist
make.rc make.bash: preserve GOROOT_BOOTSTRAP 2024-05-29 13:48:46 +00:00
race.bash
race.bat
README.vendor
run.bash
run.bat
run.rc

Vendoring in std and cmd
========================

The Go command maintains copies of external packages needed by the
standard library in the src/vendor and src/cmd/vendor directories.

There are two modules, std and cmd, defined in src/go.mod and
src/cmd/go.mod. When a package outside std or cmd is imported
by a package inside std or cmd, the import path is interpreted
as if it had a "vendor/" prefix. For example, within "crypto/tls",
an import of "golang.org/x/crypto/cryptobyte" resolves to
"vendor/golang.org/x/crypto/cryptobyte". When a package with the
same path is imported from a package outside std or cmd, it will
be resolved normally. Consequently, a binary may be built with two
copies of a package at different versions if the package is
imported normally and vendored by the standard library.

Vendored packages are internally renamed with a "vendor/" prefix
to preserve the invariant that all packages have distinct paths.
This is necessary to avoid compiler and linker conflicts. Adding
a "vendor/" prefix also maintains the invariant that standard
library packages begin with a dotless path element.

The module requirements of std and cmd do not influence version
selection in other modules. They are only considered when running
module commands like 'go get' and 'go mod vendor' from a directory
in GOROOT/src.

Maintaining vendor directories
==============================

Before updating vendor directories, ensure that module mode is enabled.
Make sure that GO111MODULE is not set in the environment, or that it is
set to 'on' or 'auto', and if you use a go.work file, set GOWORK=off.

Requirements may be added, updated, and removed with 'go get'.
The vendor directory may be updated with 'go mod vendor'.
A typical sequence might be:

    cd src  # or src/cmd
    go get golang.org/x/net@master
    go mod tidy
    go mod vendor

Use caution when passing '-u' to 'go get'. The '-u' flag updates
modules providing all transitively imported packages, not only
the module providing the target package.

Note that 'go mod vendor' only copies packages that are transitively
imported by packages in the current module. If a new package is needed,
it should be imported before running 'go mod vendor'.