qbit/go - go - Tape:neT

qbit/go

mirror of https://github.com/golang/go synced 2024-10-04 16:11:21 -06:00

Author	SHA1	Message	Date
Rémy Oudompheng	33cceb09e2	cmd/{5g,6g,8g,6c}: remove unused macro, use %E to print etype. R=golang-dev, rsc, dave CC=golang-dev https://golang.org/cl/6569044	2012-09-24 23:44:00 +02:00
Rémy Oudompheng	f4e76d5e02	cmd/6g, cmd/8g: add OINDREG, ODOT, ODOTPTR cases to igen. Apart from reducing the number of LEAL/LEAQ instructions by about 30%, it gives 8g easier registerization in several cases, for example in strconv. Performance with 6g is not affected. Before (386): src/pkg/strconv/decimal.go:22 TEXT (decimal).String+0(SB),$240-12 src/pkg/strconv/extfloat.go:540 TEXT (extFloat).ShortestDecimal+0(SB),$584-20 After (386): src/pkg/strconv/decimal.go:22 TEXT (decimal).String+0(SB),$196-12 src/pkg/strconv/extfloat.go:540 TEXT (extFloat).ShortestDecimal+0(SB),$420-20 Benchmarks with GOARCH=386 (on a Core 2). benchmark old ns/op new ns/op delta BenchmarkBinaryTree17 7110191000 7079644000 -0.43% BenchmarkFannkuch11 7769274000 7766514000 -0.04% BenchmarkGobDecode 33454820 34755400 +3.89% BenchmarkGobEncode 11675710 11007050 -5.73% BenchmarkGzip 2013519000 1593855000 -20.84% BenchmarkGunzip 253368200 242667600 -4.22% BenchmarkJSONEncode 152443900 120763400 -20.78% BenchmarkJSONDecode 304112800 247461800 -18.63% BenchmarkMandelbrot200 29245520 29240490 -0.02% BenchmarkParse 8484105 8088660 -4.66% BenchmarkRevcomp 2695688000 2841263000 +5.40% BenchmarkTemplate 363759800 277271200 -23.78% benchmark old ns/op new ns/op delta BenchmarkAtof64Decimal 127 129 +1.57% BenchmarkAtof64Float 166 164 -1.20% BenchmarkAtof64FloatExp 308 300 -2.60% BenchmarkAtof64Big 584 571 -2.23% BenchmarkAppendFloatDecimal 440 430 -2.27% BenchmarkAppendFloat 995 776 -22.01% BenchmarkAppendFloatExp 897 746 -16.83% BenchmarkAppendFloatNegExp 900 752 -16.44% BenchmarkAppendFloatBig 1528 1228 -19.63% BenchmarkAppendFloat32Integer 443 453 +2.26% BenchmarkAppendFloat32ExactFraction 812 661 -18.60% BenchmarkAppendFloat32Point 1002 773 -22.85% BenchmarkAppendFloat32Exp 858 725 -15.50% BenchmarkAppendFloat32NegExp 848 728 -14.15% BenchmarkAppendFloat64Fixed1 447 431 -3.58% BenchmarkAppendFloat64Fixed2 480 462 -3.75% BenchmarkAppendFloat64Fixed3 461 457 -0.87% BenchmarkAppendFloat64Fixed4 509 484 -4.91% Update #1914. R=rsc, nigeltao CC=golang-dev, remy https://golang.org/cl/6494107	2012-09-24 23:07:44 +02:00
Russ Cox	650160e36a	cmd/gc: prepare for 64-bit ints This CL makes the compiler understand that the type of the len or cap of a map, slice, or string is 'int', not 'int32'. It does not change the meaning of int, but it should make the eventual change of the meaning of int in 6g a bit smoother. Update #2188. R=ken, dave, remyoudompheng CC=golang-dev https://golang.org/cl/6542059	2012-09-24 14:59:44 -04:00
Rémy Oudompheng	5e3fb887a3	cmd/[568]g: explain the purpose of various Reg fields. R=golang-dev, rsc CC=golang-dev, remy https://golang.org/cl/6554062	2012-09-24 20:55:11 +02:00
Rémy Oudompheng	36df358a30	cmd/6g: fix internal error with SSE registers. Revision 63f7abcae015 introduced a bug caused by code assuming registers started at X5, not X0. Fixes #4138. R=rsc CC=golang-dev, remy https://golang.org/cl/6558043	2012-09-23 18:22:03 +02:00
Russ Cox	05ac300830	cmd/gc: fix use of nil interface, slice Fixes #3670. R=ken2 CC=golang-dev https://golang.org/cl/6542058	2012-09-22 20:42:11 -04:00
Russ Cox	658482d70f	cmd/5g: fix register opt bug The width was not being set on the address, which meant that the optimizer could not find variables that overlapped with it and mark them as having had their address taken. This let to the compiler believing variables had been set but never used and then optimizing away the set. Fixes #4129. R=ken2 CC=golang-dev https://golang.org/cl/6552059	2012-09-22 10:01:35 -04:00
Rémy Oudompheng	413fbed341	cmd/6g: cosmetic improvements to regopt debugging. R=rsc, golang-dev CC=golang-dev https://golang.org/cl/6528044	2012-09-21 20:20:26 +02:00
Russ Cox	57ad05db15	cmd/6g: use all 16 float registers, optimize float moves Fixes #2446. R=ken2 CC=golang-dev https://golang.org/cl/6557044	2012-09-21 13:39:09 -04:00
Nigel Tao	a9a675ec35	cmd/6g, cmd/8g: clean up unnecessary switch code in componentgen. Code higher up in the function already catches these cases. R=remyoudompheng, rsc CC=golang-dev https://golang.org/cl/6496106	2012-09-12 21:47:05 +10:00
Rémy Oudompheng	b45b6fd1c7	cmd/6g, cmd/8g: do not LEA[LQ] interfaces when calling methods. It is enough to load directly the data word and the itab word from memory, so we save a LEA instruction for each method call, and allow elimination of some extra temporaries. Update #1914. R=daniel.morsing, rsc CC=golang-dev, remy https://golang.org/cl/6501110	2012-09-11 08:45:23 +02:00
Rémy Oudompheng	ff642e290f	cmd/6g, cmd/8g: eliminate extra agen for nil comparisons. Removes an extra LEAL/LEAQ instructions there and usually saves a useless temporary in the idiom if err := foo(); err != nil {...} Generated code is also less involved: MOVQ err+n(SP), AX CMPQ AX, $0 (potentially CMPQ n(SP), $0) instead of LEAQ err+n(SP), AX CMPQ (AX), $0 Update #1914. R=daniel.morsing, nigeltao, rsc CC=golang-dev, remy https://golang.org/cl/6493099	2012-09-11 08:08:40 +02:00
Nigel Tao	5d7ece6f44	6g: delete unnecessary OXXX initialization. No longer necessary after https://golang.org/cl/6497073/ removed the `if(n5.op != OXXX) { regfree(&n5); }`. R=remy, r CC=golang-dev, rsc https://golang.org/cl/6498101	2012-09-10 11:24:34 +10:00
Rémy Oudompheng	acbe6c94d7	cmd/6g: avoid taking the address of slices unnecessarily. The main case where it happens is when evaluating &s[i] without bounds checking, which usually happens during range loops (i=0). This allows registerization of the corresponding variables, saving 16 bytes of stack frame for each such range loop and a LEAQ instruction. R=golang-dev, rsc, dave CC=golang-dev, remy https://golang.org/cl/6497073	2012-09-07 06:54:42 +02:00
Nigel Tao	481e5c6ad0	cmd/gc: re-order some OFOO constants. Rename ORRC to ORROTC to be consistent with OLROT. Delete unused OBAD, OLRC. R=rsc, dave CC=golang-dev https://golang.org/cl/6489082	2012-09-06 10:47:25 +10:00
Rémy Oudompheng	8f3c2055bd	cmd/6g, cmd/8g: eliminate short integer arithmetic when possible. Fixes #3909. Fixes #3910. R=rsc, nigeltao CC=golang-dev https://golang.org/cl/6442114	2012-09-01 16:40:54 +02:00
Shenghou Ma	e80f6a4de1	cmd/6g: fix float32/64->uint64 conversion CVTSS2SQ's rounding mode is controlled by the RC field of MXCSR; as we specifically need truncate semantic, we should use CVTTSS2SQ. Fixes #3804. R=rsc, r CC=golang-dev https://golang.org/cl/6352079	2012-08-23 14:35:26 +08:00
Nigel Tao	18e86644a3	cmd/gc: cache itab lookup in convT2I. There may be further savings if convT2I can avoid the function call if the cache is good and T is uintptr-shaped, a la convT2E, but that will be a follow-up CL. src/pkg/runtime: benchmark old ns/op new ns/op delta BenchmarkConvT2ISmall 43 15 -64.01% BenchmarkConvT2IUintptr 45 14 -67.48% BenchmarkConvT2ILarge 130 101 -22.31% test/bench/go1: benchmark old ns/op new ns/op delta BenchmarkBinaryTree17 8588997000 8499058000 -1.05% BenchmarkFannkuch11 5300392000 5358093000 +1.09% BenchmarkGobDecode 30295580 31040190 +2.46% BenchmarkGobEncode 18102070 17675650 -2.36% BenchmarkGzip 774191400 771591400 -0.34% BenchmarkGunzip 245915100 247464100 +0.63% BenchmarkJSONEncode 123577000 121423050 -1.74% BenchmarkJSONDecode 451969800 596256200 +31.92% BenchmarkMandelbrot200 10060050 10072880 +0.13% BenchmarkParse 10989840 11037710 +0.44% BenchmarkRevcomp 1782666000 1716864000 -3.69% BenchmarkTemplate 798286600 723234400 -9.40% R=rsc, bradfitz, go.peter.90, daniel.morsing, dave, uriel CC=golang-dev https://golang.org/cl/6337058	2012-07-03 09:09:05 +10:00
Nigel Tao	8f84328fdc	cmd/gc: inline convT2E when T is uintptr-shaped. GOARCH=amd64 benchmarks src/pkg/runtime benchmark old ns/op new ns/op delta BenchmarkConvT2ESmall 10 10 +1.00% BenchmarkConvT2EUintptr 9 0 -92.07% BenchmarkConvT2EBig 74 74 -0.27% BenchmarkConvT2I 27 26 -3.62% BenchmarkConvI2E 4 4 -7.05% BenchmarkConvI2I 20 19 -2.99% test/bench/go1 benchmark old ns/op new ns/op delta BenchmarkBinaryTree17 5930908000 5937260000 +0.11% BenchmarkFannkuch11 3927057000 3933556000 +0.17% BenchmarkGobDecode 21998090 21870620 -0.58% BenchmarkGobEncode 12725310 12734480 +0.07% BenchmarkGzip 567617600 567892800 +0.05% BenchmarkGunzip 178284100 178706900 +0.24% BenchmarkJSONEncode 87693550 86794300 -1.03% BenchmarkJSONDecode 314212600 324115000 +3.15% BenchmarkMandelbrot200 7016640 7073766 +0.81% BenchmarkParse 7852100 7892085 +0.51% BenchmarkRevcomp 1285663000 1286147000 +0.04% BenchmarkTemplate 566823800 567606200 +0.14% I'm not entirely sure why the JSON* numbers have changed, but eyeballing the profile suggests that it could be spending less and more time in runtime.{new,old}stack, so it could simply be stack-split boundary noise. R=rsc, dave, bsiegert, dsymonds CC=golang-dev https://golang.org/cl/6280049	2012-06-14 10:43:20 +10:00
Russ Cox	b185de82a4	cmd/gc: limit data disassembly to -SS This makes -S useful again. R=ken2 CC=golang-dev https://golang.org/cl/6302054	2012-06-07 12:05:34 -04:00
Rémy Oudompheng	a7059cc793	cmd/[568]g: correct freeing of allocated Regs. R=golang-dev, rsc CC=golang-dev, remy https://golang.org/cl/6281050	2012-06-05 06:43:15 +02:00
Luuk van Dijk	40af78c19e	cmd/gc: inline slice[arr,str] in the frontend (mostly). R=rsc, ality, rogpeppe, minux.ma, dave CC=golang-dev https://golang.org/cl/5966075	2012-06-02 22:50:57 -04:00
Russ Cox	96b0594833	cmd/5g, cmd/6g, cmd/8g: delete clearstk Dreg from https://golang.org/cl/4629042 R=ken2 CC=golang-dev https://golang.org/cl/6259057	2012-06-01 10:10:59 -04:00
Russ Cox	001b75c942	cmd/gc: contiguous loop layout Drop expecttaken function in favor of extra argument to gbranch and bgen. Mark loop condition as likely to be true, so that loops are generated inline. The main benefit here is contiguous code when trying to read the generated assembly. It has only minor effects on the timing, and they mostly cancel the minor effects that aligning function entry points had. One exception: both changes made Fannkuch faster. Compared to before CL 6244066 (before aligned functions) benchmark old ns/op new ns/op delta BenchmarkBinaryTree17 4222117400 4201958800 -0.48% BenchmarkFannkuch11 3462631800 3215908600 -7.13% BenchmarkGobDecode 20887622 20899164 +0.06% BenchmarkGobEncode 9548772 9439083 -1.15% BenchmarkGzip 151687 152060 +0.25% BenchmarkGunzip 8742 8711 -0.35% BenchmarkJSONEncode 62730560 62686700 -0.07% BenchmarkJSONDecode 252569180 252368960 -0.08% BenchmarkMandelbrot200 5267599 5252531 -0.29% BenchmarkRevcomp25M 980813500 985248400 +0.45% BenchmarkTemplate 361259100 357414680 -1.06% Compared to tip (aligned functions): benchmark old ns/op new ns/op delta BenchmarkBinaryTree17 4140739800 4201958800 +1.48% BenchmarkFannkuch11 3259914400 3215908600 -1.35% BenchmarkGobDecode 20620222 20899164 +1.35% BenchmarkGobEncode 9384886 9439083 +0.58% BenchmarkGzip 150333 152060 +1.15% BenchmarkGunzip 8741 8711 -0.34% BenchmarkJSONEncode 65210990 62686700 -3.87% BenchmarkJSONDecode 249394860 252368960 +1.19% BenchmarkMandelbrot200 5273394 5252531 -0.40% BenchmarkRevcomp25M 996013800 985248400 -1.08% BenchmarkTemplate 360620840 357414680 -0.89% R=ken2 CC=golang-dev https://golang.org/cl/6245069	2012-05-30 18:07:39 -04:00
Russ Cox	a768de8347	cmd/6g: avoid MOVSD between registers MOVSD only copies the low half of the packed register pair, while MOVAPD copies both halves. I assume the internal register renaming works better with the latter, since it makes our code run 25% faster. Before: mandelbrot 16000 gcc -O2 mandelbrot.c 28.44u 0.00s 28.45r gc mandelbrot 44.12u 0.00s 44.13r gc_B mandelbrot 44.17u 0.01s 44.19r After: mandelbrot 16000 gcc -O2 mandelbrot.c 28.22u 0.00s 28.23r gc mandelbrot 32.81u 0.00s 32.82r gc_B mandelbrot 32.82u 0.00s 32.83r R=ken2 CC=golang-dev https://golang.org/cl/6248068	2012-05-30 14:41:19 -04:00
Russ Cox	de96df1b02	cmd/6g: change sbop swap logic I added the nl->op == OLITERAL case during the recent performance round, and while it helps for small integer constants, it hurts for floating point constants. In the Mandelbrot benchmark it causes 2ZrZi to compile like Zr2Zi: 0x000000000042663d <+249>: movsd %xmm6,%xmm0 0x0000000000426641 <+253>: movsd $2,%xmm1 0x000000000042664a <+262>: mulsd %xmm1,%xmm0 0x000000000042664e <+266>: mulsd %xmm5,%xmm0 instead of: 0x0000000000426835 <+276>: movsd $2,%xmm0 0x000000000042683e <+285>: mulsd %xmm6,%xmm0 0x0000000000426842 <+289>: mulsd %xmm5,%xmm0 It is unclear why that has such a dramatic performance effect in a tight loop, but it's obviously slightly better code, so go with it. benchmark old ns/op new ns/op delta BenchmarkBinaryTree17 5957470000 5973924000 +0.28% BenchmarkFannkuch11 3811295000 3869128000 +1.52% BenchmarkGobDecode 26001900 25670500 -1.27% BenchmarkGobEncode 12051430 11948590 -0.85% BenchmarkGzip 177432 174821 -1.47% BenchmarkGunzip 10967 10756 -1.92% BenchmarkJSONEncode 78924750 79746900 +1.04% BenchmarkJSONDecode 313606400 307081600 -2.08% BenchmarkMandelbrot200 13670860 8200725 -40.01% !!! BenchmarkRevcomp25M 1179194000 1206539000 +2.32% BenchmarkTemplate 447931200 443948200 -0.89% BenchmarkMD5Hash1K 2856 2873 +0.60% BenchmarkMD5Hash8K 22083 22029 -0.24% benchmark old MB/s new MB/s speedup BenchmarkGobDecode 29.52 29.90 1.01x BenchmarkGobEncode 63.69 64.24 1.01x BenchmarkJSONEncode 24.59 24.33 0.99x BenchmarkJSONDecode 6.19 6.32 1.02x BenchmarkRevcomp25M 215.54 210.66 0.98x BenchmarkTemplate 4.33 4.37 1.01x BenchmarkMD5Hash1K 358.54 356.31 0.99x BenchmarkMD5Hash8K 370.95 371.86 1.00x R=ken2 CC=golang-dev https://golang.org/cl/6261051	2012-05-30 10:22:33 -04:00
Russ Cox	fefae6eed1	cmd/6g, cmd/8g: move panicindex calls out of line The old code generated for a bounds check was CMP JLT ok CALL panicindex ok: ... The new code is (once the linker finishes with it): CMP JGE panic ... panic: CALL panicindex which moves the calls out of line, putting more useful code in each cache line. This matters especially in tight loops, such as in Fannkuch. The benefit is more modest elsewhere, but real. From test/bench/go1, amd64: benchmark old ns/op new ns/op delta BenchmarkBinaryTree17 6096092000 6088808000 -0.12% BenchmarkFannkuch11 6151404000 4020463000 -34.64% BenchmarkGobDecode 28990050 28894630 -0.33% BenchmarkGobEncode 12406310 12136730 -2.17% BenchmarkGzip 179923 179903 -0.01% BenchmarkGunzip 11219 11130 -0.79% BenchmarkJSONEncode 86429350 86515900 +0.10% BenchmarkJSONDecode 334593800 315728400 -5.64% BenchmarkRevcomp25M 1219763000 1180767000 -3.20% BenchmarkTemplate 492947600 483646800 -1.89% And 386: benchmark old ns/op new ns/op delta BenchmarkBinaryTree17 6354902000 6243000000 -1.76% BenchmarkFannkuch11 8043769000 7326965000 -8.91% BenchmarkGobDecode 19010800 18941230 -0.37% BenchmarkGobEncode 14077500 13792460 -2.02% BenchmarkGzip 194087 193619 -0.24% BenchmarkGunzip 12495 12457 -0.30% BenchmarkJSONEncode 125636400 125451400 -0.15% BenchmarkJSONDecode 696648600 685032800 -1.67% BenchmarkRevcomp25M 2058088000 2052545000 -0.27% BenchmarkTemplate 602140000 589876800 -2.04% To implement this, two new instruction forms: JLT target // same as always JLT $0, target // branch expected not taken JLT $1, target // branch expected taken The linker could also emit the prediction prefixes, but it does not: expected taken branches are reversed so that the expected case is not taken (as in example above), and the default expectaton for such a jump is not taken already. R=golang-dev, gri, r, dave CC=golang-dev https://golang.org/cl/6248049	2012-05-29 12:09:27 -04:00
Russ Cox	c6ce44822c	cmd/gc: faster code, mainly for rotate * Eliminate bounds check on known small shifts. * Rewrite x<<s \| x>>(32-s) as a rotate (constant s). * More aggressive (but still minimal) range analysis. R=ken, dave, iant CC=golang-dev https://golang.org/cl/6209077	2012-05-24 17:20:07 -04:00
Russ Cox	3d3b4906f9	cmd/6g: peephole fixes/additions * Shift/rotate by constant doesn't have to stop subprop. (also in 8g) * Remove redundant MOVLQZX instructions. * An attempt at issuing loads early. Good for 0.5% on a good day, might not be worth keeping. Need to understand more about whether the x86 looks ahead to what loads might be coming up. R=ken2, ken CC=golang-dev https://golang.org/cl/6203091	2012-05-24 12:11:32 -04:00
Russ Cox	8f8640a057	cmd/6g: allow use of R14, R15 now We stopped reserving them in 2009 or so. R=ken CC=golang-dev https://golang.org/cl/6215061	2012-05-21 12:59:26 -04:00
Rémy Oudompheng	061061e77c	cmd/6g: restore magic multiply for /=, %=. Also enables turning /= 2 in a right shift. Part of issue 2230. R=rsc CC=golang-dev, remy https://golang.org/cl/6012049	2012-04-13 10:12:31 +02:00
Russ Cox	e530d6a1e0	6c, 6g, 6l: add MOVQL to make truncation explicit Without an explicit signal for a truncation, copy propagation will sometimes propagate a 32-bit truncation and end up overwriting uses of the original 64-bit value. The case that arose in practice is in C but I believe that the same could plausibly happen in Go. The main reason we didn't run into the same in Go is that I (perhaps incorrectly?) drop MOVL AX, AX during gins, so the truncation was never generated, so it didn't confuse the optimizer. Fixes #1315. Fixes #3488. R=ken2 CC=golang-dev https://golang.org/cl/6002043	2012-04-10 12:51:59 -04:00
Rémy Oudompheng	f2ad374ae6	cmd/gc: don't believe that variables mentioned 256 times are unused. Such variables would be put at 0(SP), leading to serious corruptions at zero initialization. Fixes #3084. R=golang-dev, r CC=golang-dev, remy https://golang.org/cl/5683052	2012-02-21 16:38:01 +11:00
Russ Cox	8998835543	5g, 6g, 8g: flush modified globals aggressively The alternative is to record enough information that the trap handler know which registers contain cached globals and can flush the registers back to their original locations. That's significantly more work. This only affects globals that have been written to. Code that reads from a global should continue to registerize as well as before. Fixes #1304. R=ken2 CC=golang-dev https://golang.org/cl/5687046	2012-02-20 13:41:44 -05:00
Russ Cox	4e3f8e915f	gc, ld: tag data as no-pointers and allocate in separate section The garbage collector can avoid scanning this section, with reduces collection time as well as the number of false positives. Helps a little bit with issue 909, but certainly does not solve it. R=ken2 CC=golang-dev https://golang.org/cl/5671099	2012-02-19 03:19:52 -05:00
Shenghou Ma	6ed2b6c47d	5c, 6c, 8c, 6g, 8g: correct boundary checking CL 5666043 fixed the same checking for 5g. R=golang-dev, rsc CC=golang-dev https://golang.org/cl/5666045	2012-02-15 08:59:03 -05:00
Russ Cox	f91cc3bdbb	gc: optimize interface ==, != If the values being compared have different concrete types, then they're clearly unequal without needing to invoke the actual interface compare routine. This speeds tests for specific values, like if err == io.EOF, by about 3x. benchmark old ns/op new ns/op delta BenchmarkIfaceCmp100 843 287 -65.95% BenchmarkIfaceCmpNil100 184 182 -1.09% Fixes #2591. R=ken2 CC=golang-dev https://golang.org/cl/5651073	2012-02-11 00:19:24 -05:00
Russ Cox	ca5da31f83	6g: fix out of registers bug Fix it twice: reuse registers more aggressively in cgen abop, and also release R14 and R15, which are no longer m and g. Fixes #2669. R=ken2 CC=golang-dev https://golang.org/cl/5655056	2012-02-10 22:19:34 -05:00
Jamie Gennis	fff732ea2c	6g,8g: make constant propagation inlining-friendly. This changes makes constant propagation compare 'from' values using node pointers rather than symbol names when checking to see whether a set operation is redundant. When a function is inlined multiple times in a calling function its arguments will share symbol names even though the values are different. Prior to this fix the bug409 test would hit a case with 6g where an LEAQ instruction was incorrectly eliminated from the second inlined function call. 8g appears to have had the same bug, but the test did not fail there. R=golang-dev, rsc CC=golang-dev https://golang.org/cl/5646044	2012-02-08 10:25:13 -05:00
Russ Cox	fec7fa8b9d	build: delete make paraphernalia As a convenience to people working on the tools, leave Makefiles that invoke the go dist tool appropriately. They are not used during the build. R=golang-dev, bradfitz, n13m3y3r, gustavo CC=golang-dev https://golang.org/cl/5636050	2012-02-06 13:34:25 -05:00
Anthony Martin	e280035fc1	gc, cc: avoid using the wrong library when building the compilers This can happen on Plan 9 if we we're building with the 32-bit and 64-bit host compilers, one after the other. R=rsc CC=golang-dev https://golang.org/cl/5599053	2012-02-01 04:14:37 -08:00
Anthony Martin	6273d6e713	build: move the "-c" flag into HOST_CFLAGS On Plan 9 this flag is used to discover constant expressions in "if" statements. R=golang-dev, rsc CC=golang-dev https://golang.org/cl/5601060	2012-01-31 19:31:30 -08:00
Rob Pike	91cb3489ab	go: move compilers into the go-tool directory Also delete gotest, since it's messy to fix and slated for deletion anyway. A couple of things outside src can't be tested any more. "go test" will be fixed and these tests will be re-enabled. They're noisy for now. Fixes #284. R=rsc CC=golang-dev https://golang.org/cl/5598049	2012-01-30 14:46:31 -08:00
Luuk van Dijk	a6c49098bc	gc: Nicer errors before miscompiling. This fixes issue 2444. A big cleanup of all 31/32bit size boundaries i'll leave for another cl though. (see also issue 1700). R=rsc CC=golang-dev https://golang.org/cl/5484058	2012-01-10 11:19:22 +01:00
Russ Cox	196b663075	gc: implement == on structs and arrays To allow these types as map keys, we must fill in equal and hash functions in their algorithm tables. Structs or arrays that are "just memory", like [2]int, can and do continue to use the AMEM algorithm. Structs or arrays that contain special values like strings or interface values use generated functions for both equal and hash. The runtime helper func runtime.equal(t, x, y) bool handles the general equality case for x == y and calls out to the equal implementation in the algorithm table. For short values (<= 4 struct fields or array elements), the sequence of elementwise comparisons is inlined instead of calling runtime.equal. R=ken, mpimenov CC=golang-dev https://golang.org/cl/5451105	2011-12-12 22:22:09 -05:00
Russ Cox	8c0b699ca4	gc: fix another blank bug R=ken2 CC=golang-dev https://golang.org/cl/5478051	2011-12-09 11:59:21 -05:00
Russ Cox	be0ffbfd02	gc: implement character constant type rules R=ken2 CC=golang-dev https://golang.org/cl/5444054	2011-12-08 22:07:43 -05:00
Luuk van Dijk	6bee4e556f	gc: avoid re-genning ninit in branches involving float comparison. R=rsc CC=golang-dev https://golang.org/cl/5451050	2011-12-01 14:46:32 +01:00
Russ Cox	d604cf7808	5g, 6g: comment out uses of -r R=ken2 CC=golang-dev https://golang.org/cl/5299043	2011-10-18 14:55:28 -04:00
Russ Cox	e2d326b878	5g, 6g, 8g: fix loop finding bug, squash jmps The loop recognizer uses the standard dominance frontiers but gets confused by dead code, which has a (not explicitly set) rpo number of 0, meaning it looks like the head of the function, so it dominates everything. If the loop recognizer encounters dead code while tracking backward through the graph it fails to recognize where it started as a loop, and then the optimizer does not registerize values loaded inside that loop. Fix by checking rpo against rpo2r. Separately, run a quick pass over the generated code to squash JMPs to JMP instructions, which are convenient to emit during code generation but difficult to read when debugging the -S output. A side effect of this pass is to eliminate dead code, so the output files may be slightly smaller and the optimizer may have less work to do. There is no semantic effect, because the linkers flatten JMP chains and delete dead instructions when laying out the final code. Doing it here too just makes the -S output easier to read and more like what the final binary will contain. The "dead code breaks loop finding" bug is thus fixed twice over. It seemed prudent to fix loopit separately just in case dead code ever sneaks back in for one reason or another. R=ken2 CC=golang-dev https://golang.org/cl/5190043	2011-10-04 15:06:16 -04:00
Russ Cox	e419535f2a	5g, 6g, 8g: registerize variables again My previous CL: changeset: 9645:ce2e5f44b310 user: Russ Cox <rsc@golang.org> date: Tue Sep 06 10:24:21 2011 -0400 summary: gc: unify stack frame layout introduced a bug wherein no variables were being registerized, making Go programs 2-3x slower than they had been before. This CL fixes that bug (along with some others it was hiding) and adds a test that optimization makes at least one test case faster. R=ken2 CC=golang-dev https://golang.org/cl/5174045	2011-10-03 17:46:36 -04:00
Russ Cox	5ddf6255a1	gc: unify stack frame layout allocparams + tempname + compactframe all knew about how to place stack variables. Now only compactframe, renamed to allocauto, does the work. Until the last minute, each PAUTO variable is in its own space and has xoffset == 0. This might break 5g. I get failures in concurrent code running under qemu and I can't tell whether it's 5g's fault or qemu's. We'll see what the real ARM builders say. R=ken2 CC=golang-dev https://golang.org/cl/4973057	2011-09-06 10:24:21 -04:00
Russ Cox	919cb2ec7c	gc: fix zero-length struct eval Fixes #2232. R=ken2 CC=golang-dev https://golang.org/cl/4960054	2011-09-05 15:31:22 -04:00
Russ Cox	335da67e00	gc: make static initialization more static Does as much as possible in data layout instead of during the init function. Handles var x = y; var y = z as a special case too, because it is so prevalent in package unicode (var Greek = _Greek; var _Greek = []...). Introduces InitPlan description of initialized data so that it can be traversed multiple times (for example, in the copy handler). Cuts package unicode's init function size by 8x. All that remains there is map initialization, which is on the chopping block too. Fixes sinit.go test case. Aggregate DATA instructions at end of object file. Checkpoint. More to come. R=ken2 CC=golang-dev https://golang.org/cl/4969051	2011-08-31 07:37:14 -04:00
Russ Cox	4fb3c4f765	gc: fix div bug R=ken2 CC=golang-dev https://golang.org/cl/4950052	2011-08-30 08:47:28 -04:00
Lucio De Re	219c9e9c46	6g: fix build on Plan 9 src/cmd/6g/cgen.c src/cmd/6g/gobj.c src/cmd/6g/reg.c . dropped unused assignments; src/cmd/6g/gg.h . added varargck pragmas; src/cmd/6g/list.c . adjusted print format for ulong casts; src/cmd/6g/peep.c . dropped redundant increment; R=golang-dev CC=golang-dev, rsc https://golang.org/cl/4953049	2011-08-29 09:34:59 -04:00
Russ Cox	61f84a2cdc	gc: shuffle #includes #include "go.h" (or "gg.h") becomes #include <u.h> #include <libc.h> #include "go.h" so that go.y can #include <stdio.h> after <u.h> but before "go.h". This is necessary on Plan 9. R=ken2 CC=golang-dev https://golang.org/cl/4971041	2011-08-25 16:25:10 -04:00
Russ Cox	55db9fe730	build: fix unused parameters Found with gcc 4.6 -Wunused -Wextra but should be applicable to Plan 9 too. R=ken2 CC=golang-dev https://golang.org/cl/4958044	2011-08-25 16:08:13 -04:00
Russ Cox	5e188b40f2	build: avoid redundant bss declarations Some compilers care, sadly. R=iant, ken CC=golang-dev https://golang.org/cl/4931042	2011-08-23 22:39:14 -04:00
Russ Cox	28a23675cd	5g, 6g, 8g: shift, opt fixes Fixes #1808. R=ken2 CC=golang-dev https://golang.org/cl/4813052	2011-07-28 18:22:12 -04:00
Russ Cox	08bfb39515	6g, 8g: divide corner case Fixes #1772. R=ken2 CC=golang-dev https://golang.org/cl/4798062	2011-07-28 14:18:22 -04:00
Russ Cox	a84abbe508	gc: zero-width struct, zero-length array fixes Fixes #1774. Fixes #2095. Fixes #2097. R=ken2 CC=golang-dev https://golang.org/cl/4826046	2011-07-27 16:47:45 -04:00
Anthony Martin	028f74f827	5g, 6g, 8g: fix comments in method call generation R=golang-dev CC=golang-dev https://golang.org/cl/4652042	2011-06-20 14:49:29 -04:00
Russ Cox	7f4c5ea7d8	gc: implement goto restriction Remove now-unnecessary zeroing of stack frames. R=ken2 CC=golang-dev https://golang.org/cl/4641044	2011-06-17 15:25:05 -04:00
Russ Cox	e852202f37	gc: descriptive panic for nil pointer -> value method call R=ken2 CC=golang-dev https://golang.org/cl/4646042	2011-06-17 15:23:27 -04:00
Russ Cox	5a5a7b5163	6g, 8g: fix goto fix R=ken2 CC=golang-dev https://golang.org/cl/4632041	2011-06-16 01:25:49 -04:00
Russ Cox	5d9dbe19a7	gc: work around goto bug R=ken2 CC=golang-dev https://golang.org/cl/4629042	2011-06-16 00:18:43 -04:00
Luuk van Dijk	2ad42a8249	gc: frame compaction for arm. Required moving some parts of gc/pgen.c to ?g/ggen.c on linux tests pass for all 3 architectures, and frames are actually compacted (diagnostic code for that has been removed from the CL). R=rsc CC=golang-dev https://golang.org/cl/4571071	2011-06-14 17:03:37 +02:00
Luuk van Dijk	2ac375b2df	gc: compact stackframe After allocparams and walk, remove unused auto variables and re-layout the remaining in reverse alignment order. R=rsc CC=golang-dev https://golang.org/cl/4568068	2011-06-10 00:02:34 +02:00
Russ Cox	84f291b1bd	8g: compute register liveness during regopt Input code like 0000 (x.go:2) TEXT main+0(SB),$36-0 0001 (x.go:3) MOVL $5,i+-8(SP) 0002 (x.go:3) MOVL $0,i+-4(SP) 0003 (x.go:4) MOVL $1,BX 0004 (x.go:4) MOVL i+-8(SP),AX 0005 (x.go:4) MOVL i+-4(SP),DX 0006 (x.go:4) MOVL AX,autotmp_0000+-20(SP) 0007 (x.go:4) MOVL DX,autotmp_0000+-16(SP) 0008 (x.go:4) MOVL autotmp_0000+-20(SP),CX 0009 (x.go:4) CMPL autotmp_0000+-16(SP),$0 0010 (x.go:4) JNE ,13 0011 (x.go:4) CMPL CX,$32 0012 (x.go:4) JCS ,14 0013 (x.go:4) MOVL $0,BX 0014 (x.go:4) SHLL CX,BX 0015 (x.go:4) MOVL BX,x+-12(SP) 0016 (x.go:5) MOVL x+-12(SP),AX 0017 (x.go:5) CDQ , 0018 (x.go:5) MOVL AX,autotmp_0001+-28(SP) 0019 (x.go:5) MOVL DX,autotmp_0001+-24(SP) 0020 (x.go:5) MOVL autotmp_0001+-28(SP),AX 0021 (x.go:5) MOVL autotmp_0001+-24(SP),DX 0022 (x.go:5) MOVL AX,(SP) 0023 (x.go:5) MOVL DX,4(SP) 0024 (x.go:5) CALL ,runtime.printint+0(SB) 0025 (x.go:5) CALL ,runtime.printnl+0(SB) 0026 (x.go:6) RET , is problematic because the liveness range for autotmp_0000 (0006-0009) is nested completely inside a span where BX holds a live value (0003-0015). Because the register allocator only looks at 0006-0009 to see which registers are used, it misses the fact that BX is unavailable and uses it anyway. The n->pun = anyregalloc() check in tempname is a workaround for this bug, but I hit it again because I did the tempname call before allocating BX, even though I then used the temporary after storing in BX. This should fix the real bug, and then we can remove the workaround in tempname. The code creates pseudo-variables for each register and includes that information in the liveness propagation. Then the regu fields can be populated using that more complete information. With that approach, BX is marked as in use on every line in the whole span 0003-0015, so that the decision about autotmp_0000 (using only 0006-0009) still has all the information it needs. This is not specific to the 386, but it only happens in generated code of the form load R1 ... load var into R2 ... store R2 back into var ... use R1 and for the most part the other compilers generate the loads for a given compiled line before any of the stores. Even so, this may not be the case everywhere, so the change is worth making in all three. R=ken2, ken, ken CC=golang-dev https://golang.org/cl/4529106	2011-06-03 14:10:39 -04:00
Luuk van Dijk	e59aa8ea4a	gc: typecheck the whole tree before walking. preparation for some escape-analysis related changes. R=rsc CC=golang-dev https://golang.org/cl/4528116	2011-06-02 18:48:17 +02:00
Luuk van Dijk	d6b2925923	gc: inline append when len<cap issue 1604 R=rsc, bradfitz CC=golang-dev https://golang.org/cl/4313062	2011-05-11 16:35:11 +02:00
Russ Cox	bac8f18035	gc: fix order of operations for f() < g(). Also, 6g was passing uninitialized Node &n2 to regalloc, causing non-deterministic register collisions (but only when both left and right hand side of comparison had function calls). Fixes #1728. R=ken2 CC=golang-dev https://golang.org/cl/4425070	2011-04-26 00:57:03 -04:00
Russ Cox	3a1fdc655e	gc: fix import width bug Fixes #1705. R=ken2 CC=golang-dev https://golang.org/cl/4443060	2011-04-25 12:08:48 -04:00
Rob Pike	a89c0ff39e	for GCC4.6: fix a bunch of set-and-not-used errors. R=rsc CC=golang-dev https://golang.org/cl/4406048	2011-04-14 13:31:37 -07:00
Russ Cox	1bc84b7e18	ld: 25% faster The ld time was dominated by symbol table processing, so * increase hash table size * emit fewer symbols in gc (just 1 per string, 1 per type) * add read-only lookup to avoid creating spurious symbols * add linked list to speed whole-table traversals Breaks dwarf generator (no idea why), so disable dwarf. Reduces time for 6l to link godoc by 25%. R=ken2 CC=golang-dev https://golang.org/cl/4383047	2011-04-09 09:44:20 -04:00
Russ Cox	66f09fd459	gc: diagnose unused labels R=ken2 CC=golang-dev https://golang.org/cl/4287047	2011-03-15 14:05:37 -04:00
Eoghan Sherry	e6a934a1d9	6g: fix registerization of temporaries Use correct range in allocated register test. R=rsc, ken2 CC=golang-dev https://golang.org/cl/4073049	2011-02-01 12:12:42 -05:00
Russ Cox	50fe459ce2	6g: fix uint64(uintptr(unsafe.Pointer(&x))) Fixes #1417. R=ken2 CC=golang-dev https://golang.org/cl/4079042	2011-01-20 12:50:35 -05:00
Russ Cox	0849944694	gc: delete float, complex rename cmplx -> complex R=ken2 CC=golang-dev https://golang.org/cl/4071041	2011-01-19 23:08:11 -05:00
Russ Cox	e7a0f67603	gc: introduce explicit alignments No semantic changes here, but working toward being able to align structs based on the maximum alignment of the fields inside instead of having a fixed alignment for all structs (issue 482). R=ken2 CC=golang-dev https://golang.org/cl/3617041	2010-12-13 11:57:41 -05:00
Russ Cox	e48c0fb562	5g, 6g, 8g: generate code for string index instead of calling function. R=ken2 CC=golang-dev https://golang.org/cl/2762041	2010-10-26 21:11:17 -07:00
Russ Cox	c00f9f49bb	6g: avoid too-large immediate constants R=ken2 CC=golang-dev https://golang.org/cl/2566042	2010-10-20 00:40:06 -04:00
Russ Cox	d9c989fa25	various: avoid %ld etc The Plan 9 tools assume that long is 32 bits. We converted all instances of long to int32 when importing the code but missed the print formats. Because int32 is always int on the compilers we use, it is never correct to use %lux, %ld, etc. Convert to %ux, %d, etc. (It matters because on 64-bit gcc, long is 64 bits, so we were printing 32-bit quantities with 64-bit formats.) R=ken2 CC=golang-dev https://golang.org/cl/2491041	2010-10-13 16:20:22 -04:00
Russ Cox	30dd191171	gc: O(1) string comparison when lengths differ R=ken2 CC=golang-dev https://golang.org/cl/2331045	2010-10-06 09:53:12 -04:00
Russ Cox	698fb4f192	6g, 6l, 8g, 8l: move read-only data to text segment Changing 5g and 5l too, but it doesn't work yet. R=ken2 CC=golang-dev https://golang.org/cl/2136047	2010-09-12 00:17:44 -04:00
Russ Cox	1678dcc378	gc: more accurate line numbers for ATEXT and other begin and end of function code R=ken2 CC=golang-dev https://golang.org/cl/2158044	2010-09-09 17:11:51 -04:00
Russ Cox	aafe474ec9	build: $GOBIN defaults to $GOROOT/bin R=r CC=golang-dev https://golang.org/cl/1982049	2010-08-24 20:00:33 -04:00
Ken Thompson	96cbdd62b6	better job on 2007043 better registerization R=rsc CC=golang-dev https://golang.org/cl/1955049	2010-08-23 12:38:15 -07:00
Ken Thompson	3dc3ef4cf7	attempt to gete better registeration from the builtin structures (strings, slices, interfaces) R=rsc CC=golang-dev https://golang.org/cl/2007043	2010-08-19 18:18:51 -07:00
Ken Thompson	5b0c317c9c	code optimization on slices R=rsc CC=golang-dev https://golang.org/cl/1942043	2010-08-13 19:39:36 -07:00
Russ Cox	1d77ff5b6b	6g, 8g: handle slice by sub-word-sized index (uint8, int8, uint16, int16) R=ken2 CC=golang-dev https://golang.org/cl/1960042	2010-08-11 22:27:47 -07:00
Russ Cox	9bac9d23d3	gc: index bounds tests and fixes move constant index checking to front end x[2:1] is a compile-time error now too R=ken2 CC=golang-dev https://golang.org/cl/1848056	2010-08-03 00:26:02 -07:00
Russ Cox	f930d28164	5g: fix build R=ken2 CC=golang-dev https://golang.org/cl/1893042	2010-07-27 13:43:58 -07:00
Russ Cox	607eaea456	gc: fix smaller-than-pointer-sized receivers in interfaces Fixes #812. R=ken2 CC=golang-dev https://golang.org/cl/1904041	2010-07-26 15:25:10 -07:00
Russ Cox	ece6a8c549	gc: bug293 Fixes #846. R=ken2 CC=golang-dev https://golang.org/cl/1862042	2010-07-15 16:14:06 -07:00
Russ Cox	b2a919fc29	gc: issue 894 Fixes #894. R=ken2 CC=golang-dev https://golang.org/cl/1701051	2010-07-15 15:25:32 -07:00
Ken Thompson	1246ad8390	code gen bug in len(nil) and cap(nil) fixes #892 R=rsc CC=golang-dev https://golang.org/cl/1745042	2010-06-29 12:48:24 -07:00
Russ Cox	a212d174ac	gc: better error messages for interface failures, conversions x.go:13: cannot use t (type T) as type Reader in assignment: T does not implement Reader (Read method requires pointer receiver) x.go:19: cannot use q (type Q) as type Reader in assignment: Q does not implement Reader (missing Read method) have read() want Read() x.go:22: cannot use z (type int) as type Reader in assignment: int does not implement Reader (missing Read method) x.go:24: too many arguments to conversion to complex: complex(1, 3) R=ken2 CC=golang-dev https://golang.org/cl/1736041	2010-06-20 11:45:53 -07:00
Russ Cox	565b5dc076	gc: new typechecking rules * Code for assignment, conversions now mirrors spec. * Changed some snprint -> smprint. * Renamed runtime functions to separate interface conversions from type assertions: convT2I, assertI2T, etc. * Correct checking of \U sequences. Fixes #840. Fixes #830. Fixes #778. R=ken2 CC=golang-dev https://golang.org/cl/1303042	2010-06-08 18:50:02 -07:00

1 2 3 4 5 ...

440 Commits