]> www.infradead.org Git - users/hch/dma-mapping.git/commitdiff
bpf, x64: Drop "pop %rcx" instruction on BPF JIT epilogue
authorMaciej Fijalkowski <maciej.fijalkowski@intel.com>
Tue, 29 Sep 2020 20:46:52 +0000 (22:46 +0200)
committerAlexei Starovoitov <ast@kernel.org>
Tue, 29 Sep 2020 23:47:39 +0000 (16:47 -0700)
Back when all of the callee-saved registers where always pushed to stack
in x64 JIT prologue, tail call counter was placed at the bottom of the
BPF program's stack frame that had a following layout:

+-------------+
|  ret addr   |
+-------------+
|     rbp     | <- rbp
+-------------+
|             |
| free space  |
| from:       |
| sub $x,%rsp |
|             |
+-------------+
|     rbx     |
+-------------+
|     r13     |
+-------------+
|     r14     |
+-------------+
|     r15     |
+-------------+
|  tail call  | <- rsp
|   counter   |
+-------------+

In order to restore the callee saved registers, epilogue needed to
explicitly toss away the tail call counter via "pop %rbx" insn, so that
%rsp would be back at the place where %r15 was stored.

Currently, the tail call counter is placed on stack *before* the callee
saved registers (brackets on rbx through r15 mean that they are now
pushed to stack only if they are used):

+-------------+
|  ret addr   |
+-------------+
|     rbp     | <- rbp
+-------------+
|             |
| free space  |
| from:       |
| sub $x,%rsp |
|             |
+-------------+
|  tail call  |
|   counter   |
+-------------+
(     rbx     )
+-------------+
(     r13     )
+-------------+
(     r14     )
+-------------+
(     r15     ) <- rsp
+-------------+

For the record, the epilogue insns consist of (assuming all of the
callee saved registers are used by program):
pop    %r15
pop    %r14
pop    %r13
pop    %rbx
pop    %rcx
leaveq
retq

"pop %rbx" for getting rid of tail call counter was not an option
anymore as it would overwrite the restored value of %rbx register, so it
was changed to use the %rcx register.

Since epilogue can start popping the callee saved registers right away
without any additional work, the "pop %rcx" could be dropped altogether
as "leave" insn will simply move the %rbp to %rsp. IOW, tail call
counter does not need the explicit handling.

Having in mind the explanation above and the actual reason for that,
let's piggy back on "leave" insn for discarding the tail call counter
from stack and remove the "pop %rcx" from epilogue.

Signed-off-by: Maciej Fijalkowski <maciej.fijalkowski@intel.com>
Signed-off-by: Alexei Starovoitov <ast@kernel.org>
Link: https://lore.kernel.org/bpf/20200929204653.4325-2-maciej.fijalkowski@intel.com
arch/x86/net/bpf_jit_comp.c

index 26f43279b78b8b90a296af77b8874258aa1352f1..a263918043cebdb91041519bf1d873b28495e51a 100644 (file)
@@ -1441,8 +1441,6 @@ emit_jmp:
                        /* Update cleanup_addr */
                        ctx->cleanup_addr = proglen;
                        pop_callee_regs(&prog, callee_regs_used);
-                       if (tail_call_reachable)
-                               EMIT1(0x59); /* pop rcx, get rid of tail_call_cnt */
                        EMIT1(0xC9);         /* leave */
                        EMIT1(0xC3);         /* ret */
                        break;