I tried this patch with a simple benchmark that uses an AF_UNIX
stream socket pair to simulate a sendfile() to /dev/null. Throughput
goes up by about 1.5% when sending a large (8GB) memory-resident
file.
Edit: I had previously claimed a larger improvement, but had accidentally
been testing with the patch from D11943 applied as well.
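
For anyone who wants to try something similar, here is a rough sketch of
this kind of benchmark. It is not the program I actually ran. It assumes
a platform where sendfile() accepts an AF_UNIX stream socket as the
destination, and the names make_file, drain and sendfile_throughput are
made up for illustration. A reader thread discards everything it
receives, standing in for /dev/null.

```python
import os
import socket
import tempfile
import threading
import time


def make_file(size):
    """Create a temporary file of `size` zero bytes and return its path."""
    fd, path = tempfile.mkstemp()
    with os.fdopen(fd, "wb") as f:
        f.write(bytes(size))
    return path


def drain(sock, bufsize=1 << 16):
    """Read and discard data until EOF, standing in for /dev/null."""
    buf = bytearray(bufsize)
    while sock.recv_into(buf):
        pass


def sendfile_throughput(path):
    """Send `path` over an AF_UNIX stream socket pair.

    Returns (bytes_sent, elapsed_seconds).
    """
    tx, rx = socket.socketpair(socket.AF_UNIX, socket.SOCK_STREAM)
    reader = threading.Thread(target=drain, args=(rx,))
    reader.start()

    size = os.path.getsize(path)
    sent = 0
    start = time.perf_counter()
    with open(path, "rb") as f:
        # sendfile() may send fewer bytes than requested, so loop.
        while sent < size:
            n = os.sendfile(tx.fileno(), f.fileno(), sent, size - sent)
            if n == 0:
                break
            sent += n
    tx.close()       # signals EOF to the reader
    reader.join()    # include the time needed to drain the socket
    elapsed = time.perf_counter() - start
    rx.close()
    return sent, elapsed
```

To approximate the memory-resident case, read the file once beforehand so
it is in the page cache, then compare sent / elapsed with and without the
patch over several runs.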