www.infradead.org Git - users/willy/linux.git/log

]> www.infradead.org Git - users/willy/linux.git/log

projects / users / willy / linux.git / log

Matthew Wilcox [Sun, 20 Jan 2019 16:03:51 +0000 (11:03 -0500)]

mm: Move kvmalloc declarations to slab.h

Contrary to popular belief, not every file in the kernel includes mm.h
and this helps transition users from kmalloc to kvmalloc.

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Sun, 20 Jan 2019 15:53:38 +0000 (10:53 -0500)]

printf: Convert ip4_addr_string_sa to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Thu, 20 Dec 2018 21:11:04 +0000 (16:11 -0500)]

printf: Convert ip6_addr_string_sa to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Thu, 20 Dec 2018 21:10:50 +0000 (16:10 -0500)]

printf: Convert escaped_string to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Thu, 20 Dec 2018 14:14:41 +0000 (09:14 -0500)]

printf: Convert uuid_string to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Thu, 20 Dec 2018 04:00:33 +0000 (23:00 -0500)]

printf: Convert mac_address_string to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 21:03:50 +0000 (16:03 -0500)]

printf: Convert dentry_name to printf_char

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 20:59:27 +0000 (15:59 -0500)]

printf: Add printf_char

Convert all open-coded versions.

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 17:50:30 +0000 (12:50 -0500)]

printf: Convert device_node_string to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 17:05:51 +0000 (12:05 -0500)]

printf: Convert flags_string to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 15:49:29 +0000 (10:49 -0500)]

printf: Convert bitmap_string to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 15:44:30 +0000 (10:44 -0500)]

printf: Convert bitmap_list_string to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 15:38:18 +0000 (10:38 -0500)]

printf: Convert pointer_string to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 15:35:24 +0000 (10:35 -0500)]

printf: Convert restricted_pointer to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 15:33:10 +0000 (10:33 -0500)]

printf: Convert ptr_to_id to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 15:30:06 +0000 (10:30 -0500)]

printf: Convert clock to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 15:25:44 +0000 (10:25 -0500)]

printf: Convert special_hex_number to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 15:21:47 +0000 (10:21 -0500)]

printf: Convert address_val to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 14:13:51 +0000 (09:13 -0500)]

printf: Convert netdev_bits to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 19 Dec 2018 13:59:58 +0000 (08:59 -0500)]

printf: Convert hex_string to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Sat, 15 Dec 2018 10:58:12 +0000 (05:58 -0500)]

printf: Convert resource_string to printf_state

commit | commitdiff | tree

Matthew Wilcox [Wed, 12 Dec 2018 19:11:23 +0000 (14:11 -0500)]

printf: Convert symbol_string to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 12 Dec 2018 19:00:08 +0000 (14:00 -0500)]

printf: Convert dentry_name() to printf_state

Turn %pD into a fallthrough instead of a separate call.
Add printf_widen_string() as widen_string() still has legacy callers.

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 12 Dec 2018 18:52:06 +0000 (13:52 -0500)]

printf: Convert bdev_name() to printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 12 Dec 2018 18:46:09 +0000 (13:46 -0500)]

printf: Convert pointer() to take a printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Matthew Wilcox [Wed, 12 Dec 2018 16:56:51 +0000 (11:56 -0500)]

Introduce printf_state

Signed-off-by: Matthew Wilcox <willy@infradead.org>

commit | commitdiff | tree

Stephen Rothwell [Mon, 10 Dec 2018 09:12:07 +0000 (20:12 +1100)]

Add linux-next specific files for 20181210

Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Stephen Rothwell [Mon, 10 Dec 2018 08:31:09 +0000 (19:31 +1100)]

Merge branch 'akpm/master'

commit | commitdiff | tree

Andi Kleen [Wed, 5 Dec 2018 00:14:27 +0000 (11:14 +1100)]

drivers/media/platform/sti/delta/delta-ipc.c: fix read buffer overflow

The single caller passes a string to delta_ipc_open, which copies with a
fixed size larger than the string.  So it copies some random data after
the original string the ro segment.

If the string was at the end of a page it may fault.

Just copy the string with a normal strcpy after clearing the field.

Found by a LTO build (which errors out)
because the compiler inlines the functions and can resolve
the string sizes and triggers the compile time checks in memcpy.

In function `memcpy',
    inlined from `delta_ipc_open.constprop' at linux/drivers/media/platform/sti/delta/delta-ipc.c:178:0,
    inlined from `delta_mjpeg_ipc_open' at linux/drivers/media/platform/sti/delta/delta-mjpeg-dec.c:227:0,
    inlined from `delta_mjpeg_decode' at linux/drivers/media/platform/sti/delta/delta-mjpeg-dec.c:403:0:
/home/andi/lsrc/linux/include/linux/string.h:337:0: error: call to `__read_overflow2' declared with attribute error: detected read beyond size of object passed as 2nd parameter
    __read_overflow2();

Link: http://lkml.kernel.org/r/20171222001212.1850-1-andi@firstfloor.org
Signed-off-by: Andi Kleen <ak@linux.intel.com>
Cc: Hugues FRUCHET <hugues.fruchet@st.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Nikolay Borisov [Wed, 5 Dec 2018 00:14:26 +0000 (11:14 +1100)]

fs: don't open code lru_to_page()

Multiple filesystems open code lru_to_page(). Rectify this by moving the
macro from mm_inline (which is specific to lru stuff) to the more generic
mm.h header and start using the macro where appropriate.

No functional changes.

Link: http://lkml.kernel.org/r/20181129104810.23361-1-nborisov@suse.com
Link: https://lkml.kernel.org/r/20181129075301.29087-1-nborisov@suse.com
Signed-off-by: Nikolay Borisov <nborisov@suse.com>
Acked-by: Michal Hocko <mhocko@suse.com>
Reviewed-by: David Hildenbrand <david@redhat.com>
Reviewed-by: Mike Rapoport <rppt@linux.ibm.com>
Acked-by: Pankaj gupta <pagupta@redhat.com>
Acked-by: "Yan, Zheng" <zyan@redhat.com> [ceph]
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Andrei Vagin [Wed, 5 Dec 2018 00:14:26 +0000 (11:14 +1100)]

include/linux/sched/signal.h: replace `tsk' with `task'

This file uses "task" 85 times and "tsk" 25 times. It is better to be
consistent.

Link: http://lkml.kernel.org/r/20181129180547.15976-1-avagin@gmail.com
Signed-off-by: Andrei Vagin <avagin@gmail.com>
Reviewed-by: Andrew Morton <akpm@linux-foundation.org>
Cc: Oleg Nesterov <oleg@redhat.com>
Cc: "Eric W. Biederman" <ebiederm@xmission.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Andrew Morton [Wed, 5 Dec 2018 00:14:26 +0000 (11:14 +1100)]

fs-remove-caller-signal_pending-branch-predictions-fix

fix fs/buffer.c

Cc: Davidlohr Bueso <dave@stgolabs.net>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Davidlohr Bueso [Wed, 5 Dec 2018 00:14:26 +0000 (11:14 +1100)]

fs/: remove caller signal_pending branch predictions

This is already done for us internally by the signal machinery.

Link: http://lkml.kernel.org/r/20181116002713.8474-7-dave@stgolabs.net
Signed-off-by: Davidlohr Bueso <dave@stgolabs.net>
Reviewed-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Davidlohr Bueso [Wed, 5 Dec 2018 00:14:26 +0000 (11:14 +1100)]

mm/: remove caller signal_pending branch predictions

This is already done for us internally by the signal machinery.

Link: http://lkml.kernel.org/r/20181116002713.8474-5-dave@stgolabs.net
Signed-off-by: Davidlohr Bueso <dave@stgolabs.net>
Reviewed-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Davidlohr Bueso [Wed, 5 Dec 2018 00:14:25 +0000 (11:14 +1100)]

arch/arc/mm/fault.c: remove caller signal_pending_branch predictions

This is already done for us internally by the signal machinery.

Link: http://lkml.kernel.org/r/20181116002713.8474-4-dave@stgolabs.net
Signed-off-by: Davidlohr Bueso <dave@stgolabs.net>
Reviewed-by: Andrew Morton <akpm@linux-foundation.org>
Cc: Vineet Gupta <vgupta@synopsys.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Davidlohr Bueso [Wed, 5 Dec 2018 00:14:25 +0000 (11:14 +1100)]

kernel/sched/: remove caller signal_pending branch predictions

This is already done for us internally by the signal machinery.

Link: http://lkml.kernel.org/r/20181116002713.8474-3-dave@stgolabs.net
Signed-off-by: Davidlohr Bueso <dave@stgolabs.net>
Reviewed-by: Andrew Morton <akpm@linux-foundation.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Davidlohr Bueso [Wed, 5 Dec 2018 00:14:25 +0000 (11:14 +1100)]

kernel/locking/mutex.c: remove caller signal_pending branch predictions

This is already done for us internally by the signal machinery.

Link: http://lkml.kernel.org/r/20181116002713.8474-2-dave@stgolabs.net
Signed-off-by: Davidlohr Bueso <dave@stgolabs.net>
Reviewed-by: Andrew Morton <akpm@linux-foundation.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Dave Rodgman [Wed, 5 Dec 2018 00:14:25 +0000 (11:14 +1100)]

zram: default to lzo-rle instead of lzo

lzo-rle gives higher performance and similar compression ratios to lzo.

Testing with 80 browser tabs showed a 27% reduction in total time spent
(de)compressing data during swapping.

Link: http://lkml.kernel.org/r/20181130142600.13782-9-dave.rodgman@arm.com
Signed-off-by: Dave Rodgman <dave.rodgman@arm.com>
Cc: David S. Miller <davem@davemloft.net>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Herbert Xu <herbert@gondor.apana.org.au>
Cc: Markus F.X.J. Oberhumer <markus@oberhumer.com>
Cc: Matt Sealey <matt.sealey@arm.com>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Nitin Gupta <nitingupta910@gmail.com>
Cc: Richard Purdie <rpurdie@openedhand.com>
Cc: Sergey Senozhatsky <sergey.senozhatsky.work@gmail.com>
Cc: Sonny Rao <sonnyrao@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Dave Rodgman [Wed, 5 Dec 2018 00:14:25 +0000 (11:14 +1100)]

lib-lzo-separate-lzo-rle-from-lzo-v4

v4

Link: http://lkml.kernel.org/r/20181130142600.13782-8-dave.rodgman@arm.com
Signed-off-by: Dave Rodgman <dave.rodgman@arm.com>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Sergey Senozhatsky <sergey.senozhatsky.work@gmail.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Dave Rodgman [Wed, 5 Dec 2018 00:14:24 +0000 (11:14 +1100)]

lib/lzo: separate lzo-rle from lzo

To prevent any issues with persistent data, separate lzo-rle
from lzo so that it is treated as a separate algorithm, and
lzo is still available.

Use lzo-rle as the default algorithm for
zram.

Link: http://lkml.kernel.org/r/20181127161913.23863-8-dave.rodgman@arm.com
Signed-off-by: Dave Rodgman <dave.rodgman@arm.com>
Cc: David S. Miller <davem@davemloft.net>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Herbert Xu <herbert@gondor.apana.org.au>
Cc: Markus F.X.J. Oberhumer <markus@oberhumer.com>
Cc: Matt Sealey <matt.sealey@arm.com>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Nitin Gupta <nitingupta910@gmail.com>
Cc: Richard Purdie <rpurdie@openedhand.com>
Cc: Sergey Senozhatsky <sergey.senozhatsky.work@gmail.com>
Cc: Sonny Rao <sonnyrao@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Dave Rodgman [Wed, 5 Dec 2018 00:14:24 +0000 (11:14 +1100)]

lib-lzo-implement-run-length-encoding-v4.txt

fix warning

Link: http://lkml.kernel.org/r/20181130142600.13782-7-dave.rodgman@arm.com
Signed-off-by: Dave Rodgman <dave.rodgman@arm.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Dave Rodgman [Wed, 5 Dec 2018 00:14:24 +0000 (11:14 +1100)]

lib/lzo: implement run-length encoding

When using zram, we frequently encounter long runs of zero bytes.
This adds a special case which identifies runs of zeros and encodes
them using run-length encoding.

This is faster for both compression and decompresion. For
high-entropy data which doesn't hit this case, impact is minimal.

Compression ratio is within a few percent in all cases.

This modifies the bitstream in a way which is backwards compatible
(i.e., we can decompress old bitstreams, but old versions of lzo
cannot decompress new bitstreams).

Link: http://lkml.kernel.org/r/20181127161913.23863-7-dave.rodgman@arm.com
Signed-off-by: Dave Rodgman <dave.rodgman@arm.com>
Cc: David S. Miller <davem@davemloft.net>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Herbert Xu <herbert@gondor.apana.org.au>
Cc: Markus F.X.J. Oberhumer <markus@oberhumer.com>
Cc: Matt Sealey <matt.sealey@arm.com>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Nitin Gupta <nitingupta910@gmail.com>
Cc: Richard Purdie <rpurdie@openedhand.com>
Cc: Sergey Senozhatsky <sergey.senozhatsky.work@gmail.com>
Cc: Sonny Rao <sonnyrao@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Matt Sealey [Wed, 5 Dec 2018 00:14:24 +0000 (11:14 +1100)]

lib/lzo: fast 8-byte copy on arm64

Enable faster 8-byte copies on arm64.

Link: http://lkml.kernel.org/r/20181127161913.23863-6-dave.rodgman@arm.com
Signed-off-by: Matt Sealey <matt.sealey@arm.com>
Signed-off-by: Dave Rodgman <dave.rodgman@arm.com>
Cc: David S. Miller <davem@davemloft.net>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Herbert Xu <herbert@gondor.apana.org.au>
Cc: Markus F.X.J. Oberhumer <markus@oberhumer.com>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Nitin Gupta <nitingupta910@gmail.com>
Cc: Richard Purdie <rpurdie@openedhand.com>
Cc: Sergey Senozhatsky <sergey.senozhatsky.work@gmail.com>
Cc: Sonny Rao <sonnyrao@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Matt Sealey [Wed, 5 Dec 2018 00:14:24 +0000 (11:14 +1100)]

lib/lzo: 64-bit CTZ on arm64

LZO leaves some performance on the table by not realising that arm64 can
optimize count-trailing-zeros bit operations.

Add CONFIG_ARM64 to the checked definitions alongside CONFIG_X86_64 to
enable the use of rbit/clz instructions on full 64-bit quantities.

Link: http://lkml.kernel.org/r/20181127161913.23863-5-dave.rodgman@arm.com
Signed-off-by: Matt Sealey <matt.sealey@arm.com>
Signed-off-by: Dave Rodgman <dave.rodgman@arm.com>
Cc: David S. Miller <davem@davemloft.net>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Herbert Xu <herbert@gondor.apana.org.au>
Cc: Markus F.X.J. Oberhumer <markus@oberhumer.com>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Nitin Gupta <nitingupta910@gmail.com>
Cc: Richard Purdie <rpurdie@openedhand.com>
Cc: Sergey Senozhatsky <sergey.senozhatsky.work@gmail.com>
Cc: Sonny Rao <sonnyrao@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Matt Sealey [Wed, 5 Dec 2018 00:14:23 +0000 (11:14 +1100)]

lib/lzo: enable 64-bit CTZ on Arm

ARMv6 Thumb state introduced an RBIT instruction which, combined with CLZ
as present in ARMv5, introduces an extremely fast path for counting
trailing zeroes.

Enable the use of the GCC builtin for this on ARMv6+ with
CONFIG_THUMB2_KERNEL to ensure we get the 'new' instruction usage.

We do not bother enabling LZO_USE_CTZ64 support for ARMv5 as the builtin
code path does the same thing as the LZO_USE_CTZ32 code, only with more
register pressure.

Link: http://lkml.kernel.org/r/20181127161913.23863-4-dave.rodgman@arm.com
Signed-off-by: Matt Sealey <matt.sealey@arm.com>
Signed-off-by: Dave Rodgman <dave.rodgman@arm.com>
Cc: David S. Miller <davem@davemloft.net>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Herbert Xu <herbert@gondor.apana.org.au>
Cc: Markus F.X.J. Oberhumer <markus@oberhumer.com>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Nitin Gupta <nitingupta910@gmail.com>
Cc: Richard Purdie <rpurdie@openedhand.com>
Cc: Sergey Senozhatsky <sergey.senozhatsky.work@gmail.com>
Cc: Sonny Rao <sonnyrao@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Matt Sealey [Wed, 5 Dec 2018 00:14:23 +0000 (11:14 +1100)]

lib/lzo: clean-up by introducing COPY16

Most compilers should be able to merge adjacent loads/stores of sizes
which are less than but effect a multiple of a machine word size (in
effect a memcpy() of a constant amount). However the semantics of the
macro are that it just does the copy, the pointer increment is in the
code, hence we see

    *a = *b
    a += 8
    b += 8
    *a = *b
    a += 8
    b += 8

This introduces a dependency between the two groups of statements which
seems to defeat said compiler optimizers and generate some very strange
sequences of addition and subtraction of address offsets (i.e. it is
overcomplicated).

Since COPY8 is only ever used to copy amounts of 16 bytes (in pairs),
just define COPY16 as COPY8,COPY8. We leave the definition to preserve
the need to do unaligned accesses to machine-sized words per the
original code intent, we just don't use it in the code proper.

COPY16 then gives us code like:

    *a = *b
    *(a+8) = *(b+8)
    a += 16
    b += 16

This seems to allow compilers to generate much better code by using
base register writeback or simply positively incrementing offsets which
seems to positively affect performance. It is, at least, fewer
instructions to do the same job.

Link: http://lkml.kernel.org/r/20181127161913.23863-3-dave.rodgman@arm.com
Signed-off-by: Matt Sealey <matt.sealey@arm.com>
Signed-off-by: Dave Rodgman <dave.rodgman@arm.com>
Cc: David S. Miller <davem@davemloft.net>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Herbert Xu <herbert@gondor.apana.org.au>
Cc: Markus F.X.J. Oberhumer <markus@oberhumer.com>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Nitin Gupta <nitingupta910@gmail.com>
Cc: Richard Purdie <rpurdie@openedhand.com>
Cc: Sergey Senozhatsky <sergey.senozhatsky.work@gmail.com>
Cc: Sonny Rao <sonnyrao@google.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

Dave Rodgman [Wed, 5 Dec 2018 00:14:23 +0000 (11:14 +1100)]

lib/lzo: tidy-up ifdefs

Patch series "lib/lzo: performance improvements", v4.

This series introduces performance improvements for lzo.

The previous version of this patchset is here:
https://lkml.org/lkml/2018/11/21/625

This version tidies up the ifdefs as per Christoph's comment (although
certainly more could be done, this is at least a bit more consistent
with normal kernel coding style).

On 23/11/2018 2:12 am, Sergey Senozhatsky wrote:

>> The graph below shows the weighted round-trip throughput of lzo, lz4 and
>> lzo-rle, for randomly generated 4k chunks of data with varying levels of
>> entropy. (To calculate weighted round-trip throughput, compression performance
>> is emphasised to reflect the fact that zram does around 2.25x more compression
>> than decompression.
>
> Right. The number is data dependent. Not all swapped out pages can be
> compressed; compressed pages that end up being >= zs_huge_class_size() are
> considered incompressible and stored as it.
>
> I'd say that on my setups around 50-60% of pages are incompressible.

So, just to give a bit more detail: the test setup was a Samsung
Chromebook Pro, cycling through 80 tabs in Chrome. With lzo-rle, only
5% of pages increased in size, and 90% of pages compress to 75% of
original size (or better). Mean compression ratio was 41%. Importantly
for lzo-rle, there are a lot of low-entropy pages where it can do well:
in total about 20% of the data is zeros forming part of a run of 4 or
more bytes.

As a quick summary of the impact of these patches on bigger chunks of
data, I've compared the performance of four different variants of lzo
on two large (~40 MB) files. The numbers show round-trip throughput
in MB/s:

Variant         | Low-entropy | High-entropy
Current lzo     |  242        | 157
Arm opts        |  290        | 159
RLE             |  876        | 151
Arm opts + RLE  | 1150        | 181

So both the Arm optimisations (8,16-byte copy & CTZ patches), and the
RLE implementation make a significant contribution to the overall
performance uplift.

This patch (of 8):

Modify the ifdefs in lzodefs.h to be more consistent with normal kernel
macros (e.g., change __aarch64__ to CONFIG_ARM64).

Link: http://lkml.kernel.org/r/20181127161913.23863-2-dave.rodgman@arm.com
Signed-off-by: Dave Rodgman <dave.rodgman@arm.com>
Cc: Herbert Xu <herbert@gondor.apana.org.au>
Cc: David S. Miller <davem@davemloft.net>
Cc: Nitin Gupta <nitingupta910@gmail.com>
Cc: Richard Purdie <rpurdie@openedhand.com>
Cc: Markus F.X.J. Oberhumer <markus@oberhumer.com>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Sergey Senozhatsky <sergey.senozhatsky.work@gmail.com>
Cc: Sonny Rao <sonnyrao@google.com>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Matt Sealey <matt.sealey@arm.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree

David Hildenbrand [Wed, 5 Dec 2018 00:14:23 +0000 (11:14 +1100)]

pm-hibernate-exclude-all-pageoffline-pages-v2

In saveable_highmem_page(), move the PageReserved() check to a new
check along with the PageOffline() check to separate it from the
swsusp checks.

Link: http://lkml.kernel.org/r/20181122100627.5189-9-david@redhat.com
Signed-off-by: David Hildenbrand <david@redhat.com>
Acked-by: Pavel Machek <pavel@ucw.cz>
Acked-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Cc: "Rafael J. Wysocki" <rjw@rjwysocki.net>
Cc: Pavel Machek <pavel@ucw.cz>
Cc: Len Brown <len.brown@intel.com>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Matthew Wilcox <willy@infradead.org>
Cc: Michal Hocko <mhocko@suse.com>
Cc: "Michael S. Tsirkin" <mst@redhat.com>
Cc: Alexander Duyck <alexander.h.duyck@linux.intel.com>
Cc: Alexey Dobriyan <adobriyan@gmail.com>
Cc: Arnd Bergmann <arnd@arndb.de>
Cc: Baoquan He <bhe@redhat.com>
Cc: Borislav Petkov <bp@alien8.de>
Cc: Boris Ostrovsky <boris.ostrovsky@oracle.com>
Cc: Christian Hansen <chansen3@cisco.com>
Cc: Dave Young <dyoung@redhat.com>
Cc: David Rientjes <rientjes@google.com>
Cc: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Cc: Haiyang Zhang <haiyangz@microsoft.com>
Cc: Jonathan Corbet <corbet@lwn.net>
Cc: Juergen Gross <jgross@suse.com>
Cc: Julien Freche <jfreche@vmware.com>
Cc: Kairui Song <kasong@redhat.com>
Cc: Kazuhito Hagio <k-hagio@ab.jp.nec.com>
Cc: "Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>
Cc: Konstantin Khlebnikov <koct9i@gmail.com>
Cc: "K. Y. Srinivasan" <kys@microsoft.com>
Cc: Lianbo Jiang <lijiang@redhat.com>
Cc: Michal Hocko <mhocko@kernel.org>
Cc: Mike Rapoport <rppt@linux.vnet.ibm.com>
Cc: Miles Chen <miles.chen@mediatek.com>
Cc: Nadav Amit <namit@vmware.com>
Cc: Naoya Horiguchi <n-horiguchi@ah.jp.nec.com>
Cc: Omar Sandoval <osandov@fb.com>
Cc: Pankaj gupta <pagupta@redhat.com>
Cc: Pavel Tatashin <pasha.tatashin@oracle.com>
Cc: Stefano Stabellini <sstabellini@kernel.org>
Cc: Stephen Hemminger <sthemmin@microsoft.com>
Cc: Stephen Rothwell <sfr@canb.auug.org.au>
Cc: Vitaly Kuznetsov <vkuznets@redhat.com>
Cc: Vlastimil Babka <vbabka@suse.cz>
Cc: Xavier Deguillard <xdeguillard@vmware.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au>

commit | commitdiff | tree