]> git.baikalelectronics.ru Git - kernel.git/commit
drbd: fix potential silent data corruption
authorLars Ellenberg <lars.ellenberg@linbit.com>
Wed, 30 Mar 2022 18:55:51 +0000 (20:55 +0200)
committerJens Axboe <axboe@kernel.dk>
Wed, 30 Mar 2022 18:58:56 +0000 (12:58 -0600)
commitc0bc92c1bf144e6dec101e6a3d3ca31635e3a57f
treef623952cb596b626bf8df60776fedc51dba209b2
parentc07ed2b85d4fdb870949043513ad595ea2020305
drbd: fix potential silent data corruption

Scenario:
---------

bio chain generated by blk_queue_split().
Some split bio fails and propagates its error status to the "parent" bio.
But then the (last part of the) parent bio itself completes without error.

We would clobber the already recorded error status with BLK_STS_OK,
causing silent data corruption.

Reproducer:
-----------

How to trigger this in the real world within seconds:

DRBD on top of degraded parity raid,
small stripe_cache_size, large read_ahead setting.
Drop page cache (sysctl vm.drop_caches=1, fadvise "DONTNEED",
umount and mount again, "reboot").

Cause significant read ahead.

Large read ahead request is split by blk_queue_split().
Parts of the read ahead that are already in the stripe cache,
or find an available stripe cache to use, can be serviced.
Parts of the read ahead that would need "too much work",
would need to wait for a "stripe_head" to become available,
are rejected immediately.

For larger read ahead requests that are split in many pieces, it is very
likely that some "splits" will be serviced, but then the stripe cache is
exhausted/busy, and the remaining ones will be rejected.

Signed-off-by: Lars Ellenberg <lars.ellenberg@linbit.com>
Signed-off-by: Christoph Böhmwalder <christoph.boehmwalder@linbit.com>
Cc: <stable@vger.kernel.org> # 4.13.x
Link: https://lore.kernel.org/r/20220330185551.3553196-1-christoph.boehmwalder@linbit.com
Signed-off-by: Jens Axboe <axboe@kernel.dk>
drivers/block/drbd/drbd_req.c