eb619fdb2d4cb8b3d3419e9113921e87e7daf557 - kernel/common

commit	eb619fdb2d4cb8b3d3419e9113921e87e7daf557	[log] [tgz]
author	Jens Axboe <[email protected]>	Thu Nov 09 08:32:43 2017 -0700
committer	Jens Axboe <[email protected]>	Fri Nov 10 19:53:25 2017 -0700
tree	491c0230e3ce0cead62bf59b090bf6c8b2bbd6e0
parent	e454d122e22826589cfa08788a802ab09d4fae24 [diff]

blk-mq: fix issue with shared tag queue re-running

This patch attempts to make the case of hctx re-running on driver tag
failure more robust. Without this patch, it's pretty easy to trigger a
stall condition with shared tags. An example is using null_blk like
this:

modprobe null_blk queue_mode=2 nr_devices=4 shared_tags=1 submit_queues=1 hw_queue_depth=1

which sets up 4 devices, sharing the same tag set with a depth of 1.
Running a fio job ala:

[global]
bs=4k
rw=randread
norandommap
direct=1
ioengine=libaio
iodepth=4

[nullb0]
filename=/dev/nullb0
[nullb1]
filename=/dev/nullb1
[nullb2]
filename=/dev/nullb2
[nullb3]
filename=/dev/nullb3

will inevitably end with one or more threads being stuck waiting for a
scheduler tag. That IO is then stuck forever, until someone else
triggers a run of the queue.

Ensure that we always re-run the hardware queue, if the driver tag we
were waiting for got freed before we added our leftover request entries
back on the dispatch list.

Reviewed-by: Bart Van Assche <[email protected]>
Tested-by: Bart Van Assche <[email protected]>
Reviewed-by: Ming Lei <[email protected]>
Reviewed-by: Omar Sandoval <[email protected]>
Signed-off-by: Jens Axboe <[email protected]>

3 files changed

tree: 491c0230e3ce0cead62bf59b090bf6c8b2bbd6e0