Bug #1911999 “faulty paths are not removed” : Bugs : multipath-tools package : Ubuntu

Affects	Status	Importance	Assigned to	Milestone
	multipath-tools (Ubuntu)	Expired	High	Unassigned

Revision history for this message

Deyan Stanev (dstanev) wrote on 2021-01-15:

#1

Dependencies.txt Edit (2.4 KiB, text/plain; charset="utf-8")
ProcCpuinfoMinimal.txt Edit (1.3 KiB, text/plain; charset="utf-8")
modified.conffile..etc.multipath.conf.txt Edit (755 bytes, text/plain; charset="utf-8")

Robie Basak (racb) on 2021-01-19

Changed in multipath-tools (Ubuntu):
importance:	Undecided → High

Revision history for this message

Christian Ehrhardt  (paelzer) wrote on 2021-01-21:

#2

Download full text (4.2 KiB)

Trying to recreate on 20.04

# enable my FC adapters
$ sudo chccwdev -e 0.0.e000
$ sudo chccwdev -e 0.0.e100

# Ensure and check I have a one minute set (default would be infinite)
$ for f in /sys/devices/css0/0.0.*/0.0.*/host*/rport-*/fc_remote_ports/rport-*/*loss_tmo; do b=$(basename $f); echo "$b : $(cat $f)"; done
dev_loss_tmo : 60
dev_loss_tmo : 60
dev_loss_tmo : 60
dev_loss_tmo : 60

An individual device right now looks like this:
mpathb (36005076306ffd6b6000000000000240a) dm-3 IBM,2107900
size=10G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
`-+- policy='service-time 0' prio=50 status=active
  |- 0:0:0:1074413604 sdc 8:32 active ready running
  |- 0:0:1:1074413604 sdh 8:112 active ready running
  |- 1:0:1:1074413604 sdr 65:16 active ready running
  `- 1:0:0:1074413604 sdm 8:192 active ready running

Then I was unmapping that on the storage server makes this

Even not "using" the disks actively I immediately see the errors on them in dmesg.

[ 4438.196385] device-mapper: multipath: Failing path 8:32.
[ 4438.205404] sd 0:0:1:1074413604: [sdh] tag#2379 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[ 4438.205407] sd 0:0:1:1074413604: [sdh] tag#2379 Sense Key : Aborted Command [current]
[ 4438.205410] sd 0:0:1:1074413604: [sdh] tag#2379 Add. Sense: Logical unit not supported
[ 4438.205413] sd 0:0:1:1074413604: [sdh] tag#2379 CDB: Read(10) 28 00 01 3f ff 80 00 00 08 00
[ 4438.205416] blk_update_request: I/O error, dev sdh, sector 20971392 op 0x0:(READ) flags 0x84700 phys_seg 1 prio class 0
[ 4438.205428] device-mapper: multipath: Failing path 8:112.
[ 4438.205595] sd 1:0:1:1074413604: [sdr] tag#2933 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[ 4438.205598] sd 1:0:1:1074413604: [sdr] tag#2933 Sense Key : Aborted Command [current]
[ 4438.205605] sd 1:0:1:1074413604: [sdr] tag#2933 Add. Sense: Logical unit not supported
[ 4438.205609] sd 1:0:1:1074413604: [sdr] tag#2933 CDB: Read(10) 28 00 01 3f ff 80 00 00 08 00
[ 4438.205611] blk_update_request: I/O error, dev sdr, sector 20971392 op 0x0:(READ) flags 0x84700 phys_seg 1 prio class 0
[ 4438.205617] device-mapper: multipath: Failing path 65:16.
[ 4438.205772] sd 1:0:0:1074413604: [sdm] tag#2934 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[ 4438.205775] sd 1:0:0:1074413604: [sdm] tag#2934 Sense Key : Aborted Command [current]
[ 4438.205777] sd 1:0:0:1074413604: [sdm] tag#2934 Add. Sense: Logical unit not supported
[ 4438.205779] sd 1:0:0:1074413604: [sdm] tag#2934 CDB: Read(10) 28 00 01 3f ff 80 00 00 08 00
[ 4438.205781] blk_update_request: I/O error, dev sdm, sector 20971392 op 0x0:(READ) flags 0x84700 phys_seg 1 prio class 0
[ 4438.205788] device-mapper: multipath: Failing path 8:192.

And multipath immediately switched them to faulty state

mpathb (36005076306ffd6b6000000000000240a) dm-3 IBM,2107900
size=10G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
`-+- policy='service-time 0' prio=0 status=enabled
  |- 0:0:0:1074413604 sdc 8:32 failed faulty running
  |- 0:0:1:1074413604 sdh 8:112 failed faulty running
  |- 1:0:1:1074413604 sdr 65:16 failed faulty running
  `- 1:0:0:1074413604 sdm 8:192 failed faulty runni...

Trying to recreate on 20.04

# enable my FC adapters
$ sudo chccwdev -e 0.0.e000
$ sudo chccwdev -e 0.0.e100

# Ensure and check I have a one minute set (default would be infinite)
$ for f in /sys/devices/css0/0.0.*/0.0.*/host*/rport-*/fc_remote_ports/rport-*/*loss_tmo; do b=$(basename $f); echo "$b : $(cat $f)"; done
dev_loss_tmo : 60
dev_loss_tmo : 60
dev_loss_tmo : 60
dev_loss_tmo : 60

An individual device right now looks like this:
mpathb (36005076306ffd6b6000000000000240a) dm-3 IBM,2107900
size=10G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
`-+- policy='service-time 0' prio=50 status=active
  |- 0:0:0:1074413604 sdc   8:32   active ready running
  |- 0:0:1:1074413604 sdh   8:112  active ready running
  |- 1:0:1:1074413604 sdr   65:16  active ready running
  `- 1:0:0:1074413604 sdm   8:192  active ready running

Then I was unmapping that on the storage server makes this

Even not "using" the disks actively I immediately see the errors on them in dmesg.

[ 4438.196385] device-mapper: multipath: Failing path 8:32.
[ 4438.205404] sd 0:0:1:1074413604: [sdh] tag#2379 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[ 4438.205407] sd 0:0:1:1074413604: [sdh] tag#2379 Sense Key : Aborted Command [current] 
[ 4438.205410] sd 0:0:1:1074413604: [sdh] tag#2379 Add. Sense: Logical unit not supported
[ 4438.205413] sd 0:0:1:1074413604: [sdh] tag#2379 CDB: Read(10) 28 00 01 3f ff 80 00 00 08 00
[ 4438.205416] blk_update_request: I/O error, dev sdh, sector 20971392 op 0x0:(READ) flags 0x84700 phys_seg 1 prio class 0
[ 4438.205428] device-mapper: multipath: Failing path 8:112.
[ 4438.205595] sd 1:0:1:1074413604: [sdr] tag#2933 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[ 4438.205598] sd 1:0:1:1074413604: [sdr] tag#2933 Sense Key : Aborted Command [current] 
[ 4438.205605] sd 1:0:1:1074413604: [sdr] tag#2933 Add. Sense: Logical unit not supported
[ 4438.205609] sd 1:0:1:1074413604: [sdr] tag#2933 CDB: Read(10) 28 00 01 3f ff 80 00 00 08 00
[ 4438.205611] blk_update_request: I/O error, dev sdr, sector 20971392 op 0x0:(READ) flags 0x84700 phys_seg 1 prio class 0
[ 4438.205617] device-mapper: multipath: Failing path 65:16.
[ 4438.205772] sd 1:0:0:1074413604: [sdm] tag#2934 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[ 4438.205775] sd 1:0:0:1074413604: [sdm] tag#2934 Sense Key : Aborted Command [current] 
[ 4438.205777] sd 1:0:0:1074413604: [sdm] tag#2934 Add. Sense: Logical unit not supported
[ 4438.205779] sd 1:0:0:1074413604: [sdm] tag#2934 CDB: Read(10) 28 00 01 3f ff 80 00 00 08 00
[ 4438.205781] blk_update_request: I/O error, dev sdm, sector 20971392 op 0x0:(READ) flags 0x84700 phys_seg 1 prio class 0
[ 4438.205788] device-mapper: multipath: Failing path 8:192.

And multipath immediately switched them to faulty state

mpathb (36005076306ffd6b6000000000000240a) dm-3 IBM,2107900
size=10G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
`-+- policy='service-time 0' prio=0 status=enabled
  |- 0:0:0:1074413604 sdc   8:32   failed faulty running
  |- 0:0:1:1074413604 sdh   8:112  failed faulty running
  |- 1:0:1:1074413604 sdr   65:16  failed faulty running
  `- 1:0:0:1074413604 sdm   8:192  failed faulty running

The logs in journal then continued to detect that by the tur checker

...
Jan 21 16:59:02 s1lp5 multipathd[782]: mpathb: sdh - tur checker reports path is down
Jan 21 16:59:02 s1lp5 multipathd[782]: mpathb: sdr - tur checker reports path is down
Jan 21 16:59:02 s1lp5 multipathd[782]: mpathb: sdm - tur checker reports path is down
Jan 21 16:59:04 s1lp5 multipathd[782]: mpathb: sdc - tur checker reports path is down
Jan 21 16:59:05 s1lp5 systemd-udevd[648]: dm-3: Worker [18540] processing SEQNUM=17490 is taking a long time
Jan 21 16:59:07 s1lp5 multipathd[782]: mpathb: sdh - tur checker reports path is down
Jan 21 16:59:07 s1lp5 multipathd[782]: mpathb: sdr - tur checker reports path is down
Jan 21 16:59:07 s1lp5 multipathd[782]: mpathb: sdm - tur checker reports path is down
Jan 21 16:59:09 s1lp5 multipathd[782]: mpathb: sdc - tur checker reports path is down
...

And they stayed in "failed faulty running" for more than 60 seconds.
So I guess "confirmed".
I'm not entirely sure yet, but I think I can use that environment to debug it further.

Changed in multipath-tools (Ubuntu):
status:	New → Confirmed

Revision history for this message

Christian Ehrhardt  (paelzer) wrote on 2021-01-25:

#3

I'm not a 100% sure, but I wondered what "to expect" from dev_loss_tmo.
Reading more docs I think it is more like:
"I/O is held in flight, since the target might come back"
After the timeout I'd assume it will kill the remaining I/O.

This makes me think that this bug is about two things:
a) "faulty path is not removed" as reported
b) "mapping new disk to a formerly present LUN should be detected as different/new"

Revision history for this message

Christian Ehrhardt  (paelzer) wrote on 2021-01-25:

#4

Download full text (4.4 KiB)

# Argument why (a) isn't a problem: the "paths are never going away"

And this is (as seen on the sysfs) on the rports, which are the links between local and remote FC ports. In my case for example two ports on each side doing NxM => 4 rports.

-rw-r--r-- 1 root root 4096 Jan 21 15:51 /sys/class/fc_remote_ports/rport-0:0-0/dev_loss_tmo
-rw-r--r-- 1 root root 4096 Jan 21 15:51 /sys/class/fc_remote_ports/rport-0:0-1/dev_loss_tmo
-rw-r--r-- 1 root root 4096 Jan 21 15:51 /sys/class/fc_remote_ports/rport-1:0-0/dev_loss_tmo
-rw-r--r-- 1 root root 4096 Jan 21 15:51 /sys/class/fc_remote_ports/rport-1:0-1/dev_loss_tmo

Those rports do not go down at all when I unmap the disk, as the paths are unaffected.

It more seems like kernel & multipath think "the paths are all happy and up, but something is wrong at the remote end" (which is true as we unmapped the disk). When the disk comes back everyone would expect and wan this to continue.
E.g. in my case I "mapped back" the original volume and was happy that

Mapping back the LUN:

journal:
Jan 25 08:16:46 s1lp5 kernel: sd 0:0:1:1074413604: Power-on or device reset occurred
Jan 25 08:16:46 s1lp5 kernel: sd 1:0:1:1074413604: Power-on or device reset occurred
Jan 25 08:16:46 s1lp5 kernel: sd 1:0:0:1074413604: Power-on or device reset occurred
Jan 25 08:16:46 s1lp5 kernel: sd 0:0:1:1074413604: alua: port group 00 state A preferred supports tolusnA
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: sdh - tur checker reports path is up
Jan 25 08:16:47 s1lp5 multipathd[782]: 8:112: reinstated
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: remaining active paths: 1
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: sdr - tur checker reports path is up
Jan 25 08:16:47 s1lp5 multipathd[782]: 65:16: reinstated
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: remaining active paths: 2
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: sdm - tur checker reports path is up
Jan 25 08:16:47 s1lp5 multipathd[782]: 8:192: reinstated
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: remaining active paths: 3
Jan 25 08:16:47 s1lp5 kernel: device-mapper: multipath: Reinstating path 8:112.
Jan 25 08:16:47 s1lp5 kernel: device-mapper: multipath: Reinstating path 65:16.
Jan 25 08:16:47 s1lp5 kernel: device-mapper: multipath: Reinstating path 8:192.
Jan 25 08:16:47 s1lp5 systemd-udevd[648]: Worker [18540] terminated by signal 9 (KILL)
Jan 25 08:16:47 s1lp5 systemd-udevd[648]: dm-3: Worker [18540] failed
Jan 25 08:16:47 s1lp5 kernel: sd 0:0:1:1074413604: alua: port group 00 state A preferred supports tolusnA
Jan 25 08:16:47 s1lp5 kernel: sd 0:0:1:1074413604: alua: port group 00 state A preferred supports tolusnA
Jan 25 08:16:48 s1lp5 dbus-daemon[1048]: [system] Activating via systemd: service name='org.freedesktop.PackageKit' unit='packagekit.service' requested by ':1.34' (uid=0 pid=983318 comm="/usr/bin/gdbus call --system --dest org.freedeskto" label="unconfined")
Jan 25 08:16:48 s1lp5 systemd[1]: Starting PackageKit Daemon...
Jan 25 08:16:48 s1lp5 PackageKit[983321]: daemon start
Jan 25 08:16:48 s1lp5 kernel: sd 0:0:0:1074413604: Power-on or device reset occurred
Jan 25 08:16:48 s1lp5 kernel: sd 0:0:0:1074413604: alua: port group 00 state A ...

# Argument why (a) isn't a problem: the "paths are never going away"

And this is (as seen on the sysfs) on the rports, which are the links between local and remote FC ports. In my case for example two ports on each side doing NxM  => 4 rports.

-rw-r--r-- 1 root root 4096 Jan 21 15:51 /sys/class/fc_remote_ports/rport-0:0-0/dev_loss_tmo
-rw-r--r-- 1 root root 4096 Jan 21 15:51 /sys/class/fc_remote_ports/rport-0:0-1/dev_loss_tmo
-rw-r--r-- 1 root root 4096 Jan 21 15:51 /sys/class/fc_remote_ports/rport-1:0-0/dev_loss_tmo
-rw-r--r-- 1 root root 4096 Jan 21 15:51 /sys/class/fc_remote_ports/rport-1:0-1/dev_loss_tmo

Those rports do not go down at all when I unmap the disk, as the paths are unaffected.

It more seems like kernel & multipath think "the paths are all happy and up, but something is wrong at the remote end" (which is true as we unmapped the disk). When the disk comes back everyone would expect and wan this to continue.
E.g. in my case I "mapped back" the original volume and was happy that

Mapping back the LUN:

journal:
Jan 25 08:16:46 s1lp5 kernel: sd 0:0:1:1074413604: Power-on or device reset occurred
Jan 25 08:16:46 s1lp5 kernel: sd 1:0:1:1074413604: Power-on or device reset occurred
Jan 25 08:16:46 s1lp5 kernel: sd 1:0:0:1074413604: Power-on or device reset occurred
Jan 25 08:16:46 s1lp5 kernel: sd 0:0:1:1074413604: alua: port group 00 state A preferred supports tolusnA
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: sdh - tur checker reports path is up
Jan 25 08:16:47 s1lp5 multipathd[782]: 8:112: reinstated
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: remaining active paths: 1
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: sdr - tur checker reports path is up
Jan 25 08:16:47 s1lp5 multipathd[782]: 65:16: reinstated
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: remaining active paths: 2
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: sdm - tur checker reports path is up
Jan 25 08:16:47 s1lp5 multipathd[782]: 8:192: reinstated
Jan 25 08:16:47 s1lp5 multipathd[782]: mpathb: remaining active paths: 3
Jan 25 08:16:47 s1lp5 kernel: device-mapper: multipath: Reinstating path 8:112.
Jan 25 08:16:47 s1lp5 kernel: device-mapper: multipath: Reinstating path 65:16.
Jan 25 08:16:47 s1lp5 kernel: device-mapper: multipath: Reinstating path 8:192.
Jan 25 08:16:47 s1lp5 systemd-udevd[648]: Worker [18540] terminated by signal 9 (KILL)
Jan 25 08:16:47 s1lp5 systemd-udevd[648]: dm-3: Worker [18540] failed
Jan 25 08:16:47 s1lp5 kernel: sd 0:0:1:1074413604: alua: port group 00 state A preferred supports tolusnA
Jan 25 08:16:47 s1lp5 kernel: sd 0:0:1:1074413604: alua: port group 00 state A preferred supports tolusnA
Jan 25 08:16:48 s1lp5 dbus-daemon[1048]: [system] Activating via systemd: service name='org.freedesktop.PackageKit' unit='packagekit.service' requested by ':1.34' (uid=0 pid=983318 comm="/usr/bin/gdbus call --system --dest org.freedeskto" label="unconfined")
Jan 25 08:16:48 s1lp5 systemd[1]: Starting PackageKit Daemon...
Jan 25 08:16:48 s1lp5 PackageKit[983321]: daemon start
Jan 25 08:16:48 s1lp5 kernel: sd 0:0:0:1074413604: Power-on or device reset occurred
Jan 25 08:16:48 s1lp5 kernel: sd 0:0:0:1074413604: alua: port group 00 state A preferred supports tolusnA
Jan 25 08:16:49 s1lp5 dbus-daemon[1048]: [system] Successfully activated service 'org.freedesktop.PackageKit'
Jan 25 08:16:49 s1lp5 systemd[1]: Started PackageKit Daemon.
Jan 25 08:16:49 s1lp5 multipathd[782]: mpathb: sdc - tur checker reports path is up
Jan 25 08:16:49 s1lp5 multipathd[782]: 8:32: reinstated
Jan 25 08:16:49 s1lp5 multipathd[782]: mpathb: remaining active paths: 4
Jan 25 08:16:49 s1lp5 kernel: device-mapper: multipath: Reinstating path 8:32.
Jan 25 08:16:49 s1lp5 kernel: sd 0:0:0:1074413604: alua: port group 00 state A preferred supports tolusnA
Jan 25 08:16:49 s1lp5 kernel: sd 0:0:0:1074413604: alua: port group 00 state A preferred supports tolusnA

New multipath state:
mpathb (36005076306ffd6b6000000000000240a) dm-3 IBM,2107900
size=10G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
`-+- policy='service-time 0' prio=50 status=active
  |- 0:0:0:1074413604 sdc   8:32   active ready running
  |- 0:0:1:1074413604 sdh   8:112  active ready running
  |- 1:0:1:1074413604 sdr   65:16  active ready running
  `- 1:0:0:1074413604 sdm   8:192  active ready running

So it detected that it is back and re-established things just as I would expect.
It was never fully "dead" as the paths were just fine.

Revision history for this message

Christian Ehrhardt  (paelzer) wrote on 2021-01-25:

#5

# Argument why (b) might in fact a problem: "disk should be detected as new, and not mapped onto the old paths/devices"

Obviously one could just say "have a static LUN Id plan and don't map back the old LUN, but that is evasive. In a perfect world something in kernel/udev/multipath would recognize it is a new thing.

I agree that if you "map a different / new disk to the same lun" things should not totally break as you reported. I'd have expected that it would detect e.g. a UUID and only do so if these match.
Do in your case the UUIDs also stay the same when you map a new disk under the same LUN?

In my case (adding back the same disk under the same LUN) nothing changed.
For example sg_inq reports absolutely the same

$ sudo sg_inq --id /dev/sdh
VPD INQUIRY: Device Identification page
  Designation descriptor number 1, descriptor length: 20
    designator_type: NAA, code_set: Binary
    associated with the Addressed logical unit
      NAA 6, IEEE Company_id: 0x5076
      Vendor Specific Identifier: 0x306ffd6b6
      Vendor Specific Identifier Extension: 0x240a
      [0x6005076306ffd6b6000000000000240a]
  Designation descriptor number 2, descriptor length: 8
    designator_type: Relative target port, code_set: Binary
    associated with the Target port
      Relative target port: 0x130
  Designation descriptor number 3, descriptor length: 8
    designator_type: Target port group, code_set: Binary
    associated with the Target port
      Target port group: 0x0

Be careful as there are plenty of UUIDs.
The one of the FC/SCSI layer (sg_inq).

Then there could be some below on GPT and on Filesystem.
But IMHO multipath works "below" that and most likely only considers the ID of the FC/SCSI layer.
In my test my LUN was totally empty, so I had no Partition/FS UUID at all.

If your SCSI/FC UUID does not change, then I think this is mis-usage/mis-expectation. I'd be unsure what kernel/udev/multipass should do different.
But if your UUID changes, then IMHO it should work. But I wonder if "appearance of a different disk in place of a missing one" was ever considered in mutlipath. I'd ask you to start a discussion upstream [1][2] for "what to expect" and "how to handle" that case.
Please report a link to the discussion here and let us know of the outcome. No matter if it will be:
1. a different config (then everyone tracking this can benefit)
2. a patch we can fix the package(s) with
3. a lessons learned about what to expect from which component (then everyone tracking this can still benefit)

[1]: https://www.redhat.com/mailman/listinfo/dm-devel
[2]: https://github.com/opensvc/multipath-tools/issues

# Argument why (b) might in fact a problem: "disk should be detected as new, and not mapped onto the old paths/devices"

Obviously one could just say "have a static LUN Id plan and don't map back the old LUN, but that is evasive. In a perfect world something in kernel/udev/multipath would recognize it is a new thing.

I agree that if you "map a different / new disk to the same lun" things should not totally break as you reported. I'd have expected that it would detect e.g. a UUID and only do so if these match.
Do in your case the UUIDs also stay the same when you map a new disk under the same LUN?

In my case (adding back the same disk under the same LUN) nothing changed.
For example sg_inq reports absolutely the same

$ sudo sg_inq --id  /dev/sdh
VPD INQUIRY: Device Identification page
  Designation descriptor number 1, descriptor length: 20
    designator_type: NAA,  code_set: Binary
    associated with the Addressed logical unit
      NAA 6, IEEE Company_id: 0x5076
      Vendor Specific Identifier: 0x306ffd6b6
      Vendor Specific Identifier Extension: 0x240a
      [0x6005076306ffd6b6000000000000240a]
  Designation descriptor number 2, descriptor length: 8
    designator_type: Relative target port,  code_set: Binary
    associated with the Target port
      Relative target port: 0x130
  Designation descriptor number 3, descriptor length: 8
    designator_type: Target port group,  code_set: Binary
    associated with the Target port
      Target port group: 0x0

Be careful as there are plenty of UUIDs.
The one of the FC/SCSI layer (sg_inq).

Then there could be some below on GPT and on Filesystem.
But IMHO multipath works "below" that and most likely only considers the ID of the FC/SCSI layer.
In my test my LUN was totally empty, so I had no Partition/FS UUID at all.

If your SCSI/FC UUID does not change, then I think this is mis-usage/mis-expectation. I'd be unsure what kernel/udev/multipass should do different.
But if your UUID changes, then IMHO it should work. But I wonder if "appearance of a different disk in place of a missing one" was ever considered in mutlipath. I'd ask you to start a discussion upstream [1][2] for "what to expect" and "how to handle" that case.
Please report a link to the discussion here and let us know of the outcome. No matter if it will be:
1. a different config (then everyone tracking this can benefit)
2. a patch we can fix the package(s) with
3. a lessons learned about what to expect from which component (then everyone tracking this can still benefit)

[1]: https://www.redhat.com/mailman/listinfo/dm-devel
[2]: https://github.com/opensvc/multipath-tools/issues

Changed in multipath-tools (Ubuntu):
status:	Confirmed → Incomplete

Revision history for this message

Christian Ehrhardt  (paelzer) wrote on 2021-01-25:

#6

Download full text (8.2 KiB)

Bonus for my theory that these timeouts are about the paths going down and not the disks.
If I unplug the adapter (= 2 paths) it goes through this:

1. it immediately detects that something is wrong. I/O might still be queued.

Jan 25 09:27:52 s1lp5 multipathd[782]: checker failed path 65:32 in map mpatha
Jan 25 09:27:52 s1lp5 multipathd[782]: mpatha: remaining active paths: 3
Jan 25 09:27:52 s1lp5 kernel: device-mapper: multipath: Failing path 65:32.
Jan 25 09:27:53 s1lp5 multipathd[782]: checker failed path 8:176 in map mpathc
Jan 25 09:27:53 s1lp5 multipathd[782]: mpathc: remaining active paths: 3
Jan 25 09:27:53 s1lp5 multipathd[782]: checker failed path 8:160 in map mpathd
Jan 25 09:27:53 s1lp5 multipathd[782]: mpathd: remaining active paths: 3
Jan 25 09:27:53 s1lp5 kernel: device-mapper: multipath: Failing path 8:176.
Jan 25 09:27:53 s1lp5 kernel: device-mapper: multipath: Failing path 8:160.
Jan 25 09:27:54 s1lp5 multipathd[782]: checker failed path 8:208 in map mpatha
Jan 25 09:27:54 s1lp5 multipathd[782]: mpatha: remaining active paths: 2
Jan 25 09:27:54 s1lp5 multipathd[782]: checker failed path 65:48 in map mpathe
Jan 25 09:27:54 s1lp5 multipathd[782]: mpathe: remaining active paths: 3
Jan 25 09:27:54 s1lp5 multipathd[782]: checker failed path 8:240 in map mpathd
Jan 25 09:27:54 s1lp5 multipathd[782]: mpathd: remaining active paths: 2
Jan 25 09:27:54 s1lp5 multipathd[782]: checker failed path 8:224 in map mpathe
Jan 25 09:27:54 s1lp5 multipathd[782]: mpathe: remaining active paths: 2
Jan 25 09:27:54 s1lp5 multipathd[782]: checker failed path 65:0 in map mpathc
Jan 25 09:27:54 s1lp5 multipathd[782]: mpathc: remaining active paths: 2
Jan 25 09:27:54 s1lp5 kernel: device-mapper: multipath: Failing path 8:208.
Jan 25 09:27:54 s1lp5 kernel: device-mapper: multipath: Failing path 65:48.
Jan 25 09:27:54 s1lp5 kernel: device-mapper: multipath: Failing path 8:240.
Jan 25 09:27:54 s1lp5 kernel: device-mapper: multipath: Failing path 8:224.
Jan 25 09:27:54 s1lp5 kernel: device-mapper: multipath: Failing path 65:0.

state now:
mpathb (36005076306ffd6b6000000000000240a) dm-3 IBM,2107900
size=10G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
`-+- policy='service-time 0' prio=50 status=active
  |- 0:0:0:1074413604 sdc 8:32 active ready running
  |- 0:0:1:1074413604 sdh 8:112 active ready running
  |- 1:0:1:1074413604 sdr 65:16 active i/o pending running
  `- 1:0:0:1074413604 sdm 8:192 active i/o pending running

2. after the timeouts for failing I/O (fast_io_fail_tmo) all queued is cancelled

state now:
mpathb (36005076306ffd6b6000000000000240a) dm-3 IBM,2107900
size=10G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
`-+- policy='service-time 0' prio=50 status=active
  |- 0:0:0:1074413604 sdc 8:32 active ready running
  |- 0:0:1:1074413604 sdh 8:112 active ready running
  |- 1:0:1:1074413604 sdr 65:16 failed faulty running
  `- 1:0:0:1074413604 sdm 8:192 failed faulty running

3. finally once I hit my 60 second limit on the paths (dev_loss_tmo) they are considered dead

Jan 25 09:28:52 s1lp5 kernel: rport-1:0-0: blocked FC remote port time out: removing target and saving binding
Jan 25 09:28:52 s1lp...

Bonus for my theory that these timeouts are about the paths going down and not the disks.
If I unplug the adapter (= 2 paths) it goes through this:

1. it immediately detects that something is wrong. I/O might still be queued.

Jan 25 09:27:52 s1lp5 multipathd[782]: checker failed path 65:32 in map mpatha
Jan 25 09:27:52 s1lp5 multipathd[782]: mpatha: remaining active paths: 3
Jan 25 09:27:52 s1lp5 kernel: device-mapper: multipath: Failing path 65:32.
Jan 25 09:27:53 s1lp5 multipathd[782]: checker failed path 8:176 in map mpathc
Jan 25 09:27:53 s1lp5 multipathd[782]: mpathc: remaining active paths: 3
Jan 25 09:27:53 s1lp5 multipathd[782]: checker failed path 8:160 in map mpathd
Jan 25 09:27:53 s1lp5 multipathd[782]: mpathd: remaining active paths: 3
Jan 25 09:27:53 s1lp5 kernel: device-mapper: multipath: Failing path 8:176.
Jan 25 09:27:53 s1lp5 kernel: device-mapper: multipath: Failing path 8:160.
Jan 25 09:27:54 s1lp5 multipathd[782]: checker failed path 8:208 in map mpatha
Jan 25 09:27:54 s1lp5 multipathd[782]: mpatha: remaining active paths: 2
Jan 25 09:27:54 s1lp5 multipathd[782]: checker failed path 65:48 in map mpathe
Jan 25 09:27:54 s1lp5 multipathd[782]: mpathe: remaining active paths: 3
Jan 25 09:27:54 s1lp5 multipathd[782]: checker failed path 8:240 in map mpathd
Jan 25 09:27:54 s1lp5 multipathd[782]: mpathd: remaining active paths: 2
Jan 25 09:27:54 s1lp5 multipathd[782]: checker failed path 8:224 in map mpathe
Jan 25 09:27:54 s1lp5 multipathd[782]: mpathe: remaining active paths: 2
Jan 25 09:27:54 s1lp5 multipathd[782]: checker failed path 65:0 in map mpathc
Jan 25 09:27:54 s1lp5 multipathd[782]: mpathc: remaining active paths: 2
Jan 25 09:27:54 s1lp5 kernel: device-mapper: multipath: Failing path 8:208.
Jan 25 09:27:54 s1lp5 kernel: device-mapper: multipath: Failing path 65:48.
Jan 25 09:27:54 s1lp5 kernel: device-mapper: multipath: Failing path 8:240.
Jan 25 09:27:54 s1lp5 kernel: device-mapper: multipath: Failing path 8:224.
Jan 25 09:27:54 s1lp5 kernel: device-mapper: multipath: Failing path 65:0.

state now:
mpathb (36005076306ffd6b6000000000000240a) dm-3 IBM,2107900
size=10G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
`-+- policy='service-time 0' prio=50 status=active
  |- 0:0:0:1074413604 sdc   8:32   active ready running
  |- 0:0:1:1074413604 sdh   8:112  active ready running
  |- 1:0:1:1074413604 sdr   65:16  active i/o pending running
  `- 1:0:0:1074413604 sdm   8:192  active i/o pending running

2. after the timeouts for failing I/O  (fast_io_fail_tmo) all queued is cancelled

state now:
mpathb (36005076306ffd6b6000000000000240a) dm-3 IBM,2107900
size=10G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
`-+- policy='service-time 0' prio=50 status=active
  |- 0:0:0:1074413604 sdc   8:32   active ready running
  |- 0:0:1:1074413604 sdh   8:112  active ready running
  |- 1:0:1:1074413604 sdr   65:16  failed faulty running
  `- 1:0:0:1074413604 sdm   8:192  failed faulty running

3. finally once I hit my 60 second limit on the paths (dev_loss_tmo) they are considered dead

Jan 25 09:28:52 s1lp5 kernel:  rport-1:0-0: blocked FC remote port time out: removing target and saving binding
Jan 25 09:28:52 s1lp5 kernel:  rport-1:0-1: blocked FC remote port time out: removing target and saving binding
Jan 25 09:28:52 s1lp5 kernel: sd 1:0:0:1073889316: [sdk] Synchronizing SCSI cache
Jan 25 09:28:52 s1lp5 kernel: sd 1:0:0:1073889316: [sdk] Synchronize Cache(10) failed: Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK
Jan 25 09:28:52 s1lp5 kernel: sd 1:0:0:1073954852: [sdl] Synchronizing SCSI cache
Jan 25 09:28:52 s1lp5 kernel: sd 1:0:0:1073954852: [sdl] Synchronize Cache(10) failed: Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK
Jan 25 09:28:52 s1lp5 multipathd[782]: mpathd: load table [0 20971520 multipath 1 queue_if_no_path 1 alua 1 1 service-time 0 3 1 8:0 1 8:80 1 8:240 1]
Jan 25 09:28:52 s1lp5 multipathd[782]: sync_map_state: failing sdp state 2 dmstate 2
Jan 25 09:28:52 s1lp5 multipathd[782]: sdk [8:160]: path removed from map mpathd
Jan 25 09:28:52 s1lp5 kernel: device-mapper: multipath: Failing path 8:240.
Jan 25 09:28:52 s1lp5 kernel: sd 1:0:1:1073889316: [sdp] Synchronizing SCSI cache
Jan 25 09:28:52 s1lp5 kernel: sd 1:0:1:1073889316: [sdp] Synchronize Cache(10) failed: Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK
Jan 25 09:28:52 s1lp5 kernel: scsi 1:0:0:1073889316: alua: Detached
Jan 25 09:28:52 s1lp5 multipath[994777]: 8:240: cannot find block device
Jan 25 09:28:52 s1lp5 multipath[994777]: 8:240: Empty device name
Jan 25 09:28:52 s1lp5 multipath[994777]: 8:240: Empty device name
...

state now:
mpathb (36005076306ffd6b6000000000000240a) dm-3 IBM,2107900
size=10G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
`-+- policy='service-time 0' prio=50 status=active
  |- 0:0:0:1074413604 sdc   8:32   active ready running
  `- 0:0:1:1074413604 sdh   8:112  active ready running

^^ nothing will be enqueued or checked anymore the paths are gone.
IIUC this is what dev_loss_tmo is for.

4. If I later bring back the paths they are newly detected and re-added

Jan 25 09:30:24 s1lp5 kernel: scsi 1:0:0:1076052004: Direct-Access     IBM      2107900          2700 PQ: 0 ANSI: 5
Jan 25 09:30:24 s1lp5 kernel: scsi 1:0:0:1076052004: alua: supports implicit TPGS
Jan 25 09:30:24 s1lp5 kernel: scsi 1:0:0:1076052004: alua: device naa.6005076306ffd6b60000000000002423 port group 0 rel port 330
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:0:1076052004: Attached scsi generic sg18 type 0
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:0:1076052004: Power-on or device reset occurred
Jan 25 09:30:24 s1lp5 kernel: scsi 1:0:0:1076117540: Direct-Access     IBM      2107900          2700 PQ: 0 ANSI: 5
Jan 25 09:30:24 s1lp5 kernel: scsi 1:0:0:1076117540: alua: supports implicit TPGS
Jan 25 09:30:24 s1lp5 kernel: scsi 1:0:0:1076117540: alua: device naa.6005076306ffd6b60000000000002424 port group 0 rel port 330
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:0:1076117540: Attached scsi generic sg19 type 0
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:0:1076117540: Power-on or device reset occurred
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1074413604: alua: port group 00 state A preferred supports tolusnA
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1076117540: alua: port group 00 state A preferred supports tolusnA
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1073954852: alua: port group 00 state A preferred supports tolusnA
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1074413604: [sdm] 20971520 512-byte logical blocks: (10.7 GB/10.0 GiB)
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1073954852: [sdl] 20971520 512-byte logical blocks: (10.7 GB/10.0 GiB)
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1076117540: [sdo] 134217728 512-byte logical blocks: (68.7 GB/64.0 GiB)
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1074413604: [sdm] Write Protect is off
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1074413604: [sdm] Mode Sense: ed 00 00 08
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1073954852: [sdl] Write Protect is off
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1076117540: [sdo] Write Protect is off
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1073954852: [sdl] Mode Sense: ed 00 00 08
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1076117540: [sdo] Mode Sense: ed 00 00 08
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1074413604: [sdm] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1073954852: [sdl] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1076117540: [sdo] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1076052004: alua: port group 00 state A preferred supports tolusnA
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1076052004: [sdn] 134217728 512-byte logical blocks: (68.7 GB/64.0 GiB)
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1076052004: [sdn] Write Protect is off
Jan 25 09:30:24 s1lp5 kernel: sd 1:0:1:1076052004: [sdn] Mode Sense: ed 00 00 08

state now:
mpathb (36005076306ffd6b6000000000000240a) dm-3 IBM,2107900
size=10G features='1 queue_if_no_path' hwhandler='1 alua' wp=rw
`-+- policy='service-time 0' prio=50 status=active
  |- 0:0:0:1074413604 sdc   8:32   active ready running
  |- 0:0:1:1074413604 sdh   8:112  active ready running
  |- 1:0:1:1074413604 sdm   8:192  active ready running
  `- 1:0:0:1074413604 sdr   65:16  active ready running

Revision history for this message

Christian Ehrhardt  (paelzer) wrote on 2021-01-25:

#7

Download full text (5.1 KiB)

I have also checked if this was any different in the past and checked Bionic (thanks Frank for the system). It behaved the same way (i.e. no regression).

Unmapping device:

Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: [sdc] tag#1877 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: [sdc] tag#1877 Sense Key : Aborted Command [current]
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: [sdc] tag#1877 Add. Sense: Logical unit not supported
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: [sdc] tag#1877 CDB: Write(10) 2a 00 01 04 88 00 00 00 08 00
Jan 25 05:07:15 hwe0006 kernel: print_req_error: I/O error, dev sdc, sector 17074176
Jan 25 05:07:15 hwe0006 kernel: device-mapper: multipath: Failing path 8:32.
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:0:1074151462: [sda] tag#2231 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:0:1074151462: [sda] tag#2231 Sense Key : Aborted Command [current]
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:0:1074151462: [sda] tag#2231 Add. Sense: Logical unit not supported
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:0:1074151462: [sda] tag#2231 CDB: Write(10) 2a 00 01 04 88 00 00 00 08 00
Jan 25 05:07:15 hwe0006 kernel: print_req_error: I/O error, dev sda, sector 17074176
Jan 25 05:07:15 hwe0006 kernel: device-mapper: multipath: Failing path 8:0.
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:1:1074151462: [sdd] tag#1877 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:1:1074151462: [sdd] tag#1877 Sense Key : Aborted Command [current]
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:1:1074151462: [sdd] tag#1877 Add. Sense: Logical unit not supported
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:1:1074151462: [sdd] tag#1877 CDB: Write(10) 2a 00 01 04 88 00 00 00 08 00
Jan 25 05:07:15 hwe0006 kernel: print_req_error: I/O error, dev sdd, sector 17074176
Jan 25 05:07:15 hwe0006 kernel: device-mapper: multipath: Failing path 8:48.
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:1:1074151462: [sdb] tag#2231 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:1:1074151462: [sdb] tag#2231 Sense Key : Aborted Command [current]
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:1:1074151462: [sdb] tag#2231 Add. Sense: Logical unit not supported
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:1:1074151462: [sdb] tag#2231 CDB: Write(10) 2a 00 01 04 88 00 00 00 08 00
Jan 25 05:07:15 hwe0006 kernel: print_req_error: I/O error, dev sdb, sector 17074176
Jan 25 05:07:15 hwe0006 kernel: device-mapper: multipath: Failing path 8:16.
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: Power-on or device reset occurred
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: alua: port group 00 state A preferred supports tolusnA
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:0:1074151462: Power-on or device reset occurred
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:1:1074151462: Power-on or device reset occurred
Jan 25 05:07:15 hwe0006 kernel: device-mapper: multipath: Reinstating path 8:32.
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:1:1074151462: Power-on or device reset occurred
Jan 25 05:07:15 hwe0006 multipathd[591]: s...

I have also checked if this was any different in the past and checked Bionic (thanks Frank for the system). It behaved the same way (i.e. no regression).

Unmapping device:

Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: [sdc] tag#1877 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: [sdc] tag#1877 Sense Key : Aborted Command [current] 
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: [sdc] tag#1877 Add. Sense: Logical unit not supported
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: [sdc] tag#1877 CDB: Write(10) 2a 00 01 04 88 00 00 00 08 00
Jan 25 05:07:15 hwe0006 kernel: print_req_error: I/O error, dev sdc, sector 17074176
Jan 25 05:07:15 hwe0006 kernel: device-mapper: multipath: Failing path 8:32.
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:0:1074151462: [sda] tag#2231 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:0:1074151462: [sda] tag#2231 Sense Key : Aborted Command [current] 
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:0:1074151462: [sda] tag#2231 Add. Sense: Logical unit not supported
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:0:1074151462: [sda] tag#2231 CDB: Write(10) 2a 00 01 04 88 00 00 00 08 00
Jan 25 05:07:15 hwe0006 kernel: print_req_error: I/O error, dev sda, sector 17074176
Jan 25 05:07:15 hwe0006 kernel: device-mapper: multipath: Failing path 8:0.
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:1:1074151462: [sdd] tag#1877 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:1:1074151462: [sdd] tag#1877 Sense Key : Aborted Command [current] 
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:1:1074151462: [sdd] tag#1877 Add. Sense: Logical unit not supported
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:1:1074151462: [sdd] tag#1877 CDB: Write(10) 2a 00 01 04 88 00 00 00 08 00
Jan 25 05:07:15 hwe0006 kernel: print_req_error: I/O error, dev sdd, sector 17074176
Jan 25 05:07:15 hwe0006 kernel: device-mapper: multipath: Failing path 8:48.
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:1:1074151462: [sdb] tag#2231 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:1:1074151462: [sdb] tag#2231 Sense Key : Aborted Command [current] 
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:1:1074151462: [sdb] tag#2231 Add. Sense: Logical unit not supported
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:1:1074151462: [sdb] tag#2231 CDB: Write(10) 2a 00 01 04 88 00 00 00 08 00
Jan 25 05:07:15 hwe0006 kernel: print_req_error: I/O error, dev sdb, sector 17074176
Jan 25 05:07:15 hwe0006 kernel: device-mapper: multipath: Failing path 8:16.
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: Power-on or device reset occurred
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:0:1074151462: alua: port group 00 state A preferred supports tolusnA
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:0:1074151462: Power-on or device reset occurred
Jan 25 05:07:15 hwe0006 kernel: sd 0:0:1:1074151462: Power-on or device reset occurred
Jan 25 05:07:15 hwe0006 kernel: device-mapper: multipath: Reinstating path 8:32.
Jan 25 05:07:15 hwe0006 kernel: sd 1:0:1:1074151462: Power-on or device reset occurred
Jan 25 05:07:15 hwe0006 multipathd[591]: sdc: mark as failed
Jan 25 05:07:15 hwe0006 multipathd[591]: mpatha: remaining active paths: 3
Jan 25 05:07:15 hwe0006 multipathd[591]: sda: mark as failed
Jan 25 05:07:15 hwe0006 multipathd[591]: mpatha: remaining active paths: 2
Jan 25 05:07:15 hwe0006 multipathd[591]: sdb: mark as failed
Jan 25 05:07:15 hwe0006 multipathd[591]: mpatha: remaining active paths: 1
Jan 25 05:07:15 hwe0006 multipathd[591]: sdd: mark as failed
Jan 25 05:07:15 hwe0006 multipathd[591]: mpatha: remaining active paths: 0
Jan 25 05:07:15 hwe0006 multipathd[591]: 8:32: reinstated
Jan 25 05:07:15 hwe0006 multipathd[591]: mpatha: remaining active paths: 1

mpatha (36005076306ffd6b60000000000002606) dm-0 IBM,2107900
size=64G features='3 queue_if_no_path queue_mode mq' hwhandler='0' wp=rw
`-+- policy='service-time 0' prio=0 status=active
  |- 0:0:0:1074151462 sda 8:0  active faulty running
  |- 0:0:1:1074151462 sdb 8:16 active faulty running
  |- 1:0:0:1074151462 sdc 8:32 active faulty running
  `- 1:0:1:1074151462 sdd 8:48 active faulty running

Mapping device back:

Jan 25 05:07:16 hwe0006 multipathd[591]: 8:0: reinstated
Jan 25 05:07:16 hwe0006 multipathd[591]: mpatha: remaining active paths: 2
Jan 25 05:07:16 hwe0006 kernel: device-mapper: multipath: Reinstating path 8:0.
Jan 25 05:07:16 hwe0006 multipathd[591]: 8:16: reinstated
Jan 25 05:07:16 hwe0006 multipathd[591]: mpatha: remaining active paths: 3
Jan 25 05:07:16 hwe0006 kernel: device-mapper: multipath: Reinstating path 8:16.
Jan 25 05:07:16 hwe0006 multipathd[591]: 8:48: reinstated
Jan 25 05:07:16 hwe0006 multipathd[591]: mpatha: remaining active paths: 4
Jan 25 05:07:16 hwe0006 kernel: device-mapper: multipath: Reinstating path 8:48.
Jan 25 05:07:16 hwe0006 multipath[2853]: dm-0: usable paths found
Jan 25 05:07:16 hwe0006 multipath[2860]: dm-0: usable paths found
Jan 25 05:07:16 hwe0006 multipath[2867]: dm-0: usable paths found

So this isn't a behavioral regression either.
Waiting for your feedback on your use case and expectations.

Revision history for this message

Deyan Stanev (dstanev) wrote on 2021-01-25:

#8

Download full text (3.3 KiB)

Yes, it was the same on 18.04 also.
The problem is that I don't reach "3. finally once I hit my 60 second limit on the paths (dev_loss_tmo) they are considered dead" . The paths are never removed and stay "running" forever. Maybe it is related to the fact that there is only one HBA active.

Do in your case the UUIDs also stay the same when you map a new disk under the same LUN?
The udev is caching the old disk info, so all the ids should be the same. When you run sg_inq, it is checking the real values and returns the real ID. Multipath however is still using the cached wrong ids from udev.

Also when the both paths are failing and the map is flushed(as we have only one FC HBA we have only 2 paths to both controllers of the storage), it is expected the devices underneath to be removed as well. They are not removed and the map comes back on multipath reload maps.

My expectation is the paths to be removed after dev_loss_tmo. However if it is acting on the whole rport - we still have healthy paths on the both rports, so it might be actually working as expected.

Also the udev is to reluctant to rescan the readded devices because they are never removed. When a path is reinstated it should be rescanned to validate that it is in fact the same disk.
Please tell me if you need more information.

here is some logs of a failing map:
Jan 25 13:30:36 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:30:36 joker multipathd[2127]: checker failed path 8:112 in map 360050763808081638000000000000054
Jan 25 13:30:36 joker multipathd[2127]: 360050763808081638000000000000054: remaining active paths: 1
Jan 25 13:30:37 joker multipathd[2127]: sdr: mark as failed
Jan 25 13:30:37 joker multipathd[2127]: 360050763808081638000000000000054: remaining active paths: 0
Jan 25 13:30:39 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:30:41 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:30:44 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:30:46 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:30:49 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:30:51 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:30:54 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:30:56 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:30:59 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:31:01 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:31:05 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:31:07 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is do...

Yes, it was the same on 18.04 also. 
The problem is that I don't reach "3. finally once I hit my 60 second limit on the paths (dev_loss_tmo) they are considered dead" . The paths are never removed and stay "running" forever. Maybe it is related to the fact that there is only one HBA active.

Do in your case the UUIDs also stay the same when you map a new disk under the same LUN?
The udev is caching the old disk info, so all the ids should be the same. When you run sg_inq, it is checking the real values and returns the real ID. Multipath however is still using the cached wrong ids from udev.

Also when the both paths are failing and the map is flushed(as we have only one FC HBA we have only 2 paths to both controllers of the storage), it is expected the devices underneath to be removed as well. They are not removed and the map comes back on multipath reload maps.

My expectation is the paths to be removed after dev_loss_tmo. However if it is acting on the whole rport - we still have healthy paths on the both rports, so it might be actually working as expected.

Also the udev is to reluctant to rescan the readded devices because they are never removed. When a path is reinstated it should be rescanned to validate that it is in fact the same disk.
Please tell me if you need more information.

here is some logs of a failing map:
Jan 25 13:30:36 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:30:36 joker multipathd[2127]: checker failed path 8:112 in map 360050763808081638000000000000054
Jan 25 13:30:36 joker multipathd[2127]: 360050763808081638000000000000054: remaining active paths: 1
Jan 25 13:30:37 joker multipathd[2127]: sdr: mark as failed
Jan 25 13:30:37 joker multipathd[2127]: 360050763808081638000000000000054: remaining active paths: 0
Jan 25 13:30:39 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:30:41 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:30:44 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:30:46 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:30:49 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:30:51 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:30:54 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:30:56 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:30:59 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:31:01 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
Jan 25 13:31:05 joker multipathd[2127]: 360050763808081638000000000000054: sdr - tur checker reports path is down
Jan 25 13:31:07 joker multipathd[2127]: 360050763808081638000000000000054: sdh - tur checker reports path is down
.... 
and it continues like that forever. 
I am running a cronjob to remove faulty paths as a workaround so we can use the system

Revision history for this message

Christian Ehrhardt  (paelzer) wrote on 2021-01-25:

#9

Hi,
I reordered this slightly to split into topics properly.

> The problem is that I don't reach "3. finally once I hit my 60 second limit on the paths
> (dev_loss_tmo) they are considered dead" . The paths are never removed and stay "running"
> forever. Maybe it is related to the fact that there is only one HBA active.

> My expectation is the paths to be removed after dev_loss_tmo. However if it is acting on the
> whole rport - we still have healthy paths on the both rports, so it might be actually working
> as expected.

^^ yes I think this part indeed work as expected, but I'm open to be convinced otherwise

> Also when the both paths are failing and the map is flushed(as we have only one FC HBA we have
> only 2 paths to both controllers of the storage), it is expected the devices underneath to be
> removed as well. They are not removed and the map comes back on multipath reload maps.

You saw my example on "failing paths" above which indeed seemed to remove the devices for me.
In Journal/dmesg I had:
"rport-1:0-0: blocked FC remote port time out: removing target and saving binding"
If the port/path is down (not the LUN) then I'd expect the kernel to trigger that after dev_loss_tmo.

> udev is caching the old disk info, so all the ids should be the same. When you run sg_inq, it
> is checking the real values and returns the real ID. Multipath however is still using the
> cached wrong ids from udev.

> Also the udev is to reluctant to rescan the readded devices because they are never removed.
> When a path is reinstated it should be rescanned to validate that it is in fact the same disk.

^^ I agree to this, if indeed the UUID changed but is cached/not-rescanned that seems like an issue to me.

> Please tell me if you need more information.

I don't think "I/we" need any more here - it seems to be something that the multipath-tools don't do yet (or it could, but we both fail to see the right mix of config options to do so).

The next step to me seems to be to engage with upstream which is usually done best by the affected person. As mentioned before that would be at [1][2].
A link back to the discussion/issue would be awesome so that we can track the outcome and integrate it into Ubuntu.

[1]: https://www.redhat.com/mailman/listinfo/dm-devel
[2]: https://github.com/opensvc/multipath-tools/issues

Hi,
I reordered this slightly to split into topics properly.

> The problem is that I don't reach "3. finally once I hit my 60 second limit on the paths 
> (dev_loss_tmo) they are considered dead" . The paths are never removed and stay "running" 
> forever. Maybe it is related to the fact that there is only one HBA active.

> My expectation is the paths to be removed after dev_loss_tmo. However if it is acting on the 
> whole rport - we still have healthy paths on the both rports, so it might be actually working 
> as expected.

^^ yes I think this part indeed work as expected, but I'm open to be convinced otherwise

> Also when the both paths are failing and the map is flushed(as we have only one FC HBA we have 
> only 2 paths to both controllers of the storage), it is expected the devices underneath to be
> removed as well. They are not removed and the map comes back on multipath reload maps.

You saw my example on "failing paths" above which indeed seemed to remove the devices for me.
In Journal/dmesg I had:
  "rport-1:0-0: blocked FC remote port time out: removing target and saving binding"
If the port/path is down (not the LUN) then I'd expect the kernel to trigger that after dev_loss_tmo.

> udev is caching the old disk info, so all the ids should be the same. When you run sg_inq, it 
> is checking the real values and returns the real ID. Multipath however is still using the 
> cached wrong ids from udev.

> Also the udev is to reluctant to rescan the readded devices because they are never removed. 
> When a path is reinstated it should be rescanned to validate that it is in fact the same disk.

^^ I agree to this, if indeed the UUID changed but is cached/not-rescanned that seems like an issue to me.

> Please tell me if you need more information.

I don't think "I/we" need any more here - it seems to be something that the multipath-tools don't do yet (or it could, but we both fail to see the right mix of config options to do so).

The next step to me seems to be to engage with upstream which is usually done best by the affected person. As mentioned before that would be at [1][2].
A link back to the discussion/issue would be awesome so that we can track the outcome and integrate it into Ubuntu.

[1]: https://www.redhat.com/mailman/listinfo/dm-devel
[2]: https://github.com/opensvc/multipath-tools/issues

Revision history for this message

Launchpad Janitor (janitor) wrote on 2021-03-27:

#10

[Expired for multipath-tools (Ubuntu) because there has been no activity for 60 days.]

Changed in multipath-tools (Ubuntu):
status:	Incomplete → Expired

Ubuntu
multipath-tools package

faulty paths are not removed

Bug Description

Other bug subscribers

Bug attachments

Remote bug watches

Ubuntumultipath-tools package

faulty paths are not removed

Bug Description

Other bug subscribers

Bug attachments

Remote bug watches

Ubuntu
multipath-tools package