selftests: simult_flows.sh: unbalanced bwidth tests are unstable #137

Closed
matttbe opened this issue Jan 15, 2021 · 3 comments

matttbe commented Jan 15, 2021

I noticed that, after applying 13a9499 ("mptcp: fix locking in mptcp_disconnect()"), my CI reports instabilities in the simult_flows.sh selftest for the unbalanced bwidth tests, e.g.:

  • unbalanced bwidth, unbalanced delay
  • unbalanced bwidth with opposed, unbalanced delay
  • unbalanced bwidth with opposed, unbalanced delay - reverse direction

Mainly with the last one, with both debug and non-debug kernel configs.

# unbalanced bwidth with opposed, unbalanced delay 3408 max 3245 [ fail ]
# client exit code 0, server 0
#
# netns ns3-0-VxsqRF socket stat for 10009:
# State Recv-Q Send-Q Local Address:Port Peer Address:Port Process
# TIME-WAIT 0 0 10.0.3.3:10009 10.0.1.1:55602 timer:(timewait,59sec,0)
#
# TIME-WAIT 0 0 10.0.3.3:10009 10.0.2.1:37425 timer:(timewait,59sec,0)
#
#
# netns ns1-0-VxsqRF socket stat for 10009:
# State Recv-Q Send-Q Local Address:Port Peer Address:PortProcess
# -rw------- 1 root root 8388608 Jan 14 17:57 /tmp/tmp.VLIfxBoNRi
# -rw------- 1 root root 8388608 Jan 14 17:58 /tmp/tmp.bqj6pDxgap
# -rw------- 1 root root 81920 Jan 14 17:58 /tmp/tmp.pXbmYOeFBb
# -rw------- 1 root root 81920 Jan 14 17:57 /tmp/tmp.rJduZf5gt0
# unbalanced bwidth with opposed, unbalanced delay - reverse direction 3271 max 3245 [ fail ]
# client exit code 0, server 0
#
# netns ns3-0-VxsqRF socket stat for 10010:
# State Recv-Q Send-Q Local Address:Port Peer Address:PortProcess
#
# netns ns1-0-VxsqRF socket stat for 10010:
# State Recv-Q Send-Q Local Address:Port Peer Address:Port Process
# TIME-WAIT 0 0 10.0.2.1%ns1eth2:59907 10.0.3.3:10010 timer:(timewait,59sec,0)
#
# TIME-WAIT 0 0 10.0.1.1:55726 10.0.3.3:10010 timer:(timewait,59sec,0)
#
# -rw------- 1 root root 81920 Jan 14 17:58 /tmp/tmp.bqj6pDxgap
# -rw------- 1 root root 81920 Jan 14 17:57 /tmp/tmp.rJduZf5gt0
# -rw------- 1 root root 8388608 Jan 14 17:57 /tmp/tmp.VLIfxBoNRi
# -rw------- 1 root root 8388608 Jan 14 17:58 /tmp/tmp.pXbmYOeFBb

Looking at the results from my CI since yesterday -- after applying commit 13a9499: https://github.com/multipath-tcp/mptcp_net-next/compare/export/20210114T060000..export/20210114T172607 -- I hit it all the time.

Before that, I only got it once in total, on the 15th of December.

So it looks like a regression introduced by 13a9499.

matttbe added the bug label Jan 15, 2021
matttbe changed the title from "selftests: diag.sh: unbalanced bwidth tests are unstable" to "selftests: simult_flows.sh: unbalanced bwidth tests are unstable" Jan 15, 2021

matttbe commented Jan 25, 2021

FYI: this weekend, there were two unstable builds because simult_flows.sh reported these errors:

unbalanced bwidth with opposed, unbalanced delay - reverse direction 3316 max 3245 [ fail ]
unbalanced bwidth with unbalanced delay - reverse direction  3294 max 3245  [ fail ]

Both with a debug kernel. Without the debug kernel, everything was OK, e.g.

  | # balanced bwidth                                     4576 max 5005 [ OK ]
  | # balanced bwidth - reverse direction                 4584 max 5005 [ OK ]
  | # balanced bwidth with unbalanced delay               4576 max 5005 [ OK ]
  | # balanced bwidth with unbalanced delay - reverse direction  4576 max 5005 [ OK ]
  | # unbalanced bwidth                                   2961 max 3245 [ OK ]
  | # unbalanced bwidth - reverse direction               2934 max 3245 [ OK ]
  | # unbalanced bwidth with unbalanced delay             2944 max 3245 [ OK ]
  | # unbalanced bwidth with unbalanced delay - reverse direction  2928 max 3245 [ OK ]
  | # unbalanced bwidth with opposed, unbalanced delay    2938 max 3245 [ OK ]
  | # unbalanced bwidth with opposed, unbalanced delay - reverse direction  2998 max 3245 [ OK ]
  | # balanced bwidth                                     4589 max 5005 [ OK ]
  | # balanced bwidth - reverse direction                 4603 max 5005 [ OK ]
  | # balanced bwidth with unbalanced delay               4589 max 5005 [ OK ]
  | # balanced bwidth with unbalanced delay - reverse direction  4606 max 5005 [ OK ]
  | # unbalanced bwidth                                   2985 max 3245 [ OK ]
  | # unbalanced bwidth - reverse direction               2948 max 3245 [ OK ]
  | # unbalanced bwidth with unbalanced delay             2953 max 3245 [ OK ]
  | # unbalanced bwidth with unbalanced delay - reverse direction  2931 max 3245 [ OK ]
  | # unbalanced bwidth with opposed, unbalanced delay    2991 max 3245 [ OK ]
  | # unbalanced bwidth with opposed, unbalanced delay - reverse direction  2931 max 3245 [ OK ]

matttbe assigned pabeni and unassigned pabeni Feb 6, 2021
jenkins-tessares pushed a commit that referenced this issue Feb 7, 2021
It happened "Kernel panic - not syncing: hung_task: blocked tasks" when
test simulate crash and ifconfig down/rmmod meanwhile.

Test steps:

1. Test commands; any of them can reproduce the hang for PCIe, SDIO and SNOC:
echo soft > /sys/kernel/debug/ieee80211/phy0/ath10k/simulate_fw_crash;sleep 0.05;ifconfig wlan0 down
echo soft > /sys/kernel/debug/ieee80211/phy0/ath10k/simulate_fw_crash;rmmod ath10k_sdio
echo hw-restart > /sys/kernel/debug/ieee80211/phy0/ath10k/simulate_fw_crash;rmmod ath10k_pci

2. dmesg:
[ 5622.548630] ath10k_sdio mmc1:0001:1: simulating soft firmware crash
[ 5622.655995] ieee80211 phy0: Hardware restart was requested
[ 5776.355164] INFO: task shill:1572 blocked for more than 122 seconds.
[ 5776.355687] INFO: task kworker/1:2:24437 blocked for more than 122 seconds.
[ 5776.359812] Kernel panic - not syncing: hung_task: blocked tasks
[ 5776.359836] CPU: 1 PID: 55 Comm: khungtaskd Tainted: G        W         4.19.86 #137
[ 5776.359846] Hardware name: MediaTek krane sku176 board (DT)
[ 5776.359855] Call trace:
[ 5776.359868]  dump_backtrace+0x0/0x170
[ 5776.359881]  show_stack+0x20/0x2c
[ 5776.359896]  dump_stack+0xd4/0x10c
[ 5776.359916]  panic+0x12c/0x29c
[ 5776.359937]  hung_task_panic+0x0/0x50
[ 5776.359953]  kthread+0x120/0x130
[ 5776.359965]  ret_from_fork+0x10/0x18
[ 5776.359986] SMP: stopping secondary CPUs
[ 5776.360012] Kernel Offset: 0x141ea00000 from 0xffffff8008000000
[ 5776.360026] CPU features: 0x0,2188200c
[ 5776.360035] Memory Limit: none

command "ifconfig wlan0 down" or "rmmod ath10k_sdio" will be blocked
callstack of ifconfig:
[<0>] __switch_to+0x120/0x13c
[<0>] msleep+0x28/0x38
[<0>] ath10k_sdio_hif_stop+0x24c/0x294 [ath10k_sdio]
[<0>] ath10k_core_stop+0x50/0x78 [ath10k_core]
[<0>] ath10k_halt+0x120/0x178 [ath10k_core]
[<0>] ath10k_stop+0x4c/0x8c [ath10k_core]
[<0>] drv_stop+0xe0/0x1e4 [mac80211]
[<0>] ieee80211_stop_device+0x48/0x54 [mac80211]
[<0>] ieee80211_do_stop+0x678/0x6f8 [mac80211]
[<0>] ieee80211_stop+0x20/0x30 [mac80211]
[<0>] __dev_close_many+0xb8/0x11c
[<0>] __dev_change_flags+0xe0/0x1d0
[<0>] dev_change_flags+0x30/0x6c
[<0>] devinet_ioctl+0x370/0x564
[<0>] inet_ioctl+0xdc/0x304
[<0>] sock_do_ioctl+0x50/0x288
[<0>] compat_sock_ioctl+0x1b4/0x1aac
[<0>] __se_compat_sys_ioctl+0x100/0x26fc
[<0>] __arm64_compat_sys_ioctl+0x20/0x2c
[<0>] el0_svc_common+0xa4/0x154
[<0>] el0_svc_compat_handler+0x2c/0x38
[<0>] el0_svc_compat+0x8/0x18
[<0>] 0xffffffffffffffff

callstack of rmmod:
[<0>] __switch_to+0x120/0x13c
[<0>] msleep+0x28/0x38
[<0>] ath10k_sdio_hif_stop+0x294/0x31c [ath10k_sdio]
[<0>] ath10k_core_stop+0x50/0x78 [ath10k_core]
[<0>] ath10k_halt+0x120/0x178 [ath10k_core]
[<0>] ath10k_stop+0x4c/0x8c [ath10k_core]
[<0>] drv_stop+0xe0/0x1e4 [mac80211]
[<0>] ieee80211_stop_device+0x48/0x54 [mac80211]
[<0>] ieee80211_do_stop+0x678/0x6f8 [mac80211]
[<0>] ieee80211_stop+0x20/0x30 [mac80211]
[<0>] __dev_close_many+0xb8/0x11c
[<0>] dev_close_many+0x70/0x100
[<0>] dev_close+0x4c/0x80
[<0>] cfg80211_shutdown_all_interfaces+0x50/0xcc [cfg80211]
[<0>] ieee80211_remove_interfaces+0x58/0x1a0 [mac80211]
[<0>] ieee80211_unregister_hw+0x40/0x100 [mac80211]
[<0>] ath10k_mac_unregister+0x1c/0x44 [ath10k_core]
[<0>] ath10k_core_unregister+0x38/0x7c [ath10k_core]
[<0>] ath10k_sdio_remove+0x8c/0xd0 [ath10k_sdio]
[<0>] sdio_bus_remove+0x48/0x108
[<0>] device_release_driver_internal+0x138/0x1ec
[<0>] driver_detach+0x6c/0xa8
[<0>] bus_remove_driver+0x78/0xa8
[<0>] driver_unregister+0x30/0x50
[<0>] sdio_unregister_driver+0x28/0x34
[<0>] cleanup_module+0x14/0x6bc [ath10k_sdio]
[<0>] __arm64_sys_delete_module+0x1e0/0x22c
[<0>] el0_svc_common+0xa4/0x154
[<0>] el0_svc_compat_handler+0x2c/0x38
[<0>] el0_svc_compat+0x8/0x18
[<0>] 0xffffffffffffffff

SNOC:
[  647.156863] Call trace:
[  647.162166] [<ffffff80080855a4>] __switch_to+0x120/0x13c
[  647.164512] [<ffffff800899d8b8>] __schedule+0x5ec/0x798
[  647.170062] [<ffffff800899dad8>] schedule+0x74/0x94
[  647.175050] [<ffffff80089a0848>] schedule_timeout+0x314/0x42c
[  647.179874] [<ffffff80089a0a14>] schedule_timeout_uninterruptible+0x34/0x40
[  647.185780] [<ffffff80082a494>] msleep+0x28/0x38
[  647.192546] [<ffffff800117ec4c>] ath10k_snoc_hif_stop+0x4c/0x1e0 [ath10k_snoc]
[  647.197439] [<ffffff80010dfbd8>] ath10k_core_stop+0x50/0x7c [ath10k_core]
[  647.204652] [<ffffff80010c8f48>] ath10k_halt+0x114/0x16c [ath10k_core]
[  647.211420] [<ffffff80010cad68>] ath10k_stop+0x4c/0x88 [ath10k_core]
[  647.217865] [<ffffff8000fdbf54>] drv_stop+0x110/0x244 [mac80211]
[  647.224367] [<ffffff80010147ac>] ieee80211_stop_device+0x48/0x54 [mac80211]
[  647.230359] [<ffffff8000ff3eec>] ieee80211_do_stop+0x6a4/0x73c [mac80211]
[  647.237033] [<ffffff8000ff4500>] ieee80211_stop+0x20/0x30 [mac80211]
[  647.243942] [<ffffff80087e39b8>] __dev_close_many+0xa0/0xfc
[  647.250435] [<ffffff80087e3888>] dev_close_many+0x70/0x100
[  647.255651] [<ffffff80087e3a60>] dev_close+0x4c/0x80
[  647.261244] [<ffffff8000f1ba54>] cfg80211_shutdown_all_interfaces+0x44/0xcc [cfg80211]
[  647.266383] [<ffffff8000ff3fdc>] ieee80211_remove_interfaces+0x58/0x1b4 [mac80211]
[  647.274128] [<ffffff8000fda540>] ieee80211_unregister_hw+0x50/0x120 [mac80211]
[  647.281659] [<ffffff80010ca314>] ath10k_mac_unregister+0x1c/0x44 [ath10k_core]
[  647.288839] [<ffffff80010dfc94>] ath10k_core_unregister+0x48/0x90 [ath10k_core]
[  647.296027] [<ffffff800117e598>] ath10k_snoc_remove+0x5c/0x150 [ath10k_snoc]
[  647.303229] [<ffffff80085625fc>] platform_drv_remove+0x28/0x50
[  647.310517] [<ffffff80085601a4>] device_release_driver_internal+0x114/0x1b8
[  647.316257] [<ffffff80085602e4>] driver_detach+0x6c/0xa8
[  647.323021] [<ffffff800855e5b8>] bus_remove_driver+0x78/0xa8
[  647.328571] [<ffffff800856107c>] driver_unregister+0x30/0x50
[  647.334213] [<ffffff8008562674>] platform_driver_unregister+0x1c/0x28
[  647.339876] [<ffffff800117fefc>] cleanup_module+0x1c/0x120 [ath10k_snoc]
[  647.346196] [<ffffff8008143ab8>] SyS_delete_module+0x1dc/0x22c

PCIe:
[  615.392770] rmmod           D    0  3523   3458 0x00000080
[  615.392777] Call Trace:
[  615.392784]  __schedule+0x617/0x7d3
[  615.392791]  ? __mod_timer+0x263/0x35c
[  615.392797]  schedule+0x62/0x72
[  615.392803]  schedule_timeout+0x8d/0xf3
[  615.392809]  ? run_local_timers+0x6b/0x6b
[  615.392814]  msleep+0x1b/0x22
[  615.392824]  ath10k_pci_hif_stop+0x68/0xd6 [ath10k_pci]
[  615.392844]  ath10k_core_stop+0x44/0x67 [ath10k_core]
[  615.392859]  ath10k_halt+0x102/0x153 [ath10k_core]
[  615.392873]  ath10k_stop+0x38/0x75 [ath10k_core]
[  615.392893]  drv_stop+0x9a/0x13c [mac80211]
[  615.392915]  ieee80211_do_stop+0x772/0x7cd [mac80211]
[  615.392937]  ieee80211_stop+0x1a/0x1e [mac80211]
[  615.392945]  __dev_close_many+0x9e/0xf0
[  615.392952]  dev_close_many+0x62/0xe8
[  615.392958]  dev_close+0x54/0x7d
[  615.392975]  cfg80211_shutdown_all_interfaces+0x6e/0xa5 [cfg80211]
[  615.393021]  ieee80211_remove_interfaces+0x52/0x1aa [mac80211]
[  615.393049]  ieee80211_unregister_hw+0x54/0x136 [mac80211]
[  615.393068]  ath10k_mac_unregister+0x19/0x4a [ath10k_core]
[  615.393091]  ath10k_core_unregister+0x39/0x7e [ath10k_core]
[  615.393104]  ath10k_pci_remove+0x3d/0x7f [ath10k_pci]
[  615.393117]  pci_device_remove+0x41/0xa6
[  615.393129]  device_release_driver_internal+0x123/0x1ec
[  615.393140]  driver_detach+0x60/0x90
[  615.393152]  bus_remove_driver+0x72/0x9f
[  615.393164]  pci_unregister_driver+0x1e/0x87
[  615.393177]  SyS_delete_module+0x1d7/0x277
[  615.393188]  do_syscall_64+0x6b/0xf7
[  615.393199]  entry_SYSCALL_64_after_hwframe+0x41/0xa6

The test command runs simulate_fw_crash first, which calls into
ath10k_sdio_hif_stop from ath10k_core_restart; napi_disable is then
called and the NAPI_STATE_SCHED bit is set. After that,
ath10k_sdio_hif_stop is called again from ath10k_stop by
"ifconfig wlan0 down" or "rmmod ath10k_sdio", and that command blocks.

It blocks in napi_synchronize: napi_disable sets the NAPI_STATE_SCHED
bit and leaves it set, so napi_synchronize loops forever waiting for
that bit to clear.

The napi_synchronize function:
static inline void napi_synchronize(const struct napi_struct *n)
{
	if (IS_ENABLED(CONFIG_SMP))
		while (test_bit(NAPI_STATE_SCHED, &n->state))
			msleep(1);
	else
		barrier();
}

The napi_disable function:
void napi_disable(struct napi_struct *n)
{
	might_sleep();
	set_bit(NAPI_STATE_DISABLE, &n->state);

	while (test_and_set_bit(NAPI_STATE_SCHED, &n->state))
		msleep(1);
	while (test_and_set_bit(NAPI_STATE_NPSVC, &n->state))
		msleep(1);

	hrtimer_cancel(&n->timer);

	clear_bit(NAPI_STATE_DISABLE, &n->state);
}

Add a flag to avoid the hang and crash.
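
A hedged sketch of that flag-based fix (the flag name ATH10K_FLAG_NAPI_ENABLED and the helper names below are assumptions for illustration, not necessarily the exact upstream patch): remember whether NAPI is currently enabled and let only the first stop path disable it, so a later stop does not spin on the still-set NAPI_STATE_SCHED bit.

static void ath10k_napi_enable(struct ath10k *ar)
{
	napi_enable(&ar->napi);
	set_bit(ATH10K_FLAG_NAPI_ENABLED, &ar->dev_flags);
}

static void ath10k_napi_disable(struct ath10k *ar)
{
	/* only the path that actually enabled NAPI disables it; a later
	 * stop from "ifconfig wlan0 down" or "rmmod" after the simulated
	 * firmware crash returns here instead of looping forever in
	 * napi_synchronize()/napi_disable() */
	if (!test_and_clear_bit(ATH10K_FLAG_NAPI_ENABLED, &ar->dev_flags))
		return;

	napi_synchronize(&ar->napi);
	napi_disable(&ar->napi);
}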

Tested-on: QCA6174 hw3.2 SDIO WLAN.RMH.4.4.1-00049
Tested-on: QCA6174 hw3.2 PCI WLAN.RM.4.4.1-00110-QCARMSWP-1
Tested-on: WCN3990 hw1.0 SNOC hw1.0 WLAN.HL.3.1-01307.1-QCAHLSWMTPL-2

Signed-off-by: Wen Gong <[email protected]>
Signed-off-by: Kalle Valo <[email protected]>
Link: https://lore.kernel.org/r/[email protected]

matttbe commented Mar 20, 2021

Just to track the frequency: my CI reported an issue related to this one (I got a few more that I didn't report).


# selftests: net/mptcp: simult_flows.sh
  | # balanced bwidth                                     4710 max 5005 [ OK ]
  | # balanced bwidth - reverse direction                 4740 max 5005 [ OK ]
  | # balanced bwidth with unbalanced delay               4718 max 5005 [ OK ]
  | # balanced bwidth with unbalanced delay - reverse direction  4722 max 5005 [ OK ]
  | # unbalanced bwidth                                   3057 max 3245 [ OK ]
  | # unbalanced bwidth - reverse direction               3317 max 3245  [ fail ]
  | # client exit code 0, server 0
  | #
  | # netns ns3-0-M9w5b8 socket stat for 10006:
  | # State Recv-Q Send-Q Local Address:Port Peer Address:PortProcess
  | #
  | # netns ns1-0-M9w5b8 socket stat for 10006:
  | # State     Recv-Q Send-Q    Local Address:Port  Peer Address:Port Process
  | # TIME-WAIT 0      0      10.0.2.1%ns1eth2:49039     10.0.3.3:10006 timer:(timewait,59sec,0)
  | #
  | # TIME-WAIT 0      0              10.0.1.1:58034     10.0.3.3:10006 timer:(timewait,59sec,0)
  | #
  | # -rw------- 1 root root 81920 Mar 20 07:25 /tmp/tmp.9ZcRsEw0wa
  | # -rw------- 1 root root 81920 Mar 20 07:25 /tmp/tmp.g0U6WlP4cK
  | # -rw------- 1 root root 8388608 Mar 20 07:25 /tmp/tmp.Ro5hEpAAi7
  | # -rw------- 1 root root 8388608 Mar 20 07:25 /tmp/tmp.hq2PtYgzIT
  | # unbalanced bwidth with unbalanced delay             3138 max 3245 [ OK ]
  | # unbalanced bwidth with unbalanced delay - reverse direction  3144 max 3245 [ OK ]
  | # unbalanced bwidth with opposed, unbalanced delay    3053 max 3245 [ OK ]
  | # unbalanced bwidth with opposed, unbalanced delay - reverse direction  3094 max 3245 [ OK ]

Should we be more tolerant with this one?


matttbe commented Aug 31, 2021

@pabeni Here are some captures: captures.zip

For this output (I stripped tcpdump's output; no packets were dropped by the kernel):

10001: balanced bwidth                                     4845 max 5005 [ OK ]
10002: balanced bwidth - reverse direction                 4841 max 5005 [ OK ]
10003: balanced bwidth with unbalanced delay               4823 max 5005 [ OK ]
10004: balanced bwidth with unbalanced delay - reverse direction  4866 max 5005 [ OK ]
10005: unbalanced bwidth                                   3100 max 3245 [ OK ]
10006: unbalanced bwidth - reverse direction               3412 max 3245  [ fail ]
client exit code 0, server 0
netns ns3-0-YUItiE socket stat for 10006:
State       Recv-Q       Send-Q             Local Address:Port             Peer Address:Port       Process
netns ns1-0-YUItiE socket stat for 10006:
State       Recv-Q   Send-Q         Local Address:Port        Peer Address:Port    Process
TIME-WAIT   0        0           10.0.2.1%ns1eth2:52117           10.0.3.3:10006    timer:(timewait,57sec,0)
TIME-WAIT   0        0                   10.0.1.1:39534           10.0.3.3:10006    timer:(timewait,57sec,0)
-rw------- 1 root root 81920 Aug 31 19:22 /tmp/tmp.fDGja90ohq
-rw------- 1 root root 81920 Aug 31 19:23 /tmp/tmp.moeid9Udot
-rw------- 1 root root 8388608 Aug 31 19:23 /tmp/tmp.bv2vSaWBdK
-rw------- 1 root root 8388608 Aug 31 19:22 /tmp/tmp.zOWCj1JnVT

10007: unbalanced bwidth with unbalanced delay             3332 max 3245  [ fail ]
client exit code 0, server 0
netns ns3-0-YUItiE socket stat for 10007:
State        Recv-Q    Send-Q       Local Address:Port        Peer Address:Port    Process
TIME-WAIT    0         0                 10.0.3.3:10007           10.0.2.1:48961    timer:(timewait,58sec,0)
TIME-WAIT    0         0                 10.0.3.3:10007           10.0.1.1:48018    timer:(timewait,58sec,0)
netns ns1-0-YUItiE socket stat for 10007:
State       Recv-Q       Send-Q             Local Address:Port             Peer Address:Port       Process
-rw------- 1 root root 8388608 Aug 31 19:23 /tmp/tmp.moeid9Udot
-rw------- 1 root root 8388608 Aug 31 19:22 /tmp/tmp.zOWCj1JnVT
-rw------- 1 root root 81920 Aug 31 19:23 /tmp/tmp.bv2vSaWBdK
-rw------- 1 root root 81920 Aug 31 19:22 /tmp/tmp.fDGja90ohq

10008: unbalanced bwidth with unbalanced delay - reverse direction  3197 max 3245 [ OK ]
10009: unbalanced bwidth with opposed, unbalanced delay    3324 max 3245  [ fail ]
client exit code 0, server 0
netns ns3-0-YUItiE socket stat for 10009:
State        Recv-Q    Send-Q       Local Address:Port        Peer Address:Port    Process
TIME-WAIT    0         0                 10.0.3.3:10009           10.0.2.1:47499    timer:(timewait,58sec,0)
TIME-WAIT    0         0                 10.0.3.3:10009           10.0.1.1:48380    timer:(timewait,58sec,0)
netns ns1-0-YUItiE socket stat for 10009:
State       Recv-Q       Send-Q             Local Address:Port             Peer Address:Port       Process
-rw------- 1 root root 8388608 Aug 31 19:24 /tmp/tmp.moeid9Udot
-rw------- 1 root root 8388608 Aug 31 19:22 /tmp/tmp.zOWCj1JnVT
-rw------- 1 root root 81920 Aug 31 19:24 /tmp/tmp.bv2vSaWBdK
-rw------- 1 root root 81920 Aug 31 19:22 /tmp/tmp.fDGja90ohq

10010: unbalanced bwidth with opposed, unbalanced delay - reverse direction  3207 max 3245 [ OK ]

Do you also need a capture when "everything is OK"?

EDIT: this was with a "debug" kernel.

fengguang pushed a commit to 0day-ci/linux that referenced this issue Sep 6, 2021
We currently have some instabilities in the simult_flows test cases.
The problem boils down to the unneeded large wait introduced by the
tests to allow the MPJ handshake to complete, which can also
introduce a quite relevant variance.

Wait on a single side of the connection only, remove the delay at
shutdown time and tune the expected test time accordingly.

Closes: multipath-tcp/mptcp_net-next#137
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Sep 24, 2021
We queue an irq work for deferred processing of the MCE event in the
realmode MCE handler, where translation is disabled. Queuing the work
may result in accessing memory outside the RMO region; such an access
needs translation to be enabled for an LPAR running with the hash MMU,
otherwise the kernel crashes.

After enabling translation in mce_handle_error() we used to leave it
enabled to avoid crashing here, but now with commit
74c3354 ("powerpc/pseries/mce: restore msr before returning from
handler") we restore the MSR, which disables translation again.

Hence, to fix this, enable translation before queuing the work.
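
A hedged sketch of that idea (the irq_work name and the exact LPAR/hash-MMU guard below are assumptions for illustration, not necessarily the final upstream patch): enable instruction and data translation around the irq_work_queue() call and restore the previous MSR afterwards.

void machine_check_queue_event(void)
{
	unsigned long msr;

	/* ... record the MCE event as before ... */

	if (!radix_enabled() && firmware_has_feature(FW_FEATURE_LPAR)) {
		/* irq_work_queue() may touch memory outside the RMO
		 * region, so temporarily enable translation */
		msr = mfmsr();
		mtmsr(msr | MSR_IR | MSR_DR);
		irq_work_queue(&mce_event_process_work);
		mtmsr(msr);
	} else {
		irq_work_queue(&mce_event_process_work);
	}
}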

Without this change the following trace is seen when injecting an SLB
multihit in an LPAR running with the hash MMU.

  Oops: Kernel access of bad area, sig: 11 [#1]
  LE PAGE_SIZE=64K MMU=Hash SMP NR_CPUS=2048 NUMA pSeries
  CPU: 5 PID: 1883 Comm: insmod Tainted: G        OE     5.14.0-mce+ #137
  NIP:  c000000000735d60 LR: c000000000318640 CTR: 0000000000000000
  REGS: c00000001ebff9a0 TRAP: 0300   Tainted: G       OE      (5.14.0-mce+)
  MSR:  8000000000001003 <SF,ME,RI,LE>  CR: 28008228  XER: 00000001
  CFAR: c00000000031863c DAR: c00000027fa8fe08 DSISR: 40000000 IRQMASK: 0
  ...
  NIP llist_add_batch+0x0/0x40
  LR  __irq_work_queue_local+0x70/0xc0
  Call Trace:
    0xc00000001ebffc0c (unreliable)
    irq_work_queue+0x40/0x70
    machine_check_queue_event+0xbc/0xd0
    machine_check_early_common+0x16c/0x1f4

Fixes: 74c3354 ("powerpc/pseries/mce: restore msr before returning from handler")
Signed-off-by: Ganesh Goudar <[email protected]>
[mpe: Fix comment formatting, trim oops in change log for readability]
Signed-off-by: Michael Ellerman <[email protected]>
Link: https://lore.kernel.org/r/[email protected]
pabeni self-assigned this Oct 28, 2021
matttbe pushed a commit that referenced this issue Nov 4, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the MPTCP-level
cwin is closed, link utilization becomes suboptimal.

The solution is implementing a BLEST-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If that subflow's cwin is closed, we wait
even if other subflows are available.

This is much simpler than the original BLEST implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimate of the subflow linger time, we maintain a
per-subflow weighted average of it.

Additionally, drop the use of magic numbers in favor of newly defined
macros and use more meaningful names for the status variables.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf
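
A hedged sketch of the "shortest time to flush" selection described above (the helper name and the exact weighting are illustrative assumptions, not the literal upstream scheduler): estimate each subflow's linger time as the bytes already queued on it divided by its pacing rate, and transmit on the subflow with the smallest estimate.

static struct sock *mptcp_pick_fastest_subflow(struct mptcp_sock *msk)
{
	struct mptcp_subflow_context *subflow;
	struct sock *best = NULL;
	u64 best_linger = U64_MAX;

	mptcp_for_each_subflow(msk, subflow) {
		struct sock *ssk = mptcp_subflow_tcp_sock(subflow);
		unsigned long pace = READ_ONCE(ssk->sk_pacing_rate);
		u64 linger;

		if (!pace)
			continue;

		/* time to drain the data already queued on this subflow,
		 * scaled (<< 32) to keep precision on the division */
		linger = div64_u64((u64)READ_ONCE(ssk->sk_wmem_queued) << 32,
				   pace);
		if (linger < best_linger) {
			best_linger = linger;
			best = ssk;
		}
	}
	return best;
}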

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe closed this as completed in a92c4f2 Nov 4, 2021
matttbe pushed a commit that referenced this issue Nov 4, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Nov 5, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Nov 12, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Nov 12, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Nov 13, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Nov 15, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Nov 16, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Nov 17, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Nov 18, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Nov 19, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Nov 19, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Nov 19, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Nov 19, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Nov 29, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Nov 30, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Dec 1, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Dec 2, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Dec 2, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Dec 2, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Dec 2, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Dec 3, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Dec 3, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Dec 4, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
jenkins-tessares pushed a commit that referenced this issue Dec 7, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Dec 7, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>
matttbe pushed a commit that referenced this issue Dec 7, 2021
The MPTCP packet scheduler has sub-optimal behavior with asymmetric
subflows: if the faster subflow-level cwin is closed, the packet
scheduler can enqueue "too much" data on a slower subflow.

When all the data on the faster subflow is acked, if the mptcp-level
cwin is closed, and link utilization becomes suboptimal.

The solution is implementing blest-like[1] HoL-blocking estimation,
transmitting only on the subflow with the shorter estimated time to
flush the queued memory. If such subflows cwin is closed, we wait
even if other subflows are available.

This is quite simpler than the original blest implementation, as we
leverage the pacing rate provided by the TCP socket. To get a more
accurate estimation for the subflow linger-time, we maintain a
per-subflow weighted average of such info.

Additionally drop magic numbers usage in favor of newly defined
macros and use more meaningful names for status variable.

[1] http://dl.ifip.org/db/conf/networking/networking2016/1570234725.pdf

Closes: #137
Reviewed-by: Matthieu Baerts <[email protected]>
Signed-off-by: Paolo Abeni <[email protected]>