[Intel-wired-lan] Urgent Kernel Bug NETDEV WATCHDOG ixgbe transmit queue 2 timed out after kernel 5.19.2 to 5.19.6

Martin Zaharinov micron10 at gmail.com
Thu Sep 8 20:12:07 UTC 2022


Hi Dave

Good i am here if need more info.
same problem is happen with i40e driver.
and may be this is bug in kernel.


Martin

На вт, 6.09.2022 г. в 22:02 ч. Switzer, David <david.switzer at intel.com>
написа:

>
>
> >-----Original Message-----
> >From: Intel-wired-lan <intel-wired-lan-bounces at osuosl.org> On Behalf Of
> >Martin Zaharinov
> >Sent: Saturday, September 3, 2022 2:54 AM
> >To: Eric Dumazet <eric.dumazet at gmail.com>; Eric Dumazet
> ><edumazet at google.com>; Fijalkowski, Maciej
> ><maciej.fijalkowski at intel.com>; Brandeburg, Jesse
> ><jesse.brandeburg at intel.com>; David S . Miller <davem at davemloft.net>;
> >Jakub Kicinski <kuba at kernel.org>; Paolo Abeni <pabeni at redhat.com>;
> >Nguyen, Anthony L <anthony.l.nguyen at intel.com>; netdev
> ><netdev at vger.kernel.org>; Dubel, Helena Anna
> ><helena.anna.dubel at intel.com>; intel-wired-lan at lists.osuosl.org
> >Subject: [Intel-wired-lan] Urgent Kernel Bug NETDEV WATCHDOG ixgbe
> >transmit queue 2 timed out after kernel 5.19.2 to 5.19.6
> >
> >Hi All
> Hello Martin!
>
> I'm Dave, a driver validation engineer at Intel Corp.
> >
> >
> >after move to release 5.19.x (2 and up to 6 ) start geting this bug
> report and
> >machine reboot automatic after that.
> I'm sorry you're having this issue, I'm working on reproducing your issue
> so that we can have our developers look into it.
>
> I will reach out to you if I have any questions.
>
> Have a great day!
> Dave
> >
> >With kernel 5.18 this problem is not happen.
> >
> >Machine run with 2x 10G Intel 82599 card in bonding .
> >its a simple router with 6 core cpu.
> >
> >Sep  3 10:05:39  [193378.949952][   C10] ------------[ cut here
> ]------------
> >Sep  3 10:05:39  [193378.949965][   C10] NETDEV WATCHDOG: eth1 (ixgbe):
> >transmit queue 2 timed out
> >Sep  3 10:05:39  [193378.949980][   C10] WARNING: CPU: 10 PID: 0 at
> >net/sched/sch_generic.c:529 dev_watchdog+0x167/0x170
> >Sep  3 10:05:39  [193378.949992][   C10] Modules linked in:
> >nf_conntrack_netlink nft_limit pppoe pppox ppp_generic slhc nft_nat
> >nft_chain_nat nf_tables team_mode_loadbalance team netconsole coretemp
> >ixgbe mdio_devres libphy mdio nf_nat_sip nf_conntrack_sip nf_nat_pptp
> >nf_conntrack_pptp nf_nat_tftp nf_conntrack_tftp nf_nat_ftp
> >nf_conntrack_ftp nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4
> >acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler rtc_cmos
> >Sep  3 10:05:39  [193378.950023][   C10] CPU: 10 PID: 0 Comm: swapper/10
> >Tainted: G           O      5.19.4 #1
> >Sep  3 10:05:39  [193378.950028][   C10] Hardware name: Supermicro Super
> >Server/X10SRD-F, BIOS 3.3 10/28/2020
> >Sep  3 10:05:39  [193378.950032][   C10] RIP:
> 0010:dev_watchdog+0x167/0x170
> >Sep  3 10:05:39  [193378.950037][   C10] Code: 28 e9 77 ff ff ff 48 89 df
> c6 05 95
> >3d c4 00 01 e8 9e 5a fb ff 48 89 c2 44 89 e1 48 89 de 48 c7 c7 f0 d0 ec
> a7 e8 c2 c2
> >13 00 <0f> 0b eb 85 0f 1f 44 00 00 41 55 41 54 55 53 48 8b 47 50 4c 8b 28
> >Sep  3 10:05:39  [193378.950043][   C10] RSP: 0018:ffff96320030cee8
> EFLAGS:
> >00010292
> >Sep  3 10:05:39  [193378.950048][   C10] RAX: 0000000000000039 RBX:
> >ffff898a4da00000 RCX: 0000000000000001
> >Sep  3 10:05:39  [193378.950053][   C10] RDX: 00000000ffffffea RSI:
> >00000000fffbffff RDI: 00000000fffbffff
> >Sep  3 10:05:39  [193378.950057][   C10] RBP: ffff898a4da003c0 R08:
> >0000000000000001 R09: 00000000fffbffff
> >Sep  3 10:05:39  [193378.950061][   C10] R10: ffff89919d600000 R11:
> >0000000000000003 R12: 0000000000000002
> >Sep  3 10:05:39  [193378.950065][   C10] R13: 0000000000000000 R14:
> >ffff89919fca07a8 R15: 0000000000000082
> >Sep  3 10:05:39  [193378.950070][   C10] FS:  0000000000000000(0000)
> >GS:ffff89919fc80000(0000) knlGS:0000000000000000
> >Sep  3 10:05:39  [193378.950074][   C10] CS:  0010 DS: 0000 ES: 0000 CR0:
> >0000000080050033
> >Sep  3 10:05:39  [193378.950078][   C10] CR2: 00007fb39f41d000 CR3:
> >00000001002fd003 CR4: 00000000003706e0
> >Sep  3 10:05:39  [193378.950082][   C10] DR0: 0000000000000000 DR1:
> >0000000000000000 DR2: 0000000000000000
> >Sep  3 10:05:39  [193378.950086][   C10] DR3: 0000000000000000 DR6:
> >00000000fffe0ff0 DR7: 0000000000000400
> >Sep  3 10:05:39  [193378.950090][   C10] Call Trace:
> >Sep  3 10:05:39  [193378.950094][   C10]  <IRQ>
> >Sep  3 10:05:39  [193378.950098][   C10]  ? pfifo_fast_destroy+0x30/0x30
> >Sep  3 10:05:39  [193378.950104][   C10]
> call_timer_fn.constprop.0+0x14/0x70
> >Sep  3 10:05:39  [193378.950110][   C10]  __run_timers.part.0+0x164/0x190
> >Sep  3 10:05:39  [193378.950116][   C10]  ?
> >__hrtimer_run_queues+0x143/0x1a0
> >Sep  3 10:05:39  [193378.950120][   C10]  ? ktime_get+0x30/0x90
> >Sep  3 10:05:39  [193378.950125][   C10]  run_timer_softirq+0x21/0x50
> >Sep  3 10:05:39  [193378.950130][   C10]  __do_softirq+0xaf/0x1d7
> >Sep  3 10:05:39  [193378.950136][   C10]  __irq_exit_rcu+0x9a/0xd0
> >Sep  3 10:05:39  [193378.950142][   C10]
> >sysvec_apic_timer_interrupt+0x66/0x80
> >Sep  3 10:05:39  [193378.950149][   C10]  </IRQ>
> >Sep  3 10:05:39  [193378.950152][   C10]  <TASK>
> >Sep  3 10:05:39  [193378.950155][   C10]
> >asm_sysvec_apic_timer_interrupt+0x16/0x20
> >Sep  3 10:05:39  [193378.950160][   C10] RIP:
> >0010:cpuidle_enter_state+0xb3/0x290
> >Sep  3 10:05:39  [193378.950167][   C10] Code: e8 d2 0d b0 ff 31 ff 49 89
> c5 e8 48
> >68 af ff 45 84 ff 74 12 9c 58 f6 c4 02 0f 85 cf 01 00 00 31 ff e8 81 b4
> b3 ff fb 45 85
> >f6 <0f> 88 d0 00 00 00 49 63 ce 48 6b f1 68 48 8b 04 24 4c 89 ea 48 29
> >Sep  3 10:05:39  [193378.950402][   C10] RSP: 0018:ffff96320014fe98
> EFLAGS:
> >00000202
> >Sep  3 10:05:39  [193378.950411][   C10] RAX: ffff89919fca6800 RBX:
> >ffff898a4206c800 RCX: 000000000000001f
> >Sep  3 10:05:39  [193378.950418][   C10] RDX: 0000afe08b9e69de RSI:
> >00000000238e3b7a RDI: 0000000000000000
> >Sep  3 10:05:39  [193378.950424][   C10] RBP: 0000000000000001 R08:
> >0000000000000002 R09: ffff89919fca5704
> >Sep  3 10:05:39  [193378.950430][   C10] R10: 0000000000000008 R11:
> >000000000000010b R12: ffffffffa8222f40
> >Sep  3 10:05:39  [193378.950436][   C10] R13: 0000afe08b9e69de R14:
> >0000000000000001 R15: 0000000000000000
> >Sep  3 10:05:39  [193378.950443][   C10]  ? cpuidle_enter_state+0x98/0x290
> >Sep  3 10:05:39  [193378.950451][   C10]  cpuidle_enter+0x24/0x40
> >Sep  3 10:05:39  [193378.950459][   C10]  cpuidle_idle_call+0xbb/0x100
> >Sep  3 10:05:39  [193378.950468][   C10]  do_idle+0x76/0xc0
> >Sep  3 10:05:39  [193378.950476][   C10]  cpu_startup_entry+0x14/0x20
> >Sep  3 10:05:39  [193378.950483][   C10]  start_secondary+0xd6/0xe0
> >Sep  3 10:05:39  [193378.950491][   C10]
> >secondary_startup_64_no_verify+0xd3/0xdb
> >Sep  3 10:05:39  [193378.950499][   C10]  </TASK>
> >Sep  3 10:05:39  [193378.950504][   C10] ---[ end trace 0000000000000000
> ]---
> >Sep  3 10:05:39  [193378.950513][   C10] ixgbe 0000:02:00.1 eth1:
> initiating reset
> >due to tx timeout
> >Sep  3 10:05:39  [193378.950525][T1766094] ixgbe 0000:02:00.1 eth1: Reset
> >adapter
> >Sep  3 10:10:02  [   30.021823][  T454] ixgbe 0000:02:00.1 eth1: NIC Link
> is Up 10
> >Gbps, Flow Control: None
> >
> >
> >
> >_______________________________________________
> >Intel-wired-lan mailing list
> >Intel-wired-lan at osuosl.org
> >https://lists.osuosl.org/mailman/listinfo/intel-wired-lan
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.osuosl.org/pipermail/intel-wired-lan/attachments/20220908/26adcf60/attachment-0001.html>


More information about the Intel-wired-lan mailing list