View Issue Details

IDProjectCategoryView StatusLast Update
0006205Rocky-Linux-8kernelpublic2024-07-16 02:46
ReporterGiannis Economou Assigned To 
PrioritynormalSeveritymajorReproducibilityhave not tried
Status newResolutionopen 
Summary0006205: aacraid problem on kernel 4.18.0-513.18.1
DescriptionAfter rebooting a server to the latest kernel, our Adaptec RAID controller started giving a lot or errors.
Seems related to this: https://bugzilla.kernel.org/show_bug.cgi?id=217599

Before finding out that this is kernel related, we changed RAID controller (same model), cables etc.
Problem was not resolved, but was immediately resolved after booting back to 4.18.0-425.13.1

Controller Model (from arcconf util) is: Adaptec ASR8405

From dmesg we where having many messages like those below.

[28081.438826] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[28287.848651] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[31271.793465] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[34331.562419] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[36861.330001] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[37721.047053] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[38049.512860] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[38211.972057] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[40502.146839] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[42225.219737] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[42431.029251] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[43139.577598] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[43378.654203] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[50770.782297] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[50996.711065] aacraid 0000:08:00.0: outstanding cmd: error handler-0
[51066.921025] sd 0:0:3:0: Device offlined - not ready after error recovery
[51066.924818] sd 0:0:3:0: Device offlined - not ready after error recovery
[51066.928570] sd 0:0:3:0: Device offlined - not ready after error recovery
[51066.932280] sd 0:0:3:0: Device offlined - not ready after error recovery
[51066.932282] sd 0:0:3:0: Device offlined - not ready after error recovery
[51066.935983] blk_update_request: I/O error, dev sda, sector 246629848 op 0x0:(READ) flags 0x80700 phys_seg 1 prio class 0
[51066.939742] sd 0:0:3:0: Device offlined - not ready after error recovery
[51066.943928] blk_update_request: I/O error, dev sda, sector 246629848 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
[51066.947333] sd 0:0:3:0: Device offlined - not ready after error recovery
[51066.959533] sd 0:0:3:0: Device offlined - not ready after error recovery
[51066.963643] sd 0:0:3:0: Device offlined - not ready after error recovery
[51066.967699] sd 0:0:3:0: Device offlined - not ready after error recovery

Steps To ReproduceJust boot to
TagsNo tags attached.

Activities

Akemi Yagi

Akemi Yagi

2024-03-31 17:01

reporter   ~0006568

You might want to do a test install of ELRepo's kernel to see if the latest kernel from upstream (kernel.org) fixes the issue.

kernel-lt ( https://elrepo.org/wiki/doku.php?id=kernel-lt )
or
kernel-ml ( https://elrepo.org/wiki/doku.php?id=kernel-ml )
Giannis Economou

Giannis Economou

2024-06-02 08:04

reporter   ~0007294

@Akemi Yagi As I wrote on my original message, all errors do not appear after we booted back to 4.18.0-425.13.1
So we are not having the errors now.
But we had to do a dnf versionlock to avoid kernel updates unfortunately (until a fix lands in rocky8).
Akemi Yagi

Akemi Yagi

2024-06-25 17:14

reporter   ~0007690

I have built the kmod-aacraid package using the patch referenced in https://bugzilla.kernel.org/show_bug.cgi?id=217599 (comment c63) and released it to the elrepo testing repository.

If you have elrepo enabled, you can install it by running:

sudo dnf --enablerepo=elrepo-testing install kmod-aacraid

Or you can download the kmod rpm:

https://elrepo.org/linux/testing/el8/x86_64/RPMS/kmod-aacraid-1.2.1-11.1.el8_10.elrepo.x86_64.rpm
Akemi Yagi

Akemi Yagi

2024-06-27 18:46

reporter   ~0007723

@Giannis Economou

Once I get a positive response, I will move the kmod package to the main repository.
Akemi Yagi

Akemi Yagi

2024-07-16 02:46

reporter   ~0007888

The kmod-aacraid package has now been moved to the elrepo main repository.

Issue History

Date Modified Username Field Change
2024-03-31 10:18 Giannis Economou New Issue
2024-03-31 17:01 Akemi Yagi Note Added: 0006568
2024-06-02 08:04 Giannis Economou Note Added: 0007294
2024-06-25 17:14 Akemi Yagi Note Added: 0007690
2024-06-27 18:46 Akemi Yagi Note Added: 0007723
2024-07-16 02:46 Akemi Yagi Note Added: 0007888