The original post: /r/datahoarder by /u/dp3471 on 2025-01-19 04:37:49.
First, I understand r/techsuport exists, just keep reading.
Here's the situation:
-
Bought 2x10TB sas drives + lsi-9361-8i card for $110 for data storage, and a mini sas + sata power -> sas drive breakout x4 cable (+ tiny fan to cool the cheap heatsink)
-
plugged in 1 drive (to test out with Windows) at first. Showed up without me setting up anything, so I thought, might as well partition it, since everything seems to be plug-and-play right?
-
partitioned the drive to 100% capacity in Windows.
-
Installed storcli (from here after a couple attempts at super old versions and mismatching protocols) to check on the card, and plugged in 2nd drive to attempt to create a ~18 tb raid0. Whenever I tried to do anything to that drive, it freaked out and disconnected. Want to delete the virtual drive (which was automatically set up[??])? too bad, disconnect. Want to reset the drive's configuration? too bad, disconnect. Every disconnect required me to power cycle to get the drive to show up again.
-
Bios configuration did not help at all and I was absolutely stumped, so I flashed the card, and still no progress.
-
Today, I tried to wipe first and last GB of each drive with zeroes, factory reset card with drives plugged out, and still no hope
-
Tried using storcli's sanitize on the bad drive, which didn't work. Then, I tested it on the good drive. It works, but now I CAN'T STOP THE SANITIZATION!! No warning about this.
Honestly, this is driving me nuts. I understand I made a foolish assumption, but I feel like there is a level of firmware/software communication incompetence I can (am) blame.
Here are some more technical details [some things omitted for de-cluttering]:
CLI Version = 007.2309.0000.0000 Sep 16, 2022
Controller = 0
Model = AVAGO MegaRAID SAS 9361-8i
Serial Number = SK74377539
Current Controller Date/Time = 01/19/2025, 04:28:28
Current System Date/time = 01/18/2025, 23:28:28
SAS Address = 500605b00d860410
PCI Address = 00:03:00:00
Mfg Date = 10/27/17
Rework Date = 00/00/00
Revision No = 14C
Firmware Package Build = 24.21.0-0159
Firmware Version = 4.680.00-8577
CPLD Version = 26515-01A
Bios Version = 6.36.00.3_4.19.08.00_0x06180206
HII Version = 03.25.05.15
Ctrl-R Version = 5.19-0609
Preboot CLI Version = 01.07-05:#%0000
NVDATA Version = 3.1705.00-0028
Boot Block Version = 3.07.00.00-0004
Driver Name = megaraid_sas
Driver Version = 07.727.03.00-rc1
Here are relevant event logs:
seqNum: 0x000041bc
Time: Sun Jan 19 04:00:12 2025
Code: 0x00000072
Class: 0
Locale: 0x02
Event Description: State change on PD 17(e0xfc/s4) from ONLINE(18) to OFFLINE(10)
Event Data:
===========
Device ID: 23
Enclosure Index: 252
Slot Number: 4
Previous state: 24
New state: 16
seqNum: 0x000041bf
Time: Sun Jan 19 04:00:28 2025
Code: 0x000000e7
Class: 0
Locale: 0x42
Event Description: Marked Missing for PD 17(e0xfc/s4) on array 0 row 0
Event Data:
===========
Device ID: 23
Enclosure Index: 252
Slot Number: 4
Array: 0
Row: 0
seqNum: 0x000041c0
Time: Sun Jan 19 04:00:28 2025
Code: 0x00000072
Class: 0
Locale: 0x02
Event Description: State change on PD 17(e0xfc/s4) from OFFLINE(10) to UNCONFIGURED_GOOD(0)
Event Data:
===========
Device ID: 23
Enclosure Index: 252
Slot Number: 4
Previous state: 16
New state: 0
seqNum: 0x000041c2
Time: Sun Jan 19 04:00:31 2025
Code: 0x00000071
Class: 0
Locale: 0x02
Event Description: Unexpected sense: PD 17(e0xfc/s4) Path 5000cca3c400a2d1, CDB: 93 00 00 00 00 04 8c 30 00 00 00 00 00 01 00 00, Sense: 6/29/02
Event Data:
===========
Device ID: 23
Enclosure Index: 252
Slot Number: 4
CDB Length: 16
CDB Data:
0093 0000 0000 0000 0000 0004 008c 0030 0000 0000 0000 0000 0000 0001 0000 0000
Sense Length: 32
Sense Data:
0070 0000 0006 0000 0000 0000 0000 0018 0000 0000 0000 0000 0029 0002 0000 0000 0000 0000 0000 0000 00f5 0017 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
seqNum: 0x000041c3
Time: Sun Jan 19 04:00:31 2025
Code: 0x00000071
Class: 0
Locale: 0x02
Event Description: Unexpected sense: PD 17(e0xfc/s4) Path 5000cca3c400a2d1, CDB: 00 00 00 00 00 00, Sense: 2/04/01
Event Data:
===========
Device ID: 23
Enclosure Index: 252
Slot Number: 4
CDB Length: 6
CDB Data:
0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
Sense Length: 16
Sense Data:
0072 0002 0004 0001 0000 0000 0000 0008 0003 0002 0000 0000 0080 0002 00f5 0002 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000
seqNum: 0x000041c4
Time: Sun Jan 19 04:00:46 2025
Code: 0x0000010c
Class: 1
Locale: 0x02
Event Description: PD 17(e0xfc/s4) Path 5000cca3c400a2d1 reset (Type 03)
Event Data:
===========
Device ID: 23
Enclosure Index: 252
Slot Number: 4
Error: 3
seqNum: 0x000041c5
Time: Sun Jan 19 04:00:46 2025
Code: 0x00000070
Class: 1
Locale: 0x02
Event Description: Removed: PD 17(e0xfc/s4)
Event Data:
===========
Device ID: 23
Enclosure Index: 252
Slot Number: 4
seqNum: 0x000041c6
Time: Sun Jan 19 04:00:46 2025
Code: 0x000000f8
Class: 0
Locale: 0x02
Event Description: Removed: PD 17(e0xfc/s4) Info: enclPd=fc, scsiType=0, portMap=00, sasAddr=5000cca3c400a2d1,0000000000000000
Event Data:
===========
Device ID: 23
Enclosure Device ID: 252
Enclosure Index: 1
Slot Number: 4
SAS Address 1: 5000cca3c400a2d1
SAS Address 2: 0
seqNum: 0x000041c8
Time: Sun Jan 19 04:00:46 2025
Code: 0x00000072
Class: 0
Locale: 0x02
Event Description: State change on PD 17(e0xfc/s4) from UNCONFIGURED_GOOD(0) to UNCONFIGURED_BAD(1)
Event Data:
===========
Device ID: 23
Enclosure Index: 252
Slot Number: 4
Previous state: 0
New state: 1
Honestly, I'm at a complete loss. I would appreciate any help from people that know what they're doing.