From: Marco De Vitis on
Hi,
I need to understand what's wrong in a SCSI storage enclosure like this:
http://www.dmi-inc.com/pdf/raidtec/flexarray_hi12.pdf

It was mounted on a Linux system, and suddenly appeared to have bad
filesystem errors.
I couldn't work around the problem in any way through software.
I'd like to check its RAID configuration and see if just some HDs are
damaged, but I can't find a way to do it!

Its LCD panel, its serial port and a related Windows application (the
latter two according to the manual, as I didn't have a way to check
myself) are only used for a few configuration options like alarms etc.
By the way, the only error shown in the LCD panel is about a fan
receiving low current.

How on earth is that thing actually configured?? I mean RAID levels, etc...

Any clues? Thanks.

--
Ciao,
Marco.
From: Arno on
Marco De Vitis <starless(a)spin.it> wrote:
> Hi,
> I need to understand what's wrong in a SCSI storage enclosure like this:
> http://www.dmi-inc.com/pdf/raidtec/flexarray_hi12.pdf

> It was mounted on a Linux system, and suddenly appeared to have bad
> filesystem errors.
> I couldn't work around the problem in any way through software.
> I'd like to check its RAID configuration and see if just some HDs are
> damaged, but I can't find a way to do it!

> Its LCD panel, its serial port and a related Windows application (the
> latter two according to the manual, as I didn't have a way to check
> myself) are only used for a few configuration options like alarms etc.
> By the way, the only error shown in the LCD panel is about a fan
> receiving low current.

> How on earth is that thing actually configured?? I mean RAID levels, etc...

> Any clues? Thanks.

First, if the device itself and its monitoring software does not
display any problem, then this may actually not be a problem with
the storage. Can you post the exac first error messages from the
Linux system log here?

As to the hardware, I think this may be SCSI-Raid controller on the
drive side with an SCSI host interface showing the RAID array as a
single drive. If so, you only chance may be the Windows application,
and that immediately shows the stupidity of such a setup.

If you want to test the individual disks, you habe a differenc
option: Stop this thing and test the disks one by one in a
different computer. This is SCSI, so a bit different procedure
to the usual SMART stats query is needed. What you can do is
read the complete disks (dd_rescue <disk> /dev/null, e.g.)
and then query the SCSI device error pages (scsictl, if I
remember correctly).

Incidentially, this shows (again) the superiority of software
RAID, were you do not have to jumt through hoops like inadequate
proprietary management software.


Arno
From: Marco De Vitis on
Il 20-09-2009 15:14, Arno ha scritto:

> the storage. Can you post the exac first error messages from the
> Linux system log here?

Hard to remember, also because it's not mine.
But it was first noticed that some files were unreadable, and the error
was something like "short read".
Then, when trying to reinitialize the thing (mk2fs etc.), things got
even worse, and it seemed to "lock" during access attempts, starting to
emit a sort "beep - beep" sound alert which, of course, the manual does
not mention at all :-P.

> If you want to test the individual disks, you habe a differenc

This can be a bit hard in this specific case, I'd prefer to find out how
to manage the thing and let it make its checks...

--
Ciao,
Marco.
From: Arno on
Marco De Vitis <starless(a)spin.it> wrote:
> Il 20-09-2009 15:14, Arno ha scritto:

>> the storage. Can you post the exac first error messages from the
>> Linux system log here?

> Hard to remember, also because it's not mine.
> But it was first noticed that some files were unreadable, and the error
> was something like "short read".

THis basically only means the device stopped to supply data. It
can be anything, and does not need to be a disk problem.

> Then, when trying to reinitialize the thing (mk2fs etc.), things got
> even worse, and it seemed to "lock" during access attempts, starting to
> emit a sort "beep - beep" sound alert which, of course, the manual does
> not mention at all :-P.

Ah, but this is not a data-recovery operation then? Good.

>> If you want to test the individual disks, you habe a differenc

> This can be a bit hard in this specific case, I'd prefer to find out how
> to manage the thing and let it make its checks...

Understandable. Regard the manual drive check as a fallback option
then or as the "we could do this, but it is not really cost
effective" option.

Arno
From: Marco De Vitis on
Il 21-09-2009 0:05, Arno ha scritto:

> Ah, but this is not a data-recovery operation then? Good.

Exactly.
We (they) have a backup of the data. We just need to know if the array
is still usable.

--
Ciao,
Marco.