From: Alex Samad on
Hi

I have been getting a lot of these:

May 5 23:31:46 hufpuf kernel: ata2: timeout waiting for ADMA IDLE, stat=0x440
May 5 23:31:46 hufpuf kernel: ata2.00: exception Emask 0x0 SAct 0x7fffffff SErr 0x0 action 0x0
May 5 23:31:46 hufpuf kernel: ata2.00: CPB resp_flags 0x11: , CMD error
May 5 23:31:46 hufpuf kernel: ata2.00: cmd 61/08:00:42:e8:21/00:00:01:00:00/40 tag 0 ncq 4096 out
May 5 23:31:46 hufpuf kernel: res 41/04:00:cd:7d:91/04:00:31:00:00/40 Emask 0x1 (device error)
May 5 23:31:46 hufpuf kernel: ata2.00: status: { DRDY ERR }
May 5 23:31:46 hufpuf kernel: ata2.00: error: { ABRT }
May 5 23:31:46 hufpuf kernel: ata2.00: cmd 61/08:08:1a:e8:25/00:00:01:00:00/40 tag 1 ncq 4096 out
May 5 23:31:46 hufpuf kernel: res 41/04:08:1a:e8:25/24:00:01:00:00/40 Emask 0x401 (device error) <F>
May 5 23:31:46 hufpuf kernel: ata2.00: status: { DRDY ERR }



and finding my machine in a hung state (Caps Lock and the third LED
flashing). I have had to reset the machine 3 times in the last 3
weeks.

I am guessing it is a hard drive problem, because I recently bought new
hard drives.

I am trying to figure out whether it is sda or sdb. Can you tell from
ata2.00? (If so, how?)

Alex
--
Objects are lost only because people look where they are not rather than
where they are.
From: Douglas A. Tutty on
On Tue, May 06, 2008 at 07:10:34AM +1000, Alex Samad wrote:
>
> May 5 23:31:46 hufpuf kernel: ata2: timeout waiting for ADMA IDLE, stat=0x440
> May 5 23:31:46 hufpuf kernel: ata2.00: exception Emask 0x0 SAct 0x7fffffff SErr 0x0 action 0x0
> May 5 23:31:46 hufpuf kernel: ata2.00: CPB resp_flags 0x11: , CMD error
> May 5 23:31:46 hufpuf kernel: ata2.00: cmd 61/08:00:42:e8:21/00:00:01:00:00/40 tag 0 ncq 4096 out
> May 5 23:31:46 hufpuf kernel: res 41/04:00:cd:7d:91/04:00:31:00:00/40 Emask 0x1 (device error)
> May 5 23:31:46 hufpuf kernel: ata2.00: status: { DRDY ERR }
> May 5 23:31:46 hufpuf kernel: ata2.00: error: { ABRT }
> May 5 23:31:46 hufpuf kernel: ata2.00: cmd 61/08:08:1a:e8:25/00:00:01:00:00/40 tag 1 ncq 4096 out
> May 5 23:31:46 hufpuf kernel: res 41/04:08:1a:e8:25/24:00:01:00:00/40 Emask 0x401 (device error) <F>
> May 5 23:31:46 hufpuf kernel: ata2.00: status: { DRDY ERR }
>
> and finding my machine in a hung state (Caps Lock and the third LED
> flashing). I have had to reset the machine 3 times in the last 3
> weeks.
>
> I am guessing it is a hard drive problem, because I recently bought new
> hard drives.
>
> I am trying to figure out whether it is sda or sdb. Can you tell from
> ata2.00? (If so, how?)

First, ensure you have good recent backups.

Then install smartmontools and run a long S.M.A.R.T. test and check the
results.
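
A minimal sketch of that step in Python, assuming smartmontools is installed and the script runs as root. The device path /dev/sdb is only an example, and the helper names are made up for illustration. It builds the smartctl command lines (-t long starts the extended self-test, -l selftest prints the self-test log, -H prints the overall health verdict) and leaves the actual run commented out:

```python
import subprocess


def smart_commands(device):
    """Return the smartctl invocations for a long self-test on `device`.

    `device` (for example "/dev/sdb") is an assumption; substitute your own.
    """
    return {
        "start_long_test": ["smartctl", "-t", "long", device],
        "selftest_log": ["smartctl", "-l", "selftest", device],
        "health": ["smartctl", "-H", device],
    }


def run(cmd):
    """Run one command and return its stdout as text.

    check=False because smartctl uses a non-zero exit status as a bit mask
    of findings, not only for hard failures.
    """
    result = subprocess.run(cmd, capture_output=True, text=True, check=False)
    return result.stdout


if __name__ == "__main__":
    cmds = smart_commands("/dev/sdb")  # hypothetical device name
    for name, cmd in cmds.items():
        print(name, "->", " ".join(cmd))
    # Uncomment to start the test; it runs inside the drive and can take hours.
    # print(run(cmds["start_long_test"]))
    # Once it has finished, read the results:
    # print(run(cmds["selftest_log"]))
```

On a RAID1 pair it is worth doing this for both members, so the two self-test logs can be compared.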

Why did you recently buy new drives? If it was because old drives
failed, they may have damaged the controller and you're seeing
controller failure instead of drive failure.

Doug.


--
To UNSUBSCRIBE, email to debian-user-REQUEST(a)lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster(a)lists.debian.org
From: Alex Samad on
On Tue, May 06, 2008 at 10:17:23AM -0400, Douglas A. Tutty wrote:
> On Tue, May 06, 2008 at 06:08:47PM +1000, Alex Samad wrote:
> > On Mon, May 05, 2008 at 09:33:19PM -0400, Douglas A. Tutty wrote:
> > > On Tue, May 06, 2008 at 07:10:34AM +1000, Alex Samad wrote:
> > > >
> >
> > [snip]
> >
> > > > ?(if so how)
> > >
> > > First, ensure you have good recent backups.
> > >
> > > Then install smartmontools and run a long S.M.A.R.T. test and check the
> > > results.
> > >
> > > Why did you recently buy new drives? If it was because old drives
> > > failed, they may have damaged the controller and you're seeing
> > > controller failure instead of drive failure.
> >
> > Upgraded drives, 500G -> 1TB. This is one drive of a RAID1 md, and I have
> > onsite and offsite backups.
> >
> > I just want to make sure it's sdb, not sda.
>
> You snipped so much, I forget what the problem was. IIRC, you had drive
> errors showing up in syslog on an ata controller and you didn't know
> which drive was the culprit.
Sorry, and yes.

>
> Since it's RAID, see what the RAID status is. Does it show an active
> RAID with two copies synced, or does it show degraded status? mdadm
> should email you about a problem, but it doesn't hurt to check.
>
My main question was whether ata2 == sdb and ata1 == sda. It seems like
it does.
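
One way to check this rather than guess, sketched in Python. The sdX letters are handed out in probe order, so ata2 == sdb is likely on a two-disk box but not guaranteed. The sketch assumes a kernel whose resolved sysfs path for a disk contains an ataN component; that holds on recent kernels, but it may not on the kernel from this thread, where matching the ata2.00 probe lines in dmesg against the drive model or serial is the fallback. The function names and the sample path in the comment are illustrative only.

```python
import os
import re


def ata_port_from_sysfs_path(path):
    """Extract the libata port name (e.g. "ata2") from a resolved sysfs path.

    Example input (illustrative):
    /sys/devices/pci0000:00/0000:00:1f.2/ata2/host1/target1:0:0/1:0:0:0/block/sdb
    Returns None when the path has no ataN component.
    """
    match = re.search(r"/(ata\d+)/", path)
    return match.group(1) if match else None


def map_disks_to_ata_ports(sys_block="/sys/block"):
    """Map each sdX under /sys/block to its ataN port, where one can be found."""
    mapping = {}
    for name in sorted(os.listdir(sys_block)):
        if not name.startswith("sd"):
            continue
        # /sys/block/sdX is a symlink into the device tree; resolve it.
        resolved = os.path.realpath(os.path.join(sys_block, name))
        mapping[name] = ata_port_from_sysfs_path(resolved)
    return mapping


if __name__ == "__main__":
    if os.path.isdir("/sys/block"):
        for disk, port in map_disks_to_ata_ports().items():
            print(disk, "->", port)
```

A disk that shows None here is either not behind libata (USB, for instance) or is on a kernel that does not expose the port in the path.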

My RAID set was okay; whenever the machine hung and was rebooted, there
would be a resync.

> If you have a third hard drive as a spare, you could add it to the
> array, let it sync, then remove one of the original drives and see if
> the error goes away.
I got another one, replaced it, and reseated the SATA cable. It came back
to me that I had problems in this drive bay before: a loose cable!
(Hopefully it is not the controller.) But this is one of those boxes that
I have had problems with from the beginning.

Thanks

>
> Doug.

--
Sauron is alive in Argentina!
From: Alex Samad on
On Mon, May 05, 2008 at 09:33:19PM -0400, Douglas A. Tutty wrote:
> On Tue, May 06, 2008 at 07:10:34AM +1000, Alex Samad wrote:
> >

[snip]

> > ?(if so how)
>
> First, ensure you have good recent backups.
>
> Then install smartmontools and run a long S.M.A.R.T. test and check the
> results.
>
> Why did you recently buy new drives? If it was because old drives
> failed, they may have damaged the controller and you're seeing
> controller failure instead of drive failure.

Upgraded drives, 500G -> 1TB. This is one drive of a RAID1 md, and I have
onsite and offsite backups.

I just want to make sure it's sdb, not sda.

alex

>
> Doug.

--
Von Neumann was the subject of many dotty professor stories. Von Neumann
supposedly had the habit of simply writing answers to homework assignments on
the board (the method of solution being, of course, obvious) when he was asked
how to solve problems. One time one of his students tried to get more helpful
information by asking if there was another way to solve the problem. Von
Neumann looked blank for a moment, thought, and then answered, "Yes.".
From: Douglas A. Tutty on
On Tue, May 06, 2008 at 06:08:47PM +1000, Alex Samad wrote:
> On Mon, May 05, 2008 at 09:33:19PM -0400, Douglas A. Tutty wrote:
> > On Tue, May 06, 2008 at 07:10:34AM +1000, Alex Samad wrote:
> > >
>
> [snip]
>
> > > ?(if so how)
> >
> > First, ensure you have good recent backups.
> >
> > Then install smartmontools and run a long S.M.A.R.T. test and check the
> > results.
> >
> > Why did you recently buy new drives? If it was because old drives
> > failed, they may have damaged the controller and you're seeing
> > controller failure instead of drive failure.
>
> Upgraded drives, 500G -> 1TB. This is one drive of a RAID1 md, and I have
> onsite and offsite backups.
>
> I just want to make sure it's sdb, not sda.

You snipped so much, I forget what the problem was. IIRC, you had drive
errors showing up in syslog on an ata controller and you didn't know
which drive was the culprit.

Since it's RAID, see what the RAID status is. Does it show an active
RAID with two copies synced, or does it show degraded status? mdadm
should email you about a problem, but it doesn't hurt to check.
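
A rough sketch of that check in Python, assuming the usual /proc/mdstat layout in which each array's status line carries a [configured/active] pair such as [2/2]. The sample text is made up for illustration and is not output from the machine in this thread; the function name is also just an example.

```python
import os
import re


def degraded_arrays(mdstat_text):
    """Return {array: (configured, active)} for arrays missing a member."""
    degraded = {}
    current = None
    for line in mdstat_text.splitlines():
        header = re.match(r"^(md\S+)\s*:", line)
        if header:
            current = header.group(1)
            continue
        counts = re.search(r"\[(\d+)/(\d+)\]", line)
        if counts and current:
            configured, active = int(counts.group(1)), int(counts.group(2))
            if active < configured:
                degraded[current] = (configured, active)
            current = None  # ignore later lines (resync progress etc.)
    return degraded


# Illustrative sample only.
SAMPLE = """\
Personalities : [raid1]
md0 : active raid1 sdb1[1] sda1[0]
      976759936 blocks [2/2] [UU]
md1 : active raid1 sda2[0]
      1951744 blocks [2/1] [U_]
unused devices: <none>
"""

if __name__ == "__main__":
    if os.path.exists("/proc/mdstat"):
        with open("/proc/mdstat") as handle:
            print(degraded_arrays(handle.read()))
    else:
        print(degraded_arrays(SAMPLE))
```

For a closer look at a single array, `mdadm --detail /dev/md0` (array name assumed) lists each member and its state.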

If you have a third hard drive as a spare, you could add it to the
array, let it sync, then remove one of the original drives and see if
the error goes away.
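
As a dry-run sketch only, here is one way that swap could be laid out with mdadm, written in Python so the commands can be reviewed before anything is run. It assumes a two-member RAID1; the array and partition names (/dev/md0, /dev/sdc1, /dev/sdb1) are placeholders, and the helper name is made up. The script prints the commands and does not execute them.

```python
def spare_swap_plan(array, spare, suspect):
    """Return mdadm commands, in order, to mirror onto `spare` and drop `suspect`.

    Assumes a 2-device RAID1. Wait for the resync to finish (watch
    /proc/mdstat) after step 2 before running step 3.
    """
    return [
        # 1. Add the new disk; on a full array it joins as a spare.
        ["mdadm", array, "--add", spare],
        # 2. Grow to three active mirrors so the spare is synced.
        ["mdadm", "--grow", array, "--raid-devices=3"],
        # 3. Mark the suspect member failed, then remove it.
        ["mdadm", array, "--fail", suspect],
        ["mdadm", array, "--remove", suspect],
        # 4. Shrink back to a two-way mirror.
        ["mdadm", "--grow", array, "--raid-devices=2"],
    ]


if __name__ == "__main__":
    # Placeholder names; check yours against /proc/mdstat first.
    for cmd in spare_swap_plan("/dev/md0", "/dev/sdc1", "/dev/sdb1"):
        print(" ".join(cmd))
```

If the errors stop once the suspect drive is out, the drive is the likely culprit; if they follow the bay, look at the cable or controller instead.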

Doug.
