From: kaushal on
Hi,

I get the information below on the console of an Ubuntu Linux 8.04
server. I did some basic troubleshooting by replacing the RAM, but it did
not help. Please suggest further steps I can take to fix this issue.
#########################################################################################################################
ata 5.00 : exception Emask 0x1 SAct 0x0 SErr 0X0 action 0x0
ata 5.00 : CPB resp_flags 0x11 :, CMD error
ata 5.00 : cmd c8/00:88:6f:f2:01/00:00:00:00:00/e5 tag0 dma 69632
ata 5.00 : status : {DRDY ERR}
ata 5.00 : error : {UNC}
HARDWARE ERROR
CPU 1 : MACHINE CHECK EXCEPTION
4 BANK 4 : b200000000070f0f
TSC 24adf1489e
This is not a software problem !
Run through mce-log ascii to decode and contact your hardware vendor
HARDWARE ERROR
CPU 0 : MACHINE CHECK EXCEPTION
4 BANK 4 : b200000000070f0f
TSC 24adf15158
This is not a software problem !
Run through mce-log ascii to decode and contact your hardware vendor
Kernel Panic - not syncing Machine check
#########################################################################################################################
I replaced all the RAM chips with a new set and the system worked fine
for some time, but when I started the MySQL server it spewed out
the Machine Check Exception again. I tried to decode the MCE on a
test machine using the method below.
#cat error
CPU 1 : MACHINE CHECK EXCEPTION 4 BANK 4 : b200000000070f0f
TSC 24adf1489e
# /usr/sbin/mcelog --k8 --ascii < error
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 1 0 data cache TSC 24adf1489e
STATUS 0 MCGSTATUS 0
#cat error
CPU 0 : MACHINE CHECK EXCEPTION 4 BANK 4 : b200000000070f0f
TSC 24adf15158
# /usr/sbin/mcelog --k8 --ascii < error
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 0 0 data cache TSC 24adf15158
STATUS 0 MCGSTATUS 0
I am not sure what these errors mean. Please suggest further
steps.

Thanks,
Kaushal
From: philo on
On 05/11/2010 02:16 PM, kaushal wrote:
> I replaced all the RAM chips with a new set and the system worked fine
> for some time, but when I started the MySQL server it spewed out
> the Machine Check Exception again.

Just because you replaced the RAM doesn't mean it can't still be giving
you a problem.

For starters, I'd run an actual RAM test and also run the drive
manufacturer's HD diagnostic.
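
For what it's worth, the ata 5.00 "cmd" line from the original post can be picked apart by hand, which is part of why a disk diagnostic seems worthwhile. Below is a minimal Python sketch, assuming the usual libata field order (command/feature:nsect:lbal:lbam:lbah, then five "hob" bytes, then device). The function name decode_ata_cmd is made up for illustration, and the field order is an assumption rather than something stated in the log.

```python
import re

# Assumed libata layout of the "cmd" line:
#   cmd CC/FF:NS:L0:L1:L2/h:h:h:h:h/DD tagN dma BYTES
CMD_RE = re.compile(
    r"cmd\s+([0-9a-f]{2})/((?:[0-9a-f]{2}:){4}[0-9a-f]{2})/"
    r"((?:[0-9a-f]{2}:){4}[0-9a-f]{2})/([0-9a-f]{2})\s+tag\s*(\d+)\s+dma\s+(\d+)",
    re.IGNORECASE,
)


def decode_ata_cmd(line):
    """Split a libata 'cmd' line into its taskfile fields (28-bit LBA only)."""
    m = CMD_RE.search(line)
    if m is None:
        raise ValueError("no libata cmd fields found")
    command = int(m.group(1), 16)
    feature, nsect, lbal, lbam, lbah = (int(x, 16) for x in m.group(2).split(":"))
    device = int(m.group(4), 16)
    # In 28-bit LBA mode the top four address bits live in the device register.
    lba28 = ((device & 0x0F) << 24) | (lbah << 16) | (lbam << 8) | lbal
    return {
        "command": command,
        "sectors": nsect or 256,           # a count of 0 means 256 sectors
        "lba28": lba28,
        "lba_mode": bool(device & 0x40),   # bit 6 of the device register
        "dma_bytes": int(m.group(6)),
    }


if __name__ == "__main__":
    line = "ata 5.00 : cmd c8/00:88:6f:f2:01/00:00:00:00:00/e5 tag0 dma 69632"
    print(decode_ata_cmd(line))
```

Under those assumptions, 0xc8 is the ATA READ DMA opcode, and the sector count 0x88 (136 sectors of 512 bytes) matches the "dma 69632" byte count on the same line. Together with {UNC} on the error line, that would suggest an unreadable sector on the disk rather than a RAM fault.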
From: Clemens Ladisch on
kaushal wrote:
> I get the below information on the console of Ubuntu Linux 8.04
> server.
> #########################################################################################################################
> ata 5.00 : exception Emask 0x1 SAct 0x0 SErr 0X0 action 0x0
> ata 5.00 : CPB resp_flags 0x11 :, CMD error
> ata 5.00 : cmd c8/00:88:6f:f2:01/00:00:00:00:00/e5 tag0 dma 69632
> ata 5.00 : status : {DRDY ERR}
> ata 5.00 : error : {UNC}
> HARDWARE ERROR
> CPU 1 : MACHINE CHECK EXCEPTION
> 4 BANK 4 : b200000000070f0f
> TSC 24adf1489e

Plugging this number into AMD's MCAT tool gives this:

| mcat.exe /k8 /cmd 4 0xb200000000070f0f 0 0
| Processor Number : 0
| Bank Number : 4
| Time Stamp (0x): 00000000 00000000
| Error Status (0x): B2000000 00070F0F
| Error Address (0x): 00000000 00000000
| Error Misc. (0x): 00000000 00000000
| Status Bit Decode:
| Processor state corrupted by error
| Error reporting enabled
| Error not corrected
| Error valid
| Bus Error Code:
| Participation processor: Generic
| Time-out: Request timed out (TIMOUT)
| Memory transaction type: Generic error (GEN)
| I/O: Generic (GEN)
| Cache level: Generic (LG)
| North Bridge Error MC4:
| Extended Error Code: 0x7 - Watchdog Timeout Error
| Error Code: 0x0F0F
| Generic (GEN) Generic error (GEN), on Generic (LG) cache
| ...
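
The same status word can also be decoded by hand without the Windows tool. Here is a minimal Python sketch, assuming the AMD K8 machine check status layout (flag bits 63 down to 57, extended error code in bits 19:16, and a bus error code in the low 16 bits of the form 0000 1PPT RRRR IILL). The name decode_mc_status and the lookup tables are illustrative only and are not part of mcelog or MCAT.

```python
# Flag bits of the machine check status register (assumed K8 layout).
STATUS_FLAGS = [
    (63, "VAL"),    # error valid
    (62, "OVER"),   # error overflow
    (61, "UC"),     # error not corrected
    (60, "EN"),     # error reporting enabled
    (59, "MISCV"),  # misc register valid
    (58, "ADDRV"),  # address register valid
    (57, "PCC"),    # processor state corrupted by error
]

PARTICIPATION = {0: "SRC", 1: "RES", 2: "OBS", 3: "GEN"}
MEM_TRANSACTION = {0: "GEN", 1: "RD", 2: "WR", 3: "DRD", 4: "DWR",
                   5: "IRD", 6: "PREFETCH", 7: "EVICT", 8: "SNOOP"}
IO_TYPE = {0: "MEM", 1: "reserved", 2: "IO", 3: "GEN"}
CACHE_LEVEL = {0: "L0", 1: "L1", 2: "L2", 3: "LG"}


def decode_mc_status(status):
    """Decode the flag bits and, if present, the bus error code."""
    code = status & 0xFFFF
    result = {
        "flags": [name for bit, name in STATUS_FLAGS if (status >> bit) & 1],
        "ext_error_code": (status >> 16) & 0xF,
        "error_code": code,
    }
    # A bus error code has bits 15:11 equal to 00001.
    if (code & 0xF800) == 0x0800:
        result["bus_error"] = {
            "participation": PARTICIPATION[(code >> 9) & 0x3],
            "timeout": bool((code >> 8) & 0x1),
            "transaction": MEM_TRANSACTION.get((code >> 4) & 0xF, "unknown"),
            "io": IO_TYPE[(code >> 2) & 0x3],
            "cache_level": CACHE_LEVEL[code & 0x3],
        }
    return result


if __name__ == "__main__":
    print(decode_mc_status(0xB200000000070F0F))
```

For 0xb200000000070f0f this gives the flags VAL, UC, EN and PCC, extended error code 0x7, and a generic bus error with the timeout bit set, which agrees with the MCAT output above.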

> I replaced all the RAM chips with a new set and the system worked fine
> for some time, but when I started the MySQL server it spewed out
> the Machine Check Exception again.

Assuming that this is related to the preceding ATA error in the log,
this could indicate that the disk controller has locked up. Google says
that this can happen with certain buggy nVidia SATA controllers, but you
didn't tell us anything about the hardware in your system.


Regards,
Clemens
From: Ant on
> Plugging this number into AMD's MCAT tool gives this:
>
> | mcat.exe /k8 /cmd 4 0xb200000000070f0f 0 0
> | ...

Where can I get that tool? Is it similar to mcelog?
From: Clemens Ladisch on
Ant wrote:
> > | mcat.exe /k8 /cmd 4 0xb200000000070f0f 0 0
>
> Where can I get that tool?

http://lmgtfy.com/?q=amd+mcat

> Is it similar to mcelog?

Yes. Er, no, it doesn't run on Linux.


Regards,
Clemens