Channel: PowerEdge General HW Forum - Recent Threads

Uncorrectable ECC memory error


Hi


We have a T605 server (2009 vintage) running Windows Server 2003 that started crashing today.  The server has two quad-core Opterons with 1GB ECC DIMMs in slots A1, A2, B1 and B2 (giving 4GB total, with 2GB "local" to each processor socket).


Loading the diagnostic utility showed an error message saying there was an uncorrectable ECC error affecting DIMM slots A1 & A2.

I first tried removing and then re-seating the memory sticks in DIMM slots A1 & A2.  This didn't work and the server crashed again when starting Windows.

I then ran the memory diagnostic from the BIOS utility menu (express version).  The diagnostic completed without any errors, but the server again crashed when trying to boot into Windows.

To see if the memory stick(s) themselves were the problem, I removed both DIMMs from slots A1 & A2, and took the DIMM from B2 and put it in A1.  The server again crashed on startup, and this time the logged error message said there was an ECC error affecting slot A1 only.

Finally, I put the DIMM from A1 back in B2 where it came from, and left all of socket A's memory slots unpopulated.  The server then booted into Windows normally and has been up for several hours since.

So, it looks like the problem isn't the memory sticks themselves.  Maybe it's a motherboard issue, or even a memory controller problem on the processor.
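
For anyone following the reasoning, here is a rough sketch of the swap-test logic in Python. It is only an illustration of the deduction, not a diagnostic tool: the DIMM labels (`dimm1` to `dimm4`) and the function names are made up for the example, and the trials are just the three distinct configurations described above.

```python
from typing import Dict, List, Set, Tuple

# One trial = (slot -> DIMM label, set of slots that reported an ECC error).
# The DIMM labels are hypothetical; the post does not identify individual sticks.
Trial = Tuple[Dict[str, str], Set[str]]

trials: List[Trial] = [
    # Original layout: errors reported against A1 and A2.
    ({"A1": "dimm1", "A2": "dimm2", "B1": "dimm3", "B2": "dimm4"}, {"A1", "A2"}),
    # A1/A2 sticks removed, the B2 stick moved into A1: error against A1 only.
    ({"A1": "dimm4", "B1": "dimm3"}, {"A1"}),
    # B2 stick returned, socket A slots left empty: no errors, server boots.
    ({"B1": "dimm3", "B2": "dimm4"}, set()),
]


def clean_dimms(trials: List[Trial]) -> Set[str]:
    """DIMMs that ran in at least one trial without their slot reporting an error."""
    clean: Set[str] = set()
    for layout, bad_slots in trials:
        for slot, dimm in layout.items():
            if slot not in bad_slots:
                clean.add(dimm)
    return clean


def slots_failing_with_clean_dimm(trials: List[Trial]) -> Set[str]:
    """Slots that reported an error while holding a DIMM that ran clean elsewhere."""
    clean = clean_dimms(trials)
    suspects: Set[str] = set()
    for layout, bad_slots in trials:
        for slot in bad_slots:
            if layout.get(slot) in clean:
                suspects.add(slot)
    return suspects


print(sorted(clean_dimms(trials)))
print(sorted(slots_failing_with_clean_dimm(trials)))
```

On these trials only A1 is directly implicated, because it is the only slot that failed with a stick that worked elsewhere. A2 and the two original socket A sticks were never tested in another position, so the swap test says nothing definite about them, and it cannot tell a bad slot apart from a board or memory controller fault on that side.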

Can anyone suggest what else might cause this problem, and what else I can do to troubleshoot?

Thanks.

