Hi,
Over the past few days, three of our PE1950s in two separate data centers have reported:
Possible memory module event cause:Single bit warning error rate exceeded,Single bit failure error rate exceeded
We do have hundreds of these servers, but I think it is a bit of a strange coincidence for three of them to have the same issue within days of each other.
In each case it is a 4GB module that has been reported as having the issue. In one of the affected servers, DIMM 8 was reported as having the problem, so I swapped DIMMs 7 and 8 with a pair of known working 2GB DIMMs (not from the same server), as I didn't have any spare 4GB DIMMs to hand. The issue now appears to have moved to DIMM 4.
All of the servers are on BIOS 2.7.0. Two of the affected servers are G2 and the other is G3.
Does anyone have any suggestions as to what could be causing this strange coincidence and the behavior I've noticed while troubleshooting?
Thanks
James