ECC RAM is a specialized type of computer memory that includes an extra chip to store checksums. Unlike standard non-ECC modules, it can detect and fix internal data corruption caused by electrical interference or cosmic rays in real-time, ensuring continuous system operation without crashes.
How it works
The memory utilizes checksum algorithms, typically Hamming code, to verify data integrity. When a 64-bit word is written, an 8-bit correction code is generated. During the read process, the memory controller compares the stored code with the calculated one. This mechanism provides:
- SEC (Single Error Correction): automatic correction of a single-bit error without interrupting the CPU;
- DED (Double Error Detection): detection of two-bit errors, triggering a system halt to prevent silent data corruption.
ECC is mandatory for servers, high-end workstations, and financial infrastructure where data integrity is paramount. It prevents system instability and corrupted file structures. Research indicates that memory modules experience up to five bit-errors per gigabyte per year, making ECC essential for maintaining high availability and protecting critical databases.