Cisco says manufacturing errors are to blame for flaws in its 16GB, 32GB, and 64GB dual in-line memory modules (DIMM). Credit: Larry White / Pixabay Cisco is urging customers to replace flawed memory sticks in some of its Unified Computing System (UCS) servers before they fail. The problem is caused by a manufacturing error in 24 dual in-line memory modules (DIMM) that exhibit persistent correctable memory errors that if left in place could knock the servers offline. The problem is found in 16GB, 32GB, and 64GB memory DIMMs. Cisco describes the flaws as manufacturing deviations that affect memory modules used to make up the DIMMs. All of the problem parts were manufactured during the middle-to-end of 2020, according to a Cisco alert. A symptom of the problem is that the DIMMs will exhibit persistent correctable memory errors. “If left untreated, the DIMMs might eventually encounter an uncorrectable memory event. If encountered during runtime, uncorrectable errors will cause a sudden unexpected server reset. If encountered during Power-On Self-Test (POST), the DIMM will be mapped out and the total available memory reduced. In some cases a boot error might be seen,” the alert states. The company noted that operating system features and memory Reliability, Availability and Serviceability (RAS) features might mask the extent of the correctable errors, so customers are advised not to judge their exposure based on a lack of error reports. Instead, they should check whether the serial number of the suspect part has been flagged. The process is described in the Cisco alert, which lists the potentially faulty products. Replacement parts are available from Cisco. Cisco did not identify the maker of the defective memory modules, and declined to answer my query as well. The only thing it would say is that the memory was manufactured in mid to late 2020. However, SK Hynix, the South Korean memory maker that does manufacture memory modules used in Cisco UCS servers admitted to manufacturing problems during its most recent earnings call. During that call, an unidentified company representative stated that if changed its manufacturing process beginning in mid-2020 with some unintended side effects. “Some of the products that were produced at this particular time had been reportedly suffering some quality degradation since about one year ago. So we have been receiving reports of them sometime in the middle of last year,” the unidentified representatives said. Related content news AMD holds steady against Intel in Q1 x86 processor shipments finally realigned with typical seasonal trends for client and server processors, according to Mercury Research. By Andy Patrizio May 22, 2024 4 mins CPUs and Processors Data Center news Broadcom launches 400G Ethernet adapters The highly scalable, low-power 400G PCIe Gen 5.0 Ethernet adapters are designed for AI in the data center. By Andy Patrizio May 21, 2024 3 mins CPUs and Processors Networking news HPE updates block storage services The company adds new storage controller support as well as AWS. By Andy Patrizio May 20, 2024 3 mins Enterprise Storage Data Center news ZutaCore launches liquid cooling for advanced Nvidia chips The HyperCool direct-to-chip system from ZutaCore is designed to cool up to 120kW of rack power without requiring a facilities modification. By Andy Patrizio May 15, 2024 3 mins Servers Data Center PODCASTS VIDEOS RESOURCES EVENTS NEWSLETTERS Newsletter Promo Module Test Description for newsletter promo module. Please enter a valid email address Subscribe