by Caroline Trippel on Aug 15, 2022 | Tags: Datacenters, Errors, Reliability, Testing
Hyperscalers are reporting frequent silent data corruptions (SDCs)—a.k.a. silent errors or corrupt execution errors (CEEs)—in their cloud fleets caused by silicon manufacturing defects. Notably, SDCs at-scale exhibit error occurrence rates on the order of one fault...				
				Read more...
				
					 
			
					
												
								
							
					
															
					
					 by Steve Swanson on Nov 7, 2017 | Tags: Errors, Memory, Persistent, Storage
Integrating non-volatile main memories (NVMMs) into the storage/memory hierarchy make data integrity a critical design consideration.  Protecting data in NVMM is a complex problem:  media errors and software bugs can corrupt data and the reliability of each memory...				
				Read more...