by Sai Srivatsa Bhamidipati on Mar 12, 2026 | Tags: Accelerators, deep neural networks, Machine Learning
The debate of sparsity versus quantization has made its rounds in the ML optimization community for many years. Now, with the Generative AI revolution, the debate is intensifying. While these might both seem like simple mathematical approximations to an AI researcher,...
Read more...
by Mark D. Hill on Jan 12, 2026 | Tags: Accelerators, Memory, Modelling
TL;DR: Latency-tolerant architectures, e.g., GPUs, increasingly use memory/storage hierarchies, e.g., for KV Caches to speed Large-Language Model AI inference. To aid codesign of such workloads and architectures, we develop the simple PipeOrgan analytic model for...
Read more...
by Jay Shah on Oct 8, 2024 | Tags: Accelerators, Machine Learning, Programming
General Matrix Multiplication (GEMM) is a fundamental operation in machine learning and scientific computing. It is the classic example of an algorithm that benefits greatly from GPU acceleration due to its high degree of data parallelism. More recently, efficient...
Read more...
by Daniel S. Berger, David Brooks, Fiodar Kazhamiaka, Mark D. Hill, Ricardo Bianchini, Carole-Jean Wu, Karin Strauss, Kali Frost, Jaylen Wang, Kevin Martins, Sharon Gillett, Esha Choukse, Dan Ernst, Rodrigo Fonseca, Kari Lio, Bhargavi Narayanasetty, Pratyush Patel, Celine Irvene, Akshitha Sriraman, George Porter, Alex Jones, Udit Gupta, Bilge Acun-Uyan, Kim Hazelwood, and Doug Carmean on Aug 3, 2023 | Tags: Accelerators, amd, apple, Architecture, Carbon emissions, Cloud computing, Datacenters, embodied carbon, intel, Measurements, nvidia, operational carbon, qualcomm, Sustainability, tsmc
A recent post raises awareness of the challenges of reducing operational carbon, while also controversially challenging the importance of embodied carbon. We rebut the arguments raised against using embodied carbon as a design metric and conclude by advocating for more research on reducing embodied carbon.
Read more...
by Andrew A. Chien on Jul 24, 2023 | Tags: Accelerators, amd, apple, Cloud computing, Datacenters, embodied carbon, intel, nvidia, operational carbon, qualcomm, Sustainability, tsmc
I. Embodied Carbon Recently, embodied carbon, defined as the Scope 3 GHG emissions that arise from the manufacturing processes that lead to computing electronics, has become popular as an architectural metric for sustainability. If you are considering it or using it,...
Read more...