by Shuwen Deng on Mar 26, 2025 | Tags: Conferences
The 31st IEEE International Symposium on High-Performance Computer Architecture (HPCA’25) was held from March 1 to 5, 2025 in Las Vegas, USA. Co-located with PPoPP, CGO, and CC as part of the “Parallel Programming, Architecture and Compilation (PARC)” event, HPCA’25...
Read more...
by David Patterson on Mar 5, 2025 | Tags: AI accelerator, AI hardware, Carbon emissions, Carbon footprint, Data center
“It’s not that easy bein’ green.” – Kermit the Frog, 1976 We likely just published the first paper to report the carbon footprint of manufacturing AI accelerators. This life-cycle assessment in a real-world setting found that hardware operation emits more...
Read more...
by Yao Fu, Luo Mai and Dmitrii Ustiugov on Feb 28, 2025 | Tags: Cloud computing, generative ai, machine learning systems, serverless ai, serverless computing
Artificial Intelligence (AI) is revolutionizing the world, powering productivity tools, healthcare, and education innovations through large-scale models like ChatGPT, DeepSeek, Gemini, and Claude. Most of these models, managed by tech giants such as OpenAI, Google,...
Read more...
by Akanksha Atrey, Sercan Aygun, Udit Gupta, Abdulrahman Mahmoud, Lillian Pentecost, Vijay Janapa Reddi on Jan 28, 2025 | Introduction The increasing complexity of machine learning systems demands a new approach to their design, optimization, and deployment. While tremendous progress has been made in ML algorithms and hardware acceleration, building reliable, efficient, and scalable ML...
Read more...
by Geraldo Oliveira, Juan Gómez-Luna, Onur Mutlu on Jan 22, 2025 | Tags: Emerging Technology, Memory, Processing-in-Memory
Processing-in-Memory (PIM) is a computing paradigm that aims to overcome the data movement bottleneck (i.e., wasted execution cycles and energy, resulting from the back-and-forth data movement between memory units and compute units) by making memory (and storage)...
Read more...
by Shvetank Prakash and Vijay Janapa Reddi on Jan 7, 2025 | Tags: AI Agents, Benchmarks, Datasets, Machine Learning
Introduction The rise of large language models (LLMs) and generative artificial intelligence (GenAI) presents new opportunities to build innovative tools and is already enabling revolutionary AI-based tools in various domains. However, a significant gap remains in the...
Read more...