GSI Technology’s Compute-in-Memory APU Matches GPU AI Performance with 98% Lower Energy Use
Sunnyvale-based GSI Technology, Inc. announced that its Gemini-I Associative Processing Unit (APU) achieved GPU-class performance for large-scale AI applications while consuming over 98% less energy, according to a press release.
The findings, published by Cornell University researchers and presented at the ACM Micro ’25 conference, showed that GSI’s compute-in-memory (CIM) architecture matched the throughput of NVIDIA’s A6000 GPU on retrieval-augmented generation (RAG) workloads. The APU also performed retrieval tasks several times faster than standard CPUs, reducing total processing time by up to 80%.
The Cornell-led team benchmarked the Gemini-I APU across datasets ranging from 10GB to 200GB, highlighting its efficiency gains due to high-density and high-bandwidth memory integration. The study also introduced an analytical framework for optimizing general-purpose compute-in-memory devices.
GSI noted that its second-generation Gemini-II silicon offers roughly 10 times faster throughput and lower latency for memory-intensive AI workloads, while a future platform named Plato is expected to deliver greater compute power for embedded edge applications.
We hope you enjoyed this article.
Consider subscribing to one of our newsletters like Silicon Brief or Daily AI Brief.
Also, consider following us on social media:
More from: Data Centers
Subscribe to Silicon Brief
Weekly coverage of AI hardware developments including chips, GPUs, cloud platforms, and data center technology.
Industry analysis
2025 Global Business Services Agenda: Gen AI Takes Center Stage
This industry analysis by The Hackett Group explores the transformative impact of generative artificial intelligence (Gen AI) on global business services (GBS) in 2025. The study highlights the shift from exploration to acceleration of Gen AI initiatives, with 89% of executives advancing these projects to improve customer satisfaction, innovate products, and reduce costs. The report also discusses the challenges and strategies for successful Gen AI adoption, emphasizing the need for a technology-enabled operating model and the importance of reskilling the workforce.
Read more