Moreh and SGLang Showcase Distributed Inference System on AMD
Moreh unveiled its distributed inference system on AMD at the AI Infra Summit 2025 and showcased its collaborations with Tenstorrent and SGLang, the company announced in a press release. At the event, held in Santa Clara from September 9 to 11, Moreh CEO Gangwon Jo presented benchmark results that, according to the company, demonstrate the system's efficiency in optimizing deep learning models such as DeepSeek, with performance surpassing NVIDIA's.
Moreh also introduced a next-generation AI semiconductor system that combines its software with Tenstorrent's hardware. The company co-hosted a presentation with SGLang and organized networking sessions to discuss the two parties' joint development of an AMD-based distributed inference system. The collaboration aims to accelerate the expansion of the deep learning inference market.