Novita AI and SGLang Partner to Enhance AI Inference
Novita AI has announced a strategic partnership with SGLang to enhance AI inference capabilities, announced in a press release. Novita AI will supply high-performance GPU cloud resources to support SGLang's research, benchmarking, and optimization efforts.
SGLang is known for its fast-serving engine for large language and vision-language models, which includes innovations like RadixAttention cache reuse and zero-overhead batch scheduling. This collaboration aims to empower developers to build complex generation workflows and multi-modal applications with improved reliability and scale.
Novita AI's contribution has already facilitated the development of SGLang's multi-turn reinforcement learning framework and the Prism multi-large language model serving system. The partnership also involves collaboration on SGLang's large-scale expert parallelism project, an open-source initiative to meet throughput benchmarks outlined in the DeepSeek blog.
This partnership underscores Novita AI's commitment to advancing an open ecosystem of inference engines and supporting diverse research initiatives through shared infrastructure and joint development efforts.
We hope you enjoyed this article.
Consider subscribing to one of several newsletters we publish like Silicon Brief.
Also, consider following us on social media:
More from: Chips & Data Centers
Marvell Introduces Advanced Packaging for AI Accelerators
MinIO AIStor Integrates AWS S3 Express API for Enhanced AI Workloads
EdgeMode Acquires Synthesis Analytics to Boost AI Data Center Capabilities
Strider and SCSP Report Highlights China's AI Infrastructure Expansion
Subscribe to Silicon Brief
Weekly coverage of AI hardware developments including chips, GPUs, cloud platforms, and data center technology.
Market report
AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation
The 2025 AI Governance Benchmark Report by ModelOp provides insights from 100 senior AI and data leaders across various industries, highlighting the challenges enterprises face in scaling AI initiatives. The report emphasizes the importance of AI governance and automation in overcoming fragmented systems and inconsistent practices, showcasing how early adoption correlates with faster deployment and stronger ROI.
Read more