Verkor Introduces VerTQ, TurboQuant Accelerator for LLM Inference

May 20, 2026

Verkor has launched VerTQ, a silicon IP accelerator implementing Google's TurboQuant algorithm to reduce large language model memory usage by over four times while maintaining performance. The chip was autonomously designed by Verkor's Conductor 2.0 AI platform and targets edge AI applications.

Verkor has introduced VerTQ, described as the industry's first TurboQuant accelerator silicon IP, announced in a press release. The technology implements the TurboQuant algorithm developed by Google Research, which cuts key-value cache memory usage in large language models by a factor of 4.3 while maintaining or improving performance.

VerTQ compresses KV data and accelerates attention operations, including Flash Attention and online SoftMax, directly on-chip without decompressing data. This approach reduces memory bandwidth requirements and increases inference efficiency, especially for applications where memory is limited.

The chip was built autonomously by Verkor's Conductor 2.0 AI platform using standard electronic design automation tools. The process took about 80 hours from algorithm to a verified FPGA implementation. Mapped to a Xilinx FPGA running at 125 MHz, VerTQ supports between one and thirty-two attention decoders.

VerTQ is designed for edge AI systems such as autonomous vehicles, drones, and robots, where compact design, low power use, and cost efficiency are key. The VerTQ customer package includes specifications, verification IP, testbenches, and FPGA images, and the product is available now.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like Silicon Brief or Daily AI Brief.

Also, consider following us on social media:

AI Chips & Datacenters AI Brief AI Brief (X)

More from: Data Centers

07/09

Parasail Combines NVIDIA GPUs with D-Matrix Accelerators for Faster Inference

07/09

Architect and Compute Desk Launch Compute Exchange for GPU Capacity

07/07

Ceva Signs AI Licensing Deal with Major US Software Company

07/07

Nscale Secures $900 Million Credit Facility for Global AI Data Center Expansion

07/07

RiverMeadow Integrates with AWS Transform for Automated Cloud Modernization

Subscribe to Silicon Brief

Weekly coverage of AI hardware developments including chips, GPUs, cloud platforms, and data center technology.

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

ModelOp

The 2025 AI Governance Benchmark Report by ModelOp provides insights from 100 senior AI and data leaders across various industries, highlighting the challenges enterprises face in scaling AI initiatives. The report emphasizes the importance of AI governance and automation in overcoming fragmented systems and inconsistent practices, showcasing how early adoption correlates with faster deployment and stronger ROI.

Categories

Companies

Resources

Verkor Introduces VerTQ, TurboQuant Accelerator for LLM Inference

We hope you enjoyed this article.

More from: Data Centers

Subscribe to Silicon Brief

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

You May Also Like

Dnotitia Unveils STAR-KV Compression Method Selected as ICML 2026 Spotlight Paper

Nuvoton and Qualcomm Collaborate on SoC Solutions for XR Glasses

OpenAI and Broadcom Reveal Jalapeño Inference Chip for LLMs

Qualcomm to Acquire AI Software Firm Modular

Verkada Partners with NVIDIA to Advance Physical AI Platform

Qualcomm and Meta Sign Multi-Generation Data Center CPU Agreement

MemryX Expands Cascade Platform with New Edge AI Accelerators

DapuStor Highlights AI-Optimized SSD Portfolio at Computex 2026

X Square Robot Launches QUANXTA Zero Series for Embodied Data Production

Teradar Begins Paid Evaluation with Major German Automaker for Terahertz Vision Sensors

OPAQUE Introduces OPAQUE 3.0 with Verifiable AI Governance and Post-Quantum Security

Neurometric Launches Automated Token Engineering Platform and Raises $4 Million