Exabase M-1 Tops LongMemEval Benchmark Using Smaller, Cheaper Model

May 26, 2026

Exabase announced that its M-1 memory engine achieved a record 96.4% accuracy on the LongMemEval benchmark, outperforming other systems while using a smaller and less expensive model.

Exabase that its memory engine M-1 achieved the highest reported score on the LongMemEval benchmark for conversational AI memory. The system scored 96.4 percent accuracy at top-50 retrieval, surpassing previous leaders such as Mem0, Honcho, HydraDB, and Supermemory.

M-1 reached this result using Gemini 3 Flash, a model four to six times cheaper and faster than Gemini 3 Pro, which powered competing systems. The benchmark evaluates conversational memory across 500 questions and over 115,000 tokens, testing recall, reasoning, and knowledge update capabilities.

The M-1 retrieval architecture was developed with Hyperplane Labs and is based on principles from episodic memory theory and reconstructive recall. The system is already in production, supporting memory and search in the Fabric AI workspace, which has more than 300,000 users. Developers can access the memory API through the Exabase platform.

We hope you enjoyed this article.

Consider subscribing to one of our newsletters like Daily AI Brief.

Also, consider following us on social media:

AI Brief AI Brief (X)

Subscribe to Daily AI Brief

Daily report covering major AI developments and industry news, with both top stories and complete market updates

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

ModelOp

The 2025 AI Governance Benchmark Report by ModelOp provides insights from 100 senior AI and data leaders across various industries, highlighting the challenges enterprises face in scaling AI initiatives. The report emphasizes the importance of AI governance and automation in overcoming fragmented systems and inconsistent practices, showcasing how early adoption correlates with faster deployment and stronger ROI.

Categories

Companies

Resources

Exabase M-1 Tops LongMemEval Benchmark Using Smaller, Cheaper Model

We hope you enjoyed this article.

Subscribe to Daily AI Brief

Market report

AI’s Time-to-Market Quagmire: Why Enterprises Struggle to Scale AI Innovation

You May Also Like