DMind AI Study Finds No AI Model Ready for Web3 Safety Tasks

June 01, 2026

DMind AI, working with Zhejiang University and Nanyang Technological University, tested 31 major AI models and found none of them suitable for safety-critical Web3 use cases. The results will be presented at KDD 2026 in Jeju, Korea.

DMind AI, in collaboration with researchers from Zhejiang University and Nanyang Technological University, announced in a press release that its paper "DMind Benchmark: Toward a Holistic Assessment of LLM Capabilities across the Web3 Domain" has been accepted at KDD 2026. The conference will take place in Jeju, Korea, from August 9 to 13, 2026.

The DMind Benchmark evaluated 31 leading AI systems, including GPT-5, Claude, Gemini, DeepSeek, and Qwen, using 3,543 expert-level questions across Web3-related domains. The study found that no model is suitable for safety-critical applications in Web3. Performance was especially weak in areas such as security vulnerability detection and token economics reasoning.

According to the results, even the top-performing systems showed capability gaps that would be unacceptable in real-world Web3 auditing or governance scenarios. The research team concluded that reasoning remains a key limitation of current AI systems.

The dataset and evaluation toolkit used in the study are publicly available on Hugging Face at https://huggingface.co/datasets/DMindAI/DMind_Benchmark.




