
Bloomberg Research Highlights Risks of RAG LLMs in Finance
Bloomberg researchers have published two new academic papers that highlight the risks associated with retrieval-augmented generation (RAG) large language models (LLMs) in financial services. The findings, announced in a press release, suggest that RAG-based LLMs, which integrate external data to enhance accuracy, may actually be less safe than their non-RAG counterparts.
In the paper titled "RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models," Bloomberg's AI team assessed the safety profiles of 11 popular LLMs using over 5,000 harmful questions. The study found that RAG settings led to increased unsafe responses, even in models that were previously considered safe.
A second paper, "Understanding and Mitigating Risks of Generative AI in Financial Services," introduces a new AI content risk taxonomy tailored to the financial sector. This taxonomy addresses specific risks such as confidential disclosure and financial services misconduct, which are not covered by general-purpose safety frameworks.
These findings underscore the importance of evaluating the safety of RAG-based systems, especially in high-stakes domains like finance, and suggest the need for additional safeguards to mitigate potential vulnerabilities.
We hope you enjoyed this article.
Consider subscribing to one of several newsletters we publish like AI Policy Brief.
Also, consider following us on social media:
More from: AI Safety
More from: Finance
KX Introduces AI Banker Agent for Global Markets
TD Bank Unveils AI Prism for Enhanced Customer Personalization
Farsight Secures $16M to Automate Financial Workflows
WorkFusion Recognized as Luminary in Generative AI for Financial Crime Compliance
BNP Paribas Launches LLM as a Service for Generative AI
Subscribe to AI Policy Brief
Weekly report on AI regulations, safety standards, government policies, and compliance requirements worldwide.
Market report
2025 Generative AI in Professional Services Report
This report by Thomson Reuters explores the integration and impact of generative AI technologies, such as ChatGPT and Microsoft Copilot, within the professional services sector. It highlights the growing adoption of GenAI tools across industries like legal, tax, accounting, and government, and discusses the challenges and opportunities these technologies present. The report also examines professionals' perceptions of GenAI and the need for strategic integration to maximize its value.
Read more