专注于分布式系统架构AI辅助开发工具(Claude
Code中文周刊)

Claude Wins Hallucination Test: Outperforms GPT and Gemini

智谱 GLM,支持多语言、多任务推理。从写作到代码生成,从搜索到知识问答,AI 生产力的中国解法。

On the Linux.do forum, a user conducted a web search capability test on mainstream AI models Claude, GPT, and Gemini, evaluating hallucination rates for questions with scarce information sources. The results showed that Claude Sonnet 4.5 performed best with a 0% hallucination rate, obtaining correct information in just three search rounds; GPT 5.2 had a 70% hallucination rate with low search efficiency; Gemini 3 Pro had a hallucination rate exceeding 90% with poor search results. The author emphasized that Claude is far ahead in tool usage capabilities, such as project management and file operations, and has switched from GPT to Claude as their primary tool. The article calls on AI companies to strengthen tool integration, enhance productivity, and break through model bottlenecks. This test provides practical reference for AI users, revealing performance differences and future development directions among models.

Original Link:Linux.do

赞(0)
未经允许不得转载:Toy Tech Blog » Claude Wins Hallucination Test: Outperforms GPT and Gemini
免费、开放、可编程的智能路由方案,让你的服务随时随地在线。

评论 抢沙发

十年稳如初 — LocVPS,用时间证明实力

10+ 年老牌云主机服务商,全球机房覆盖,性能稳定、价格厚道。

老品牌,更懂稳定的价值你的第一台云服务器,从 LocVPS 开始