“9.11 and 9.9, which one is bigger?” Questions as simple as this confuse large language models including OpenAI’s GPT-4o, Moonshot’s Kimi, and ByteDance’s Doubao, according to a post by local media outlet Yicai. Chatbots from China’s Baidu and Tencent generated the correct answer, though by different methods: Baidu’s chatbot compared the fractional parts after concluding that the integer parts are the same, while Tencent’s Hunyuan concluded that 9.9 is the bigger number by computing that 9.11 minus 9.9 is negative. ChatGPT and Kimi, which both answered the first prompt incorrectly, gave the correct answer once users clarified that they meant “in terms of numerical value.” AI-powered chatbots are trained on internet data to chat with humans in a natural way so that they can perform text-based, knowledge-based tasks. [Yicai, in Chinese]
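The two approaches described above can be sketched in a few lines of Python. This is an illustration only, not how either chatbot works internally: the function names `compare_by_parts` and `compare_by_subtraction` are hypothetical, and the sketch assumes non-negative decimal numbers passed as strings.

```python
from decimal import Decimal


def compare_by_parts(a: str, b: str) -> str:
    """Return the larger number by comparing integer parts, then fractional parts.

    Assumes non-negative decimals written as strings, e.g. "9.11".
    """
    a_int, _, a_frac = a.partition(".")
    b_int, _, b_frac = b.partition(".")
    if int(a_int) != int(b_int):
        return a if int(a_int) > int(b_int) else b
    # Pad the shorter fractional part with zeros so that ".9" is read as ".90".
    width = max(len(a_frac), len(b_frac))
    a_pad = int(a_frac.ljust(width, "0") or "0")
    b_pad = int(b_frac.ljust(width, "0") or "0")
    return a if a_pad > b_pad else b


def compare_by_subtraction(a: str, b: str) -> str:
    """Return the larger number by checking the sign of a - b."""
    diff = Decimal(a) - Decimal(b)  # exact decimal arithmetic, no float rounding
    return b if diff < 0 else a


print(compare_by_parts("9.11", "9.9"))        # 9.9
print(compare_by_subtraction("9.11", "9.9"))  # 9.9
print(Decimal("9.11") - Decimal("9.9"))       # -0.79
```

The padding step is the crux of the first method: comparing the fractional digits as whole numbers without padding (11 versus 9) would suggest that 9.11 is bigger, which matches the wrong answer reported above.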