AI & RoboticsNews

When AI reasoning goes wrong: Microsoft Research shows more tokens can mean more problems

In a Nutshell Microsoft Research finds that inference-time scaling methods for large language models don’t universally improve performance. Varying benefits, token inefficiency, and cost unpredictability challenge assumptions. Verification mechanisms enhance model accuracy. Brute-force scaling has limits; conventional models can match reasoning models on simpler tasks but struggle with…
Read more
Cleantech & EV'sNews

Add Geely EX5 to the list of Chinese EVs earning a 5 star safety rating [video]

Volvo parent company Geely is the latest Chinese EV to be put through the ringer by the European NCAP and Australian ANCAP, and its EX5 electric crossover aced both tests scoring an all-important five-star safety rating. A common refrain around most American water coolers is that Chinese products aren’t as good as those made elsewhere – despite the fact that most of the guys saying that stuff…
Read more
CryptoNews

China Jails 9 in $6M Crypto Scam Hitting Indians

China cracks down hard on cross-border crypto fraud, jailing fraudsters in a major blow to digital crime targeting Indians. China Drops Heavy Prison Terms Over Cross-Border Crypto Fraud on Indians A wide-reaching crypto scam targeting Indian nationals has led to harsh prison terms for a group of Chinese fraudsters, following months of investigation and judicial proceedings. The case, which…
Read more
AI & RoboticsNews

Beyond ARC-AGI: GAIA and the search for a real intelligence benchmark

In a Nutshell Intelligence measurement in AI is evolving beyond traditional benchmarks like MMLU, with new tests like ARC-AGI and Humanity’s Last Exam focusing on real-world reasoning. The GAIA benchmark assesses practical AI capabilities across web browsing, code execution, and complex reasoning, setting a new standard for evaluating AI performance. Intelligence is pervasive, yet its…
Read more