
China has become the first country to approve an invasive brain-computer interface chip, which was surgically implanted in a paralyzed patient who regained the ability to write. Dong Hui, a 39-year-old man paralyzed from the neck down after a 2018 car accident, demonstrated the technology's capability by holding a pen and writing in his courtyard using only neural signals.
An OpenAI model has solved a longstanding mathematics problem that eluded human mathematicians for eight decades, demonstrating significant progress in AI's ability to tackle complex theoretical challenges. The breakthrough suggests AI systems are developing stronger reasoning and problem-solving capabilities in abstract mathematical domains.
Researchers have introduced EHRBench, an automated benchmark containing nearly 1 million question-answer items derived from real electronic health records, designed to evaluate how reliably large language models can support clinical decision-making tasks like diagnosis, treatment selection, and prognosis. The benchmark was constructed using an EHR-LLM-knowledge base pipeline that automatically converts patient encounter data into structured templates while filtering out hallucinations, and testing of 30+ LLMs reveals consistent capability gaps that highlight what work remains to make these systems clinically safe.
The Pentagon is accelerating the development and deployment of artificial intelligence systems for military combat operations, but senior military officials are raising concerns about the risks and ethical implications of autonomous weapons systems. The debate reflects broader tensions within defense leadership over how quickly AI should be integrated into warfare versus the need for safeguards and human oversight.
A new study from arXiv reframes hospital mechanism design as a code-synthesis problem, using language models and multi-agent simulation to test how providers respond strategically to payment incentives. Researchers demonstrate that common payment structures inadvertently encourage up-coding and cherry-picking of low-complexity patients, but their LLM-guided search discovered mixed-objective payment rules that eliminate up-coding while preserving financial viability.
A new research paper presents MAVEN, a lightweight symbolic reasoning framework designed to improve how AI agents generalize across different tool-calling environments through structured decomposition and adaptive tool coordination. The system boosts accuracy on a new stress-test benchmark from 48% to 71% without additional training, while remaining cost-competitive with proprietary models at roughly one-tenth the expense.
Nvidia has chosen Unitree, a Chinese robotics startup, as the hardware partner for its first publicly available humanoid robotics system. Unitree is simultaneously preparing for an IPO, marking a significant commercial validation of the startup's robotics platform.
Nvidia CEO Jensen Huang announced the company's first Arm-based processor designed for personal computers, marking the chipmaker's entry into the laptop market. The new chip will power upcoming devices from major manufacturers including Dell, Microsoft, HP, and ASUS, representing a significant competitive move against Intel and AMD's PC processor dominance.
Nvidia, the world's most valuable company, is pursuing a strategy to integrate its chips into laptops and desktops, directly challenging Intel and Apple's dominance in the PC processor market. The move positions Nvidia to capitalize on growing demand for AI agents running locally on consumer devices rather than relying solely on cloud-based services.
NVIDIA has launched Cosmos 3, an open-source omni-model designed to enable physical AI systems to understand and reason about the physical world. The model represents a significant step toward AI systems that can process multimodal inputs and predict physical outcomes, addressing a key gap in autonomous robotics and embodied AI applications.
The UK Home Office has announced a contract to deploy AI facial age estimation technology to assess disputed ages among young asylum seekers, a move that has prompted more than 100 refugee children's organizations to raise alarm. The coalition warns that the AI system could result in children being incorrectly classified as adults and placed in adult detention facilities, raising serious concerns about the accuracy and fairness of automated age determination.