
Researchers introduce BOHM, a hierarchical attribution method that extracts component importance directly from routing weights in compound AI systems, eliminating the need for expensive coalition evaluations required by Shapley-based methods like SHAP. BOHM achieves near-parity with SHAP on benchmarks (Kendall tau 0.928 vs 0.980) while requiring 9,000x fewer evaluations and working with opaque third-party APIs and agentic orchestrators where traditional methods fail.
Researchers introduce NeuroNL2LTL, a neurosymbolic system that translates natural language requirements into formal specifications while guaranteeing logical correctness—a capability previously requiring specialized expertise. Tested on 200,000+ requirements across aerospace, robotics, and autonomous vehicles, the system achieves 86% verified satisfiability by combining neural translation with formal verification as a training signal and runtime safety filter.
Researchers have developed Research Math Agents (RMA), an agentic system that tackles research-grade mathematical problems requiring long-horizon reasoning and literature integration—a significant step beyond competition math and formal theorem proving. Tested on the First Proof benchmark, RMA solved 8 of 10 expert-contributed research problems and outperformed baselines including GPT-5.2R, with gains driven by coordinated specialized modules for problem analysis, literature search, and proof verification working through iterative feedback.
A new large-scale knowledge graph called SciAtlas integrates over 43 million academic papers across 26 disciplines, using 157 million entities and 3 billion triplets to create a structured map of scientific knowledge. The system pairs this with a neuro-symbolic retrieval algorithm designed to enable AI agents to navigate complex logical connections across disciplines while reducing reasoning costs and hallucinations compared to traditional keyword or vector-based search.
A new framework called A-LEMS redefines how energy consumption is measured for agentic AI systems, shifting from energy-per-inference to Energy per Successful Goal (EpG) to account for multi-step orchestration, retries, and failures. Testing across eight task families revealed agentic workflows consume 888.1 joules per successful goal versus 205.3 joules for linear baselines, with orchestration structure—not raw compute—driving the overhead.
A new thesis transforms classical impossibility results—from Turing and No Free Lunch theorems—into concrete design specifications for AI systems, proving that transformer architectures hit an accuracy ceiling determined by layer count and embedding width alone, independent of training data or methods. The "Deterministic Horizon," measurable before deployment and found to range from 19 to 31 layers across twelve architectures, establishes sixteen computable boundaries where violations impose quantified costs, from auction mechanism failures to 110–190x overhead in zero-knowledge neural verification.
A survey finds that 99% of CEOs expect artificial intelligence to drive significant layoffs at their organizations over the next two years. The finding underscores widespread executive expectations that AI automation will reshape employment across industries, with leadership preparing for substantial workforce adjustments.
Researchers introduce ImProver 2, a neurosymbolic framework that trains a 7-billion-parameter model to automatically optimize formal mathematical proofs in Lean 4, outperforming much larger models in the same family. The framework combines expert-iteration pipelines with formal structure scaffolding and new metrics for measuring proof quality, demonstrating that smaller models can effectively restructure research-level proofs when properly trained.
Pope Leo has released a major theological document warning of artificial intelligence's potential to fuel warfare and destabilize global peace, marking the Vatican's most comprehensive statement on AI ethics. The encyclical, titled 'Magnifica humanitas,' calls for international disarmament efforts around AI technology and frames the issue as a matter of fundamental human dignity and religious concern.
Cybersecurity researchers warn that attackers are increasingly leveraging AI to develop exploits faster, forcing defenders to adopt AI-driven vulnerability detection tools to keep pace. The shift is reshaping how companies identify and patch software weaknesses before they can be exploited.
Huawei is developing a novel scaling approach aimed at circumventing the physical constraints of Moore's Law as the semiconductor industry reaches critical density limits. Major chipmakers including Samsung, TSMC, Qualcomm, MediaTek, and Intel have already begun mass production at the 2nm node and beyond, signaling an urgent industry-wide push past traditional transistor miniaturization barriers.