Researchers introduce BODHI, a domain knowledge prompting technique that boosts LLM accuracy in generating formal OS kernel specifications from 55% to as high as 96.73% on the OSV-Bench benchmark. The method augments standard prompts with a structured C-to-Python translation guide covering 15 domain-specific patterns, improving performance across nine models from six major AI providers with gains ranging from 11% to 32%.
Why it matters: As formal verification becomes critical for secure OS development, this breakthrough demonstrates that domain-specific prompt engineering can make LLMs practical for specialized technical tasks that previously required expert manual effort.