§ 01

News

What happened in AI today

Sources: OpenAI / Anthropic / DeepMind / Moonshot / arXiv and other public RSS feeds. Updated twice daily at 06:00 / 18:00.

May 29, 2026 · Kimi
1.46.0
What's Changed
docs: announce evolution to Kimi Code successor project by @RealKai42 in #2377
docs: fix router auto language redirect by @RealKai42 in #2378
feat(shell): update welcome tip link to kimi.com/code and support styled Text by @jackfish212 in #2390
fix(acp): replay session history on load by @bugkeep in #21…
May 29, 2026 · arXiv cs.AI
VFEAgent: A Multimodal Agent Framework for End-to-End Automated Finite Element Analysis
Finite Element Analysis (FEA) serves as the cornerstone of modern engineering design. However, its workflow is inherently complex and relies heavily on domain expertise. Although recent efforts have integrated Large Language Models (LLMs) into FEA, existing approaches face limitations in handling multimodal inputs and…
May 29, 2026 · arXiv cs.AI
Frontier LLM-based agents can overcome the ontology curation bottleneck for natural phenotypes
Linking free-text phenotype descriptions to ontology terms, typically referred to as phenotype annotation, is essential for the cross-study integration of comparative morphological data. This labor intensive process has heavily relied on highly trained human experts, which makes it challenging to scale and thus a key…
May 29, 2026 · arXiv cs.AI
Orthogonal Concept Erasure for Diffusion Models
Concept erasure has emerged as a promising approach to mitigate undesired or unsafe content in diffusion models, yet existing methods still face significant limitations. While training-based methods are effective, their high computational cost limits scalability. Editing-based methods are more efficient and deployment…
May 29, 2026 · arXiv cs.AI
Behavior-Induced Mirror-Prox Temporal-Difference Learning for Faster Off-Policy Prediction
Gradient temporal-difference methods provide stable off-policy prediction with linear function approximation, but their practical performance is strongly affected by the geometry induced by the auxiliary-variable metric. Existing Mirror-Prox TD methods typically use the feature covariance metric, whereas hybrid TD met…
May 29, 2026 · arXiv cs.AI
Review Arcade: On the Human Alignment and Gameability of LLM Reviews
LLM-generated reviews for scientific papers are gaining considerable traction and are even being officially piloted by major conferences. We have to assume that not only reviewers are using LLM-assistance, but also that authors use LLMs to revise their papers before submitting. In this work, we perform empirical exper…
May 29, 2026 · arXiv cs.AI
Ultra-Reduced-Impact-Encased-Logging (URIEL): propose a new method for selective sustainable logging and post-harvest silvicultural treatment in tropical forest using airborne robotics systems
Tropical forests worldwide are under intense deforestation pressure driven by economic and political interests, and scientific evidence suggests this deforestation contributes to climate change. This paper proposes a novel logging method for tropical forests, Ultra-Reduced-Impact-Encased-Logging (URIEL). This new meth…
May 29, 2026 · arXiv cs.AI
Behavior-Aware Auxiliary Corrections for Off-Policy Temporal-Difference Prediction
Temporal-difference learning with function approximation can be unstable under off-policy sampling. TDC stabilizes off-policy TD through an auxiliary covariance correction, and TDRC further regularizes this correction in a single-timescale recursion. This paper studies a behavior-aware replacement of the auxiliary cov…
May 29, 2026 · arXiv cs.AI
The Cognitive Categorical Transformer: Category-Theoretic Inductive Biases for Language Modeling
The Cognitive Categorical Transformer (CCT) is a 306M-parameter architecture that augments a pretrained GPT-2 Small backbone with cognitively grounded components derived from category theory and several inspirations from cognitive science. Under a matched-step protocol (215,000 optimizer steps, matched data, matched o…
May 28, 2026 · Hacker News (AI)
Sam Altman and Dario Amodei are both walking back AI jobs apocalypse predictions
May 28, 2026 · Hacker News (AI)
Various LLM Smells
May 28, 2026 · Hacker News (AI)
Dynamic Workflows in Claude Code
May 28, 2026 · Hacker News (AI)
Show HN: Continue? Y/N: A 60-second game about AI agent permission fatigue
May 28, 2026 · OpenAI
How Endava builds an agentic organization with Codex
Learn how Endava uses Codex to build an agentic organization, accelerating software delivery and reducing requirements analysis from weeks to hours.
May 28, 2026 · Hacker News (AI)
AI sticker shock hits corporate America
May 28, 2026 · Hacker News (AI)
A Eureka machine that thinks like nature and explores what AI cannot
May 28, 2026 · arXiv cs.AI
Discovery Agents for Real-Time Analytics: Toward Proactive Insight Systems
Modern analytics systems are fundamentally reactive, requiring users to define queries over increasingly complex and continuously evolving data. In real-time streaming environments, this paradigm breaks down, as the space of potential insights becomes too large to enumer…
May 28, 2026 · arXiv cs.AI
Identifying and Understanding Human Values in Text: A Tailorable LLM-based Architecture
As intelligent systems become more autonomous, the scientific community focuses on creating decision-making mechanisms that include ethical and moral considerations, unlike traditional utility-maximisation models. To achieve this, a key aspect is assessing how well these…
May 28, 2026 · arXiv cs.AI
Soro: A Lightweight Foundation Model and Chatbot for Tajik
We present Soro, a family of Tajik-specialized conversational large language models (LLMs) designed for real-world deployment under tight compute and connectivity constraints in Tajikistan. Starting from open-weight Gemma 3 checkpoints, we perform Tajik-only continual pr…
May 28, 2026 · arXiv cs.AI
On the Origin of Synthetic Information by Means of Steganographic Inheritance
The origin of species has been the mystery of mysteries in natural science. By analogy, the origin of synthetic information, we suggest, is the mystery of mysteries in information science. The question carries a moral weight that a technical account can neither fully res…
May 28, 2026 · arXiv cs.AI
DynaSchedBench: Calibrated Dynamic Scheduling Benchmarks and Observability Paradox in LLM-based Scheduling Agents
Progress in neural combinatorial optimization for Dynamic Flexible Job Shop Scheduling Problem (DFJSP) is currently hindered by a methodological tension: static benchmarks encourage benchmark overfitting, while uncalibrated generators obscure algorithmic capability with…
May 28, 2026 · arXiv cs.AI
Why LLMs Fail at Causal Discovery and How Interventional Agents Escape
Causal discovery is a cornerstone of scientific reasoning, yet whether large language models can perform it reliably remains an open question. Recent benchmarks show that even fine-tuned models plateau on simple causal graphs and degrade as complexity grows, but why they…
May 28, 2026 · arXiv cs.AI
RULER: Representation-Level Verification of Machine Unlearning
Machine unlearning aims to remove the influence of specific training records from a deployed model without retraining from scratch. Current protocols verify this at the output level through membership inference, retain accuracy, and forget-set accuracy, but a model can s…
May 28, 2026 · arXiv cs.AI
LaneRoPE: Positional Encoding for Collaborative Parallel Reasoning and Generation
Parallel LLM test-time scaling techniques (e.g., best-of-$N$) require drawing $N>1$ sequences conditioned on the same input prompt. These methods boost accuracy while exploiting the computational efficiency of batching $N$ generations. However, each sequence in the batch…
May 28, 2026 · OpenAI
OpenAI’s Frontier Governance Framework
Explore OpenAI’s Frontier Governance Framework and how our AI safety, security, and risk practices align with emerging EU and California regulations.
May 28, 2026 · OpenAI
MUFG aims to become AI-native with OpenAI
MUFG uses ChatGPT Enterprise to build an AI-native organization, improve workflows, and deliver new AI-powered financial services at scale.
May 28, 2026 · Anthropic
Introducing Claude Opus 4.8
May 28, 2026 · Anthropic
Anthropic raises $65B in Series H funding at $965B post-money valuation
May 27, 2026 · Hacker News (AI)
YouTube to automatically label AI-generated videos
May 27, 2026 · Hugging Face
ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM
May 27, 2026 · Hacker News (AI)
DuckDuckGo search saw 28% more visits after Google said people love AI mode
May 27, 2026 · Hacker News (AI)
Training our own AI models
May 27, 2026 · Hacker News (AI)
Tech CEOs are apparently suffering from AI psychosis
May 27, 2026 · OpenAI
Cisco and OpenAI redefine enterprise engineering with Codex
Cisco and OpenAI are redefining enterprise engineering with Codex, helping Cisco scale AI-native development, accelerate AI Defense work, and automate defect remediation.
May 27, 2026 · Hacker News (AI)
I'm Tired of Talking to AI
May 27, 2026 · OpenAI
Building self-improving tax agents with Codex
See how OpenAI, Thrive, and Crete built a self-improving tax agent with Codex, automating filings, improving accuracy, and accelerating workflows.
May 27, 2026 · Hacker News (AI)
Claude Code as a Daily Driver: Claude.md, Skills, Subagents, Plugins, and MCPs
May 27, 2026 · Kimi
1.45.0
What's Changed
fix(shell): Fix misleading "Quota exceeded" prefix shown on every 403 error by @liruifengv in #2342
feat(toolset): improve dedup with sparse reminders and canonical args by @jackfish212 in #2372
chore(release): bump kimi-cli to 1.45.0 by @jackfish212 in #2373
Full Changelog: 1.44.0...1.45.0
May 27, 2026 · Hugging Face
Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL
May 27, 2026 · Hugging Face
Reachy Mini goes fully local
May 27, 2026 · OpenAI
Election information and safeguards in 2026
Ahead of global elections, we’re helping people access information, supporting cyber defenders, and increasing AI transparency
May 27, 2026 · Anthropic
Anthropic opens Milan office to support Italian enterprise, research, and developers
May 27, 2026 · OpenAI
Warp’s big bet on building open source with GPT-5.5
Warp uses GPT-5.5 and OpenAI models to coordinate coding agents across local, cloud, and open-source development workflows.
May 26, 2026 · Anthropic
Anthropic appoints KiYoung Choi as Representative Director of Korea ahead of Seoul office opening
May 25, 2026 · OpenAI
OpenAI, Grupo Folha and Grupo UOL announce strategic content partnership
OpenAI partners with Grupo Folha and Grupo UOL to bring trusted Brazilian journalism to ChatGPT, expanding access to news with attribution and transparency.
May 25, 2026 · Anthropic
Anthropic co-founder Chris Olah's remarks on Pope Leo XIV's encyclical "Magnifica humanitas"
May 25, 2026 · Hugging Face
Harness, Scaffold, and the AI Agent Terms Worth Getting Right
May 23, 2026 · Hugging Face
Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models
May 22, 2026 · Hugging Face
Specialization Beats Scale: A Strategic Variable Most AI Procurement Decisions Overlook
May 22, 2026 · OpenAI
OpenAI named a Leader in enterprise coding agents by Gartner
OpenAI is named a leader in the 2026 Gartner Magic Quadrant for Enterprise AI Coding Agents, with Codex recognized for innovation and enterprise-scale deployment.