AI-Pilled Daily

The AI world changes every day. I keep an eye on it for you.

§ 01

News

What happened in AI today
2026.06.06 · Hugging Face
Five labs, five minds: building a multi-model finance drama on small models
2026.06.06 · Hacker News (AI)
Meta confirms 1000s of Instagram accounts were hacked by abusing its AI chatbot
2026.06.06 · Hacker News (AI)
Police in England and Wales told to halt AI use in court statements
2026.06.06 · Hacker News (AI)
US House lawmakers release draft bill to prohibit state AI rules
2026.06.06 · arXiv cs.AI
How Far Did They Go? The Persuasive Tactics of Covert LLM Agents in a Discontinued Field Experiment
This study analyzes a publicly released dataset from a discontinued field experiment on Reddit's r/ChangeMyView. The intervention, conducted by unknown, external researchers and halted following ethical backlash, involved undisclosed AI-generated accounts engaging users in live debate. After public disclosure, Reddit…
2026.06.06 · arXiv cs.AI
What Should Agents Say? Action-state Communication for Efficient Multi-Agent Systems
Multi-agent systems (MAS) built on large language models are typically organized around roles, pipelines, and turn schedules, while the content that agents pass to one another is often left as unconstrained natural language. However, this free-form communication can rapidly inflate token usage, consume the shared cont…
2026.06.06 · arXiv cs.AI
I Know What You Meme, Even If it Emerged Today: Understanding Evolving Memes through Open-World Knowledge Acquisition
Multimodal memes are dynamic and often require up to date background knowledge for interpretation. Existing methods often overlook such knowledge or rely on fixed parametric knowledge of pretrained models that may be incomplete, outdated, or unavailable for emerging memes. We introduce Query Retrieve Conclude, a zero…
2026.06.06 · arXiv cs.AI
GITCO: Gated Inference-Time Context Optimization in TSFMs
Patch-based Time Series Foundation Models (TSFMs) suffer from context poisoning: structurally anomalous patches capture disproportionate attention and silently degrade zero-shot forecast quality. We propose improving TSFM accuracy at inference time by optimizing the input context rather than modifying model weights. W…
2026.06.06 · arXiv cs.AI
Uncertainty Aware Functional Behavior Prediction and Material Fatigue Assessment for Circular Factory
Returned products in circular factories re-enter production with heterogeneous degradation states, usage histories, and remaining capability. Reuse cannot be decided from the current inspection alone, because future function fulfillment and component integrity may evolve differently under the next service scenario. Ex…
2026.06.06 · arXiv cs.AI
SentinelBench: A Benchmark for Long-Running Monitoring Agents
AI agents are increasingly asked to carry out work that spans minutes, hours, or longer. Yet the default model of agent behavior is continuous action: issuing tool calls, refreshing pages, searching for alternatives, or otherwise trying to force progress. This is the wrong approach for many long-running tasks, which a…
§ 03

Perspectives

Voices of others
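Software 1.0 is handwritten code; Software 2.0 is neural network weights; Software 3.0 is prompts.
Andrej Karpathy · Software 3.0, May 2026
My take: The taxonomy is tidy, but it underestimates the conversion cost. Translating an SOP into a prompt is much harder than translating pseudocode into code, because what it has to capture is tacit knowledge, not explicit logic.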
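The most durable skill of this era is the ability to learn anything you want to learn.
Naval Ravikant · Almanack, 2020
My take: True ten years ago, truer today. When a tool's half-life shrinks to 18 months, transferable "meta-skills" are what really compound.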
§ 04

Plans

What's underway

Carol's Website

Built this site with Claude Code, with an MVP in two days. A complete exercise in going from idea to product, and you are looking at the result.

Shipped · 2026

The Big Inventory of AI Products

A taxonomy map of AI products that I made at the end of 2025. A few friends have kept it bookmarked ever since.

Shipped · 2025

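AI Product Weekly Notes

A weekly selection of short commentary, recording what I see and think in the AI product world, and the judgments that keep getting triggered.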

In Progress

Digital Twin Agent v2

Upgrading from a full-text prompt to RAG, plus conversation history, so that the twin can really "answer on my behalf".

Exploring
§ 05

About

About this daily

Tsinghua undergrad · AI PM at a major tech company · Growth for a product with 100M+ users · AI Pilled

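I'm Carol. By day I build AI products; by night I write down what I see, what I think, and the people and ideas I like. This site isn't a blog. It's more like an AI-Pilled daily that I keep for myself and leave open to others.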

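AI makes information more plentiful and judgment scarcer. So this daily goes through it all for you every day, then keeps the part I actually want to say after reading. I hope every piece is worth reading twice.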

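If you're also thinking about how AI relates to product, to expression, and to ways of living, feel free to leave your contact info, or go chat with my digital twin directly.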