Why Anthropic Wins with 5% of the Users
On October 16, 2025, while the tech world was still digesting OpenAIâs Instant Checkout within ChatGPT, Anthropic quietly released something called âSkills.â No flashy demos. No performance charts comparing against GPT-5. Just a simple announcement about folders containing Markdown files and optional scripts. Within hours, Simon Willisonâone of the most productive voices in AI developmentâcalled it âmaybe a bigger deal than MCP.â Hereâs what makes this strange: Claude has only 5% of ChatGPTâs user base. But it generates 40% of OpenAIâs revenue. ...
Writing is Thinking: Human Metacognition in the Age of AI
The Nature Editorial That Caught My Eye A recent Nature Reviews Bioengineering editorial titled âWriting is Thinkingâ made a compelling case for preserving human-written scientific papers in the age of large language models. The editorial argues that writing isnât merely about reporting resultsâitâs a cognitive tool that helps us uncover new thoughts and ideas. As the editorial puts it: âWriting compels us to thinkânot in the chaotic, non-linear way our minds typically wander, but in a structured, intentional manner.â This resonated deeply with me, especially as someone who started this blog as a way to maintain âconscious effortâ in my thinking. ...
I Asked AI to Help Me Read My Great-Grandfather's Homework. It Failed Spectacularly.
When Dad Becomes Your Research Assistant My dad texted me a photo yesterday from his antique shop. âFound this old Cantonese childrenâs book from the 1920s,â he said. âThought you might find it interesting.â I opened the image and immediately felt that familiar mix of nostalgia and confusion. I could mostly read itâtraditional Chinese characters arranged in neat columns, reading right to left, top to bottom. But there were nuances I wasnât catching, cultural references that felt just out of reach. ...
Context Engineering Lessons from Building Manus
I just read an excellent deep dive from the Manus team about context engineering for AI agents, and itâs packed with practical insights that anyone building AI systems should know about. The Core Insight The Manus post1 opens with a striking observation: âMost agent failures are not model failures anymore, they are context failures.â Weâve moved past the era where model limitations were the primary bottleneck. Todayâs models are remarkably capableâthe challenge is giving them the right information at the right time. ...
Blanked Out During Claude's Demo. How Does It Just Work So Well?
I had that sinking feeling during Anthropicâs product demo last week. You know the oneâwhen youâre supposed to be the expert in the room, the person who knows Claude inside and out, but suddenly everything feels foreign. The Anthropic team was walking through their enterprise offerings: Projects, Knowledge, the new Financial Services Analyst. Each feature seemed isolated, almost mundane when presented individually. Projects were just âworkspaces to house user uploaded documents.â Knowledge was simply âparsing PDFs into markdown.â The Financial Services Analyst was just âMCP with enterprise support.â ...
The Perfect Storm: How China Created the AI Talent Pipeline Dominating Silicon Valley
Mark Zuckerberg is paying $100 million signing bonuses for AI researchers. Not $100,000â$100 million.1 And when you look at who Meta is hiring, a pattern emerges that reveals one of the most fascinating talent pipelines in modern technology. Seven of Metaâs eleven publicly listed Superintelligence Labs recruits graduated from prestigious Chinese universities before pursuing advanced degrees in the US.2 Shengjia Zhao, Jiahui Yu, Shuchao Bi, Hongyu Renâthese arenât just individual success stories. They represent a systematic phenomenon thatâs reshaping the global AI landscape. ...
Things I Found Fascinating in Calvin's OpenAI Reflections
I stumbled across Calvin French-Owenâs reflections on his time at OpenAI and found myself completely absorbedânot by the high-level insights about AI safety or AGI timelines, but by the operational details buried throughout. The kind of specifics that make you go âwait, they did what in 7 weeks?â Thereâs something deeply fascinating about getting a peek behind the curtain of a company operating at the technological frontier. Calvinâs post is full of these moments where you realize just how different building products looks when you have the resources, talent, and mandate to try audacious things from scratch. ...
After Kimi K2's Release: No Longer Just a ChatBot
Translation Note: This is a translation of the original Chinese blog post âĺĺ¨ Kimi K2 ĺĺ¸äšĺďźĺäšä¸äť äť ćŻ ChatBotâ by Justin Wong (bigeagle), published on July 13, 2025. All opinions expressed are those of the original author. Introduction Two days ago, the Kimi K2 weâd been working on for most of the year was finally released. After pulling an all-nighter before launch, I slept soundly for two days and finally have some time to write about my thoughts. ...
The AI Honeymoon Period: What Context Engineering Taught Me About Hype vs Reality
When Optimists Meet Realists Two posts crossed my timeline recently that perfectly captured the current moment in AI adoption. The first, from Yangyi, radiated the kind of infectious optimism you see from someone who just completed several projects with Claude Code: âAfter using Claude Code to complete several projects today, I have a strong feeling that the changes we envisioned in 2023 have now crossed a leveraged qualitative change moment⌠We can quickly use Claude Codeâs Hook to complete long-process task planning and implementation. This means that 24-hour AI engineering teams will appear soon, and the entire process will accelerate exponentially.â ...
Translation - Kimi K2: A Deep Evaluation Beyond the Chat Interface
Note: This is a translation of the original Chinese article by grapeot, published on July 12, 2025. The original article can be found at: https://yage.ai/kimi-k2.html Recently, whenever a new large language model is released, we can always see a wave of evaluation reports. But thereâs an interesting phenomenon: the vast majority of evaluations are invariably confined to an environment weâre most familiar withâthe chat interface. This is actually a fundamental limitation. In 2025, when Agentic AI is all the rage, evaluating a model designed for Agentic capabilities within a chat interface is like using whiteboard coding to interview a development director. The feedback you get has almost no relation to its core capabilities. ...