02-03-Daily AI News Daily

Today’s Summary

Claude Sonnet 5 allegedly launching tomorrow, Vertex AI logs leak model ID, SWE-bench hits 80.9% crushing the competition.
GUI Agent becomes the new darling of AI circles, Li Xiang calls out six products all solving the same problem: making AI actually work for you.
Keep your eyes on Anthropic tomorrow—the wait-and-see crowd might win again.

⚡ Quick Navigation

📰 Today’s AI News - Latest updates at a glance

💡 Tip: Want to experience the latest AI models mentioned in this article (Claude 4.5, GPT, Gemini 3 Pro) right away? No account? Grab one at Aivora —one minute to get started, hassle-free support.

Today’s AI News

👀 One-Liner

Claude Sonnet 5 allegedly launching tomorrow, Vertex AI logs already leaked the model ID.

🔑 3 Key Hashtags

#Claude5Leak #OpenAIDoubleQuota #GUIAgentRising

🔥 Top 10 Highlights

1. Claude Sonnet 5 Leak: Codename “Fennec,” Possibly Launching Tomorrow

The wait-and-see crowd wins again? Error logs from Vertex AI revealed claude-sonnet-5@20260203—yep, that’s tomorrow’s date. Word is this new model hit 80.9% on SWE-bench, absolutely crushing every existing coding model. Get this: it’s supposedly half the price of Opus 4.5 but outperforms it across the board. The 1M token context window stays, and it’s even faster. If this pans out, Anthropic just pulled off a “dimensional strike.” We’ll know tomorrow.

2. OpenAI Codex App Officially Launches, Paid Users Get Double Quota for Two Months

Sam Altman himself is hyping it: Codex App for Mac is live. He says he’s “surprised how much I like it”—when a CEO says that, either it’s genuinely great or it’s hitting KPIs. The real deal: all paid users get their rate limits doubled for two months to celebrate. Free users and Go plan subscribers can use it too. Developers are thrilled. If you want to jump on the latest AI coding tools, now’s the time.

3. Li Xiang Breaks Down 2025-2026’s Most Breakthrough AI Products: GUI Agent Takes Center Stage

Li Xiang posted a note calling out six products: Claude Code, Douyin Phone, Manus, OpenClaw, MoltBook, Chrome Gemini. They seem unrelated, but they’re all tackling the same problem—how to make AI actually do work for you. The answer: GUI Agent. AI watches your screen, simulates clicks, bypassing the API wall that most apps throw up. Users want Jarvis, but Jarvis keeps hitting locked doors. GUI Agent is the master key.

4. DeepThink Open-Sourced: Multi-Expert Collaboration System Mimicking Gemini 3 Pro DeepThink

Gemini 3 Pro DeepThink’s too pricey and runs as a black box. So someone built an open-source version. Dynamic multi-expert collaboration—a Planner automatically assigns 2-7 experts based on problem complexity and runs them in parallel. Integrates Exa and Tavily search, injecting real-time data. WebSocket streams each expert’s monologue, drafts, and reasoning chains so you can watch the deep thinking unfold. OpenAI API compatible, frontend-backend separated, easy to maintain and scale. Poor man’s DeepThink, genuinely solid.

5. Stepfun Step 3.5 Flash Open-Sourced: 196B Parameters, Free on OpenRouter

Stepfun open-sourced Step 3.5 Flash—196B parameters, but only 11B activate per token. The standout is inference speed: peak 350 tok/s, seriously fast. It’s free on OpenRouter during the promo phase. Redditors on r/LocalLLaMA say for 128GB setups running local models, this is the new king. Want something fast and powerful for local inference? Worth a shot.

6. Yoroll: AI-Powered Interactive Narrative Game Creation Platform, One Person Handles Everything

After Genie 3 dropped last week, the AI interactive game space exploded. Yoroll’s the new player riding this wave: create branching narrative games with zero code and zero filming. The platform keeps character faces and voices consistent, each choice triggers real-time AI-generated video responses, plus QTE-style game interactions. Interactive narrative games sold solid on Steam China last year—now if you’ve got a good script, one person can build the whole game.

7. Microsoft CTO’s Internal Email Exposed: Why OpenAI’s Board Fired Sam Altman Back Then

An internal email from November 2023 surfaced, with Microsoft CTO Kevin Scott detailing the real reasons OpenAI’s board fired Sam Altman. Core conflict: Ilya Sutskever vs. Sam over resource allocation—research team and product team fighting over GPUs, Ilya saw it as zero-sum. More personal: after Jakub got promoted, he achieved breakthroughs Ilya couldn’t pull off for years. This email reads like palace intrigue but reveals real tensions inside AI companies.

8. Claude Code Usage Tips: Anthropic’s Internal Summary, Parallel Work Is the #1 Secret

Claude Code creator Boris shared Anthropic’s internal usage tips again. The #1 secret: parallel work—spin up 3-5 git worktrees simultaneously, each running an independent Claude session. For complex tasks, plan first, maintain CLAUDE.md as a long-term investment, turn repetitive ops into skills. Here’s a pro move: let Claude auto-fix bugs, enable Slack MCP, paste the bug discussion thread to it, just say “fix it.” No context switching needed.

9. Jike Launches Super Coconut: AI Voice Input Method Supporting Prompt Mode

Jike dropped an AI voice input method called Super Coconut, similar to Typeless and Lightning Say. Supports both transcription and Prompt modes, with local and cloud model options. Works outside input fields too, like Raycast AI Shortcut voice edition. Select an article, voice-say “summarize this,” handles any text, images, screenshots with Prompt processing. Screenshot features are solid—scroll translation, arrow annotations, pixelation, watermarks, super practical. Mac only for now.

10. Karpathy’s nanochat: The Best ChatGPT You Can Buy for $100

Karpathy strikes again. The nanochat project claims to be “the best ChatGPT you can buy for $100,” already sitting at 40K+ Stars on GitHub. It’s a minimal ChatGPT implementation—small codebase but feature-complete. If you want to learn ChatGPT’s core principles or build your own lightweight chat system, this is the best teaching material. Karpathy’s work, guaranteed quality.

📌 Worth Watching

[Product] Yuanbao’s UI is nice but context engineering needs work - Tencent’s Yuanbao looks great, but AI comprehension needs improvement
[Open Source] claude-mem: Persistent memory for Claude Code - Auto-captures coding sessions, injects into future context, 18K Stars
[Open Source] PageIndex: Vector-free RAG solution - Replaces vector retrieval with tree structures, 12K Stars
[Tool] OpenClaw spending breakdown: $30 in 3 days, 1 billion tokens - User seeking cheaper API alternatives
[Business] Alibaba drops 3 billion yuan in red packets promoting Tongyi Qianwen - Aiming to reshape the AI super-app landscape during Chinese New Year
[Research] Southeast University releases world’s first concrete science large model - AI meets engineering materials crossover attempt

😄 AI Fun

Perplexity Goes Haywire: Asking About Xiaomi Speaker Triggers Crazy Query Mode

Today’s wildest AI news: someone asked Perplexity about Xiaomi speakers, and when they hit “Does OH2P support higher quality Bluetooth audio?”, the AI suddenly entered full-blown crazy query mode, eventually just… losing it. The screenshot shows the AI in what looks like an infinite loop. Comments: “AI: I don’t care, I’m searching the entire internet for Bluetooth audio info!” 😂

🔮 AI Trend Predictions

Claude Sonnet 5 Official Launch

Predicted Time: February 3-7, 2026
Confidence: 75%
Reasoning: Today’s Claude Sonnet 5 leak + Vertex AI logs showing claude-sonnet-5@20260203, launch window points to tomorrow

GUI Agent Becomes Mainstream Interaction Method

Predicted Time: Q2 2026
Confidence: 70%
Reasoning: Today’s Li Xiang breakdown + Douyin Phone, OpenClaw, Chrome Gemini all betting on this direction

OpenAI vs. Anthropic Coding Model War Escalates

Predicted Time: February-March 2026
Confidence: 80%
Reasoning: Today’s Codex App launch + Claude Sonnet 5 leak, competition in coding will intensify

Local LLM Wave Surges Again

Predicted Time: Q1 2026
Confidence: 65%
Reasoning: Today’s Step 3.5 Flash open-source + more high-performance models runnable on 128GB devices

❓ Related Questions

How to Experience Claude Sonnet 5?

Claude Sonnet 5 hasn’t officially launched yet, but based on leaked info, it might go live on February 3, 2026. You’ll need a paid Anthropic account. Domestic users might face payment or registration restrictions.

Solution: Visit Aivora to get ready-made accounts—instant delivery, worry-free support.

Last updated on 2026/02/03 22:28:30

02-04-Daily 02-02-Daily