03-07 AI News Daily
Daily Summary
GPT-5.4's million-token context window sounds impressive, but it can't even figure out whether to walk or drive 50 meters to a car wash—common sense remains the weak spot.
Xiaomi jumped on the bandwagon with a mobile Agent and OpenClaw burns through $25 a day: the era of AI doing your work has arrived, but your wallet's already empty.
Models keep getting smarter and pricier, so wait-and-see folks can hold out one more round without losing out.
⚡ Quick Navigation
- 📰 Today’s AI News - Latest updates at a glance
Today’s AI News
👀 One-Liner
GPT-5.4 rolled out with a million-token context, but it still can’t decide whether you should drive to the car wash.
🔑 3 Keywords
#GPT5.4Flop #LobsterBurnsMoneyFast #PixelOffice
🔥 Top 10 Headlines
1. OpenAI Launches GPT-5.4 Series: Million-Token Context Window, Pro and Thinking Versions Debut Simultaneously
Thought a 200k-token context was enough? OpenAI just cranked GPT-5.4’s API up to 1 million tokens. This time they’re rolling out three flavors: standard, reasoning (GPT-5.4 Thinking), and high-performance (GPT-5.4 Pro). Benchmarks in finance and law are seriously impressive, and token efficiency got a solid bump. But controversy came with it—pricing that’ll make your wallet cry, and safety scores actually dropped. My take: capabilities are climbing, but OpenAI’s “premium strategy” is pushing some developers toward competitors.

2. GPT-5.4 Thinking Still Fails the Car Wash Test
GPT-5.4 Thinking, supposedly with massive reasoning upgrades, got stumped by a question a grade-schooler could nail: “The car wash is 50 meters away—should I walk or drive?” The answer’s obvious—just walk. But GPT-5.4 Thinking couldn’t figure it out. What’s the lesson? Models are genuinely stronger at math and logic, but common-sense reasoning is still the Achilles heel. The “smarter” the model, the more likely it crashes on the simplest questions. Pretty ironic.

3. Microsoft Bing Fully Integrates Sora 2, Free Video Generation Now Open to Everyone
Used to need an OpenAI Pro account, a waitlist spot, and cash to use Sora. Now Microsoft just dropped Sora 2 straight into Bing Video Creator—free for all. Photo-quality output, built-in sound effects, cross-scene narrative continuity. Even better: C2PA watermarking so every video’s traceable. Free credits, then swap points for more—basically unlimited. CapCut, watch your back.

4. Xiaomi Launches First Mobile Agent Product Xiaomi miclaw, Closed Beta Kicks Off
After OpenClaw blew up, phone makers finally got antsy. Xiaomi dropped Xiaomi miclaw, a mobile Agent built on their homegrown MiMo model—basically “lobster for your phone”—letting AI directly control your device to handle complex tasks. Picture this: tell your phone “book me a conference room tomorrow at 3 PM” and AI opens the calendar, fills it in, sends invites. Currently closed beta, but the direction’s crystal clear—AI’s moving from “chatbox” to “doing it for you.”

5. Someone Built a Pixel Office for OpenClaw Lobster, GitHub Project Went Viral
Ever wonder what your AI Agent’s actually doing while it’s grinding away in the background? This open-source project turned it into a pixel-art character working in a virtual office. AI thinking? It’s frantically typing at the desk. Idle? Grabbing coffee. Code error? Facing the wall in shame. Hit 1.5k stars in days, supports multi-Agent hangouts and mobile monitoring. Real talk—this is how devs should slack off: watching AI work for you.
6. OpenAI Releases Codex Security Agent, Research Preview Now Live
Coders know the pain: spend an hour finding the bug, five minutes fixing it—the worst part’s pinpointing where it is. OpenAI’s new Codex Security is an AI Agent built specifically for app security, analyzing your entire project context to auto-detect, verify, and patch complex vulnerabilities. Key phrase: “higher confidence, fewer false positives”—no more traditional security scanners flooding you with red warnings that turn out to be nothing. Still in research preview, but the need is real.
7. Alibaba’s AI Strategy in Crisis: What’s Behind Lin Junyang’s Departure
Lin Junyang leaving Alibaba’s Tongyi team set the industry buzzing. Surface-level read: talent drain. Deeper issue? One sharp observer nailed it: Alibaba missed every critical window in the AI ecosystem. Coding, Agents, OpenClaw: none of them got locked into Tongyi. Meanwhile, MiniMax and Moonshot rode the lobster wave and their token consumption flipped the script. Falling behind on model capability for a moment isn’t scary; what’s scary is when your ecosystem can’t keep up. Alibaba officially denied a “mass exodus,” but the strategic reckoning’s probably just starting.

8. OpenClaw Burns Through 25 Million Tokens in a Day, $25 Wallet Completely Drained
Lobster’s great, but your bank account suffers. A dev deployed OpenClaw on their own server using MiniMax’s M2.5 model, just doing simple stuff like “set up a Telegram bot, write some scheduled tasks.” Result? Half a day, $25 gone, 25 million tokens, 200+ requests. The culprit: Agents go crazy “thinking + calling tools,” triggering dozens of model calls per task. This might be the realest pain point of the Agent era—capability’s there, but you can’t afford it.
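To put those numbers in perspective, here is a rough back-of-the-envelope sketch in Python. It uses only the figures reported above ($25, 25 million tokens, 200+ requests). Treating "200+" as exactly 200 is an assumption, so the per-request averages are upper bounds, and the function names are made up for illustration.

```python
# Figures as reported in the item above. "200+" requests is assumed to be
# exactly 200, so the per-request averages below are upper bounds.
TOTAL_COST_USD = 25.0
TOTAL_TOKENS = 25_000_000
REQUESTS = 200


def blended_price_per_million(cost_usd: float, tokens: int) -> float:
    """Average price paid per one million tokens (input and output combined)."""
    return cost_usd / (tokens / 1_000_000)


def per_request(cost_usd: float, tokens: int, requests: int) -> tuple[float, float]:
    """Average tokens and dollars consumed per request."""
    return tokens / requests, cost_usd / requests


price = blended_price_per_million(TOTAL_COST_USD, TOTAL_TOKENS)
tokens_each, cost_each = per_request(TOTAL_COST_USD, TOTAL_TOKENS, REQUESTS)

print(f"${price:.2f} per million tokens")
print(f"{tokens_each:,.0f} tokens and ${cost_each:.3f} per request")
```

The blended rate works out to about a dollar per million tokens, which is not expensive on its own. The bill comes from volume: each request averages on the order of a hundred thousand tokens once the Agent's thinking and tool calls are counted.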
9. AI Cracks Dia Browser Cookie Encryption, Actually Succeeds
Dia browser uses a pretty complex custom encryption: v10 prefix + 16-byte nonce + AES ciphertext, then after decryption there’s another 16-byte header, and the actual cookie value starts at byte 17. Sounds hardcore, right? AI walked through it step-by-step and actually cracked it. Two takeaways: AI’s reverse-engineering chops are wild, and every security team just got a wake-up call—your encryption scheme? AI might understand it better than you do.
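The byte layout described above can be sketched in a few lines of Python. This is only an illustration of the framing (prefix, nonce, ciphertext, then a 16-byte header in the plaintext). The AES mode and the key derivation are not stated in the item, so the decryption step is deliberately left out, and the function names and sample bytes are hypothetical.

```python
PREFIX = b"v10"   # version tag named in the item above
NONCE_LEN = 16    # nonce length as described above
HEADER_LEN = 16   # bytes to skip at the start of the decrypted plaintext


def split_encrypted_cookie(blob: bytes) -> tuple[bytes, bytes]:
    """Split a raw encrypted value into (nonce, ciphertext)."""
    if not blob.startswith(PREFIX):
        raise ValueError("missing v10 prefix")
    body = blob[len(PREFIX):]
    if len(body) < NONCE_LEN:
        raise ValueError("blob too short to hold a nonce")
    return body[:NONCE_LEN], body[NONCE_LEN:]


def cookie_value(plaintext: bytes) -> bytes:
    """Drop the 16-byte header; the value starts at byte 17 (index 16)."""
    if len(plaintext) < HEADER_LEN:
        raise ValueError("plaintext shorter than the header")
    return plaintext[HEADER_LEN:]


# Synthetic bytes for illustration only, not a real cookie.
blob = PREFIX + bytes(range(16)) + b"CIPHERTEXT"
nonce, ciphertext = split_encrypted_cookie(blob)

# Stand-in for the output of the (omitted) AES decryption step.
plaintext = b"H" * 16 + b"session=abc"
value = cookie_value(plaintext)
print(nonce.hex(), ciphertext, value)
```

Once the layout is known, the parsing itself is trivial. The hard part the AI reportedly worked through was discovering that layout in the first place.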
10. VAST Closes $50M Funding Round, Alibaba and Baidu Race to Back 3D Generation’s New King
3D content creation’s always been the “high barrier to entry” game—modeling, rendering, materials, takes months to learn. VAST’s TripoAI platform already has 6.5 million creators and has generated nearly 100 million 3D models. This $50M Series A, co-led by Alibaba and Hengxu Capital, goes toward algorithm iteration and building a UGC ecosystem. Goal’s clear: make 3D creation as easy as posting to social media. With AI video generation already a bloodbath, 3D might be the next explosion.
📌 Worth Watching
- [Product] Codepilot Nails Long-Term Memory and Assistant Features — AI coding assistant finally “remembers you,” no more explaining your project from scratch every time
- [Product] OpenClaw Hooks Up Feishu Bot, Controls Music, Writes Docs, Does It All — After connecting lobster to Feishu, you actually get that “AI secretary” vibe
- [Product] Get Notes Launches OpenClaw Skill: Say It Once, Notes Are Saved — No app switching, no copy-paste, info flows through and sticks
- [Open Source] Skill Publisher: One-Click Publish Your Skill to GitHub — Git-phobic friends, you’re saved: npx skills add handles everything
- [Product] Roblox Rolls Out AI Real-Time Rewrite, Auto-Converts Rule-Breaking Content to Polite Talk — Not just “###” censoring anymore: AI rewrites it for you, false positives dropped 20x
- [Business] Ctrip Voluntarily Shut Down “AI Business Assistant,” Pushing Hotel Pricing Back to Sanity — Sometimes turning AI off is the smarter move
- [Product] yt-dlp Actually Supports Bilibili Video Downloads and Subtitle Transcription — Old tool, new discovery—pair it with NotebookLM Skill to turn videos into articles fast
- [Other] Douyin Still Crushing It, Post-CNY AI App DAU Landscape Unchanged — Everyone’s throwing ad money around, but users stick with ByteDance—lower-tier market barriers are tougher than expected
😄 AI Fun Stuff
Grok Android App Translated “Memory” as “RAM Count”
A user opened Grok’s Android app settings and found the memory feature was labeled “RAM Count”—yep, machine translation fail. Even weirder: when they checked what Grok “remembered,” it had quietly saved all their old spicy conversations, and you gotta delete them one by one. 😂 Lesson: chat fast with AI, regret later.
Used Bilibili for 10+ Years, Just Found Out It Has a Forum
A decade-long Bilibili user hunting for OpenClaw communities stumbled on Bilibili’s own forum called “Bilibili Small Station.” Comments exploded—everyone saying “didn’t know this existed.” OpenClaw’s viral moment accidentally resurfaced a product Bilibili almost forgot about. 😂
🔮 AI Trend Predictions
Phone Maker Agent Wars Go Full Throttle
- Timeline: April-May 2026
- Confidence: 80%
- Reasoning: Xiaomi launched Xiaomi miclaw today, and Huawei, OPPO, and Vivo all have on-device models in the works. OpenClaw proved Agent demand is real, so phone makers will follow.
Agent Cost Crisis Triggers New “Model Price War”
- Timeline: April 2026
- Confidence: 75%
- Reasoning: Today’s OpenClaw story (25M tokens burned in a day) shows the cost problem, and Agent scenarios consume 10-50x more tokens than chat. Model makers must cut prices to keep devs.
Alibaba Tongyi Team Undergoes Major Strategic Pivot
- Timeline: Q2 2026
- Confidence: 70%
- Reasoning: Today’s analysis of the ecosystem gaps in Alibaba’s AI strategy, plus the industry attention on Lin Junyang’s departure, suggests Alibaba will push to catch up on ecosystem integration (especially Agent/OpenClaw).
AI Security Audit Tools Become Standard Issue
- Timeline: May-June 2026
- Confidence: 60%
- Reasoning: Today OpenAI launched Codex Security and AI cracked Dia’s cookie encryption. With both offense and defense using AI, security audit tools will become enterprise must-haves.
3D Generation Track Enters Capital-Intensive Phase
- Timeline: Q2 2026
- Confidence: 55%
- Reasoning: VAST raised $50M today while the video generation track is already saturated. Capital is hunting for the next growth vector, and 3D generation is the most likely candidate.
❓ Related Questions
How do I experience GPT-5.4’s latest models?
GPT-5.4 series just launched with standard, Thinking reasoning, and Pro high-performance versions. API now supports million-token context, but pricing’s steep and mainland users face payment and access hurdles.