01-25 AI News Daily

Today’s Summary

OpenAI is done sitting on the sidelines: Sam Altman teases a month of Codex updates starting next week, aimed squarely at Claude Code.
Cursor’s built-in “pause” tool comes to light: the AI asks for your input after writing code. A fully automated pipeline that turns a single sentence into a music video is also up and running.
AI coding tools are slugging it out. Early adopters, get your wallets ready; developers, go try the new features.

Today’s AI News

👀 One-Liner

Sam Altman drops a teaser: big Codex moves begin next week.

🔑 3 Keywords

#OpenAICounterattack #CursorHiddenFeature #OneSentenceMVGeneration


🔥 Top 10 Highlights

1. OpenAI Launches Intensive Codex Model Updates Starting Next Week

Just when Claude Code and Cursor were stealing the spotlight, Sam Altman decided to make a move. He tweeted a heads-up himself: major Codex-related launches will roll out over the next month, starting next week. The confidence is unmistakable: “We hope you will be delighted.” He also said OpenAI will soon reach the Cybersecurity “High” level on its Preparedness Framework. Clearly OpenAI has no intention of handing the AI coding pie to anyone. Early adopters, time to loosen those purse strings!


2. Cursor’s Hidden “Pause” Feature: Interactive Feedback Without Installing MCP

You might not know that Cursor has a hidden gem called AskQuestion. Previously, if you wanted AI to stop at critical moments and ask for your input, you’d need to install an MCP plugin. Turns out this feature is built-in! The effect is that AI won’t just run off after writing code—instead, it pops up options for you to confirm or add requirements. The post even includes a complete prompt template. The core idea: Force AI to call AskQuestion after every response to ask for feedback, and prohibit it from ending the conversation on its own. A lazy person’s dream—one less plugin to install.


3. Gemini CLI + Chrome MCP: Replicate Manus-Style Task Flows

Want AI to directly control the Chrome browser you’re using? This tutorial walks you through it step by step. The core is enabling Chrome’s remote debugging port (9222), then using Chrome MCP Tools to let Gemini take over the browser. What can you do after taking over? Screenshot and analyze UI, diagnose page performance, check console errors, extract web data, even auto-click buttons. The entire workflow is almost identical to Manus-style Agent web operations, but completely free. Recommended models: Gemini 3 Flash/Pro or Nvidia’s free GLM 4.7.
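
As a rough illustration of the first step, the sketch below shows how the debugging port can be checked from Python. It assumes Chrome was started with the --remote-debugging-port flag; the profile path and the function names are hypothetical and do not come from the tutorial. Chrome serves a small JSON document at /json/version on that port, which includes the WebSocket URL that DevTools clients connect to.

```python
import json
from urllib.request import urlopen

DEBUG_PORT = 9222  # the remote debugging port named in the tutorial


def launch_args(port: int = DEBUG_PORT, profile_dir: str = "/tmp/chrome-debug") -> list[str]:
    """Flags for starting Chrome with remote debugging enabled.

    profile_dir is a hypothetical scratch profile path, not from the tutorial.
    """
    return [f"--remote-debugging-port={port}", f"--user-data-dir={profile_dir}"]


def version_url(port: int = DEBUG_PORT) -> str:
    """HTTP endpoint Chrome serves once the debugging port is open."""
    return f"http://127.0.0.1:{port}/json/version"


def websocket_endpoint(payload: str) -> str:
    """Extract the DevTools WebSocket URL from a /json/version response body."""
    return json.loads(payload)["webSocketDebuggerUrl"]


def probe(port: int = DEBUG_PORT, timeout: float = 2.0) -> str:
    """Return the WebSocket URL if Chrome is listening; raises OSError otherwise."""
    with urlopen(version_url(port), timeout=timeout) as resp:
        return websocket_endpoint(resp.read().decode("utf-8"))
```

Calling probe() while no Chrome instance is listening raises an OSError, which makes it a quick preflight check before handing the browser over to an MCP tool.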


4. Generate Music Videos from One Sentence: Suno + Whisper + Jimeng Fully Automated Pipeline

“Write me a song about programmers working late and turn it into an MV.” A request like this used to take hours to pull off; now it takes one sentence. The workflow is: LLM writes lyrics → reverse-engineered Suno API generates music → Whisper transcribes with timestamps → LLM corrects the transcript and generates visual descriptions → Jimeng generates images → FFmpeg composites the video. The author even used Opus to fix a two-year-old Suno reverse-engineering library, which now supports the latest V5 model (codename “Raven”). The lyrics might be a bit silly, but the automation pipeline itself is genuinely impressive.
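
The last hops of that pipeline (timestamps in, video out) can be sketched as follows. This is an illustrative assumption, not the author's code: it takes Whisper-style segments with start and end times in seconds, pairs each one with a generated image, and writes a list for FFmpeg's concat demuxer. The file names are placeholders.

```python
def concat_list(segments: list[dict], image_paths: list[str]) -> str:
    """Build an FFmpeg concat-demuxer list showing one image per lyric segment.

    segments: Whisper-style dicts with "start" and "end" in seconds.
    image_paths: one generated image per segment (placeholder file names).
    """
    if len(segments) != len(image_paths):
        raise ValueError("need exactly one image per segment")
    lines = ["ffconcat version 1.0"]
    for i, (seg, path) in enumerate(zip(segments, image_paths)):
        # The first image starts at 0 so the video stays aligned with the audio.
        start = 0.0 if i == 0 else seg["start"]
        # Hold each image until the next line begins, covering instrumental gaps.
        end = segments[i + 1]["start"] if i + 1 < len(segments) else seg["end"]
        lines.append(f"file '{path}'")
        lines.append(f"duration {end - start:.3f}")
    if image_paths:
        # The concat demuxer ignores the final duration unless the file is repeated.
        lines.append(f"file '{image_paths[-1]}'")
    return "\n".join(lines) + "\n"
```

The resulting text could be saved to a file and passed to ffmpeg with -f concat alongside the audio track; the exact encoding flags depend on the target format.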


5. Claude Code Subtitle Magic: Download Videos + Bilingual Subtitles in One Sentence

Adding bilingual subtitles to videos used to require Arctime for timing, CapCut for translation, and a working knowledge of ASS syntax. Now? One sentence: “Download this video for me, add bilingual Chinese-English subtitles, English in green and Chinese in yellow, place them above the video.” A few minutes later, you get a 1080p video with accurately aligned subtitles, ready to use. The omni-captions-skills package uses Claude for translation directly, so no extra LLM setup is needed. Install command: npx skills add https://github.com/lattifai/omni-captions-skills. Subtitle enthusiasts rejoice.
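
For a sense of what the skill spares you from writing by hand, here is a minimal sketch of the ASS styling behind a request like the one above. It is not taken from omni-captions-skills; the font, size, and margins are placeholder assumptions. ASS stores colours as &HAABBGGRR (blue before red) and uses Alignment 8 for top-centre.

```python
def ass_colour(r: int, g: int, b: int, alpha: int = 0) -> str:
    """ASS colours are written &HAABBGGRR: alpha first, then blue, green, red."""
    return f"&H{alpha:02X}{b:02X}{g:02X}{r:02X}"


def ass_style(name: str, rgb: tuple[int, int, int], margin_v: int) -> str:
    """Build one line for the [V4+ Styles] section, anchored at the top centre."""
    fields = [
        name, "Arial", "48",        # Name, Fontname, Fontsize (placeholders)
        ass_colour(*rgb),           # PrimaryColour: the visible text colour
        "&H000000FF",               # SecondaryColour
        "&H00000000",               # OutlineColour
        "&H00000000",               # BackColour
        "0", "0", "0", "0",         # Bold, Italic, Underline, StrikeOut
        "100", "100", "0", "0",     # ScaleX, ScaleY, Spacing, Angle
        "1", "2", "0",              # BorderStyle, Outline, Shadow
        "8",                        # Alignment: 8 = top centre
        "10", "10", str(margin_v),  # MarginL, MarginR, MarginV
        "1",                        # Encoding
    ]
    return "Style: " + ",".join(fields)


# English in green on the first row, Chinese in yellow just below it.
english = ass_style("EN", (0, 255, 0), margin_v=20)
chinese = ass_style("ZH", (255, 255, 0), margin_v=75)
```

With a top alignment, MarginV is measured from the top edge, so giving the two styles different margins keeps the two languages from overlapping.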


6. VibeMark: One-Click AI Watermark for Any Photo to Make It Look “AI-Generated”

This tool’s purpose is a bit… delicate. It can add the official watermarks of major AI platforms to any image: Google’s starburst, Doubao, Jimeng, Tongyi Wanxiang, Zhipu Qingyan, and even custom ones. Why would you do this? The author puts it bluntly: “When you have some photos you don’t want people to know you took yourself, you can add a watermark saying it’s AI-generated.” 😂 It is a pure frontend static webpage with no data uploads, and it supports batch processing. It is open source on GitHub, so go find it if you want to play.


7. fast-tavern: Use Tavern’s Prompt Processing Logic Outside of Tavern

SillyTavern’s prompt ecosystem is already quite mature: presets, world books, character cards, regex scripts, and macro variables form a full combo that delivers impressive results. The problem is that this logic only works inside Tavern. Now someone has extracted it into a standalone library that supports TypeScript and Python. This means you can reuse Tavern’s prompt assembly workflow in your own projects without reinventing the wheel. For developers who want to build character roleplay applications, this is a major win.
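
To make “prompt assembly” concrete, here is a toy sketch of two of the pieces listed above: macro variables and keyword-triggered world book entries. It does not use fast-tavern's actual API; the function names and data shapes are hypothetical. {{user}} and {{char}} are the familiar Tavern-style macros.

```python
import re


def expand_macros(text: str, variables: dict[str, str]) -> str:
    """Replace {{name}} placeholders; unknown macros are left untouched."""
    def repl(match: re.Match) -> str:
        key = match.group(1).strip().lower()
        return variables.get(key, match.group(0))
    return re.sub(r"\{\{([^{}]+)\}\}", repl, text)


def triggered_entries(world_book: list[dict], recent_text: str) -> list[str]:
    """Return the content of entries whose keywords appear in recent chat text."""
    lowered = recent_text.lower()
    return [
        entry["content"]
        for entry in world_book
        if any(key.lower() in lowered for key in entry["keys"])
    ]


def assemble_prompt(system_template: str, character: dict, world_book: list[dict],
                    history: list[str], user_name: str) -> str:
    """Join preset, character card, triggered lore, and history into one prompt."""
    variables = {"user": user_name, "char": character["name"]}
    recent = " ".join(history[-3:])  # only scan the last few messages for keywords
    parts = [
        system_template,
        character["description"],
        *triggered_entries(world_book, recent),
        *history,
    ]
    return "\n".join(expand_macros(part, variables) for part in parts)
```

A real implementation also has to handle ordering, token budgets, and regex scripts, which is exactly the work a shared library saves.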


8. Open Source Banana Pro NSFW Manga Translation Project: Clever Workaround

Banana Pro can translate manga, but it refuses to handle NSFW content. The author came up with a clever hack: instead of sending the complete NSFW image to the AI, send only the parts that contain dialogue. The AI never sees the sensitive content, so it has no reason to refuse. The translated text is then automatically filled back into the original image. The entire workflow: upload image → mark the dialogue areas → call the Banana Pro API to translate → refill the original image. It is very much a workaround, but it works.
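
The crop, translate, refill idea can be sketched without any image library. In this toy version an image is a grid of characters, and translate stands in for the call to the translation API; both are assumptions made for illustration, not the project's code.

```python
from typing import Callable

Image = list[list[str]]          # toy stand-in: a grid of one-character "pixels"
Box = tuple[int, int, int, int]  # (left, top, right, bottom), right/bottom exclusive


def crop(image: Image, box: Box) -> Image:
    """Copy out the rectangle described by box."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def paste(image: Image, patch: Image, box: Box) -> None:
    """Write patch back into image at the position described by box."""
    left, top, right, _ = box
    for dy, row in enumerate(patch):
        image[top + dy][left:right] = row


def translate_bubbles(image: Image, boxes: list[Box],
                      translate: Callable[[Image], Image]) -> Image:
    """Send only the marked regions to translate() and refill a copy of the page."""
    result = [row[:] for row in image]  # leave the original page untouched
    for box in boxes:
        patch = crop(image, box)        # the only pixels the translator ever sees
        paste(result, translate(patch), box)
    return result
```

The key property is that translate() receives nothing outside the marked boxes, which is what lets the rest of the page stay private.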


9. Non-Programmer’s Guide to AI Coding: Ben’s One-Day Crash Course

If you’re not a programmer but want to use AI to write code, this guide is a must-read. It doesn’t just cover AI coding tools—it also explains Git version control, terminal commands, environment variables, and dependency management in plain language. The author Ben is the founder of Ben’s Bites, and he always writes in an accessible way. Core takeaway: AI can help you write code, but you need to know what environment the code runs in, how to manage versions, and how to roll back if something breaks. One day to go from zero to collaborating with AI—definitely worth bookmarking.


10. baoyu-skills Project Iteration Model: Spot Issue → Analyze → Let AI Solve → Verify

Baoyu shared his workflow for maintaining open source projects. The core is a “spot issue, analyze, solve, verify” loop in which the “solve” step is handed directly to Claude Code. For example, today he noticed that the commit messages were all meaningless version numbers, so he had the AI split each module’s changes into separate commits. He describes the requirements in a few sentences, the AI modifies the code itself, and he verifies the results. This human-AI collaboration model is remarkably efficient.


😄 AI Fun Fact

Canadian Representative Signs in Wrong Spot—Historic Moment

If you think you made a huge mistake today, remember this story: In 1945, when signing Japan’s surrender document, the Canadian representative signed his name in the French representative’s section. This was the document ending World War II! So next time AI writes buggy code for you, don’t beat yourself up—humans have made way more ridiculous mistakes on far more important occasions. 😂


🔮 AI Trend Predictions

Major OpenAI Codex Update Release

  • Predicted Timeline: Late January to early February 2026
  • Confidence Level: 85%
  • Reasoning: In today’s news Sam Altman teased Codex updates and explicitly said “starting next week,” which makes the timeline clear

AI Coding Tools Enter “Skills/Plugin” Ecosystem Competition Phase

  • Predicted Timeline: Q1 2026
  • Confidence Level: 75%
  • Reasoning: Several of today’s items involve Claude Code Skills (subtitles, music MVs, baoyu-skills), and Cursor’s built-in tools are being discovered, which suggests the ecosystem is maturing rapidly

Browser Automation Agent Tools Explosion

  • Predicted Timeline: Q1 2026
  • Confidence Level: 70%
  • Reasoning: Today’s Gemini CLI + Chrome MCP tutorial, together with the continued popularity of Manus-type products, shows that technical barriers are dropping

❓ Related Questions

How to Experience Claude Code’s Skills Feature?

Claude Code’s Skills feature requires a Claude Pro subscription or API access. Users in mainland China may run into payment difficulties or account registration restrictions.

