Anthropic Launches Claude Opus 4.8 After $65B Raise
ALSO: Wix cuts 1,000 jobs, Amazon’s tokenmaxxing backfires
Welcome to Guru99 AI Report!
๐ค Meet Claude Opus 4.8: The Most Capable Model Yet
- It’s four times less likely to let coding flaws slip by unnoticed, and more willing to flag uncertainty rather than bluff its way through.
- A new effort control lets you decide how hard Claude works, trading speed against depth.
- Dynamic workflows let Claude Code run hundreds of parallel subagents to handle huge projects, like codebase-wide migrations.
- Pricing stays the same: $5 per million input tokens, $25 per million output.
- Anthropic teased Mythos, an even more intelligent model class, arriving “in the coming weeks.”
๐ Robinhood just handed AI the keys to your portfolio
- Robinhood uses MCP โ a standard that connects AI tools to outside apps โ to link agents to a dedicated trading account.
- Agents can analyze portfolios, suggest strategies, and execute trades within limits you define.
- Robinhood plans to expand beyond stocks into options, crypto, futures, event contracts, and prediction markets.
- Gold Card users also get virtual cards, letting an assistant spend within set caps.
- The bigger shift: agent apps now need permissions, spending limits, audit logs, and a panic switch baked in. (See how it works)
๐ Wix Cuts 1,000 Jobs as AI and a Strong Shekel Reshape the Business
- Headcount falls from 5,277 to about 4,200, with more than 60% of the team based in Israel.
- A surging shekel has inflated costs for a company that earns mostly in dollars but pays most salaries in shekels โ a structural squeeze better products can’t fix.
- Abrahami called AI the biggest shift in how companies are built since the 1970s, flattening management and rolling out AI-native roles dubbed “Xengineer” and “Creators.”
- Wix joins Meta, Cisco, and Intuit in tying layoffs to AI, even as its stock has slid over 50% this year.
๐ Amazon’s ‘Tokenmaxxing’ Backfires as Staff Game AI Metrics
- Amazon set a goal for 80%+ of developers to use AI weekly, tracking model and token usage through staff rankings this year.
- Its in-house tool MeshClaw lets employees build AI agents that can deploy code, sort emails, and operate across company software.
- Staff told the Financial Times the pressure created “perverse incentives,” with people wasting tokens to inflate their stats.
- Amazon says the numbers aren’t used in performance reviews, and has since limited who can see individual usage data.
๐ OpenAI just built a tax AI that keeps getting smarter on its own
- The breakthrough isn’t the accuracy โ it’s the self-improvement loop: every time a human accountant fixes an error, the system logs it, and Codex proposes a tested code change to stop it recurring.
- Results came fast โ returns hitting 75% field accuracy jumped from 25% to 86% in six weeks, eventually reaching up to 97% draft accuracy.
- Across 30+ firms and ~7,000 returns, it cut prep time by about a third and lifted throughput roughly 50%.
- The tricky part wasn’t simple W-2s but messy K-1s, rental schedules, and spreadsheets โ the judgment-heavy work that usually eats an accountant’s hours.
โ๏ธ One prompt, every AI: the side-by-side comparison
Step-by-step:
Open OpenRouter Fusion and choose how you’d like to pay for AI usage โ either OpenRouter credits or API keys you already pay for.
In Fusion, pick the models you want to compare โ we tested Opus 4.7 vs. GPT 5.4 vs. Grok โ and run one benchmark prompt at a time, keeping it identical across models.
“You are advising a 20-person SaaS company deciding whether to replace its weekly status meeting with an async written update. Write a recommendation memo with 3 benefits, 3 risks, and a 2-week implementation plan. Keep it concise and practical.”
Open the responses, read the side-by-side analysis, and note which model performs strongest. In the demo, roughly 10 comparisons cost around 40 cents.
Hey! I’m Krishna Rungta
Founder of Guru99.com, Editor-in-chief & Technology Expert
Was this email forwarded to you? Sign up for free here.
