The AI Report
Posts
Anthropic joins Apple for coding revolution?

Anthropic joins Apple for coding revolution?

⚠️ Google’s AI safety regression?

Martin Crowley, Liam Lawson & Amanda Greenwood
May 05, 2025

WORK WITH US • COMMUNITY • PODCASTS • SIGN UP

Monday’s AI Report

• 1. 💡 Anthropic joins Apple for coding revolution?
• 2. 📽️ Create how-to videos with Guidde
• 3. ⚠️ Google’s AI safety regression?
• 4. 👀 OpenAI assures fixes for sycophancy
• 5. 💼 Become an AI consultant with Innovating with AI
• 6. ⚙️ Trending AI Tools
• 7. 🏗️ Practical AI Applications
• 8. 📑 Recommended Resources

Read Time: 5 minutes

✅ Refer your friends and unlock rewards. Scroll to the bottom to find out more!

Anthropic joins Apple for coding revolution?

🚨 Our Report

Apple is reportedly teaming up with Amazon-backed AI start-up, Anthropic, to build an AI-powered coding platform that will use AI to help developers write, edit, and test code.

🔓 Key Points

The new platform will be powered by Anthropic’s Claude Sonnet model and integrated into Apple’s existing code-writing tool, Xcode, so developers can use a chat interface to ask questions, test code, and fix bugs.
Apple will initially release the new version to internal software engineers (who currently rely on Xcode to develop/release Apple products), for internal testing, and if it’s successful, will roll it out to third-party developers.
Xcode has previously faced criticism over hallucinations and slow performance, whereas the Claude series is popular with developers for its ability to handle complex tasks and integrate with different platforms.

🔐 Relevance

This comes after Microsoft CEO, Satya Nadella, announced that 20-30% of its code has been written by AI, and Meta’s Mark Zuckerberg predicted that up to 50% of its code will be written by AI by 2026, and OpenAI is reportedly considering a $3B deal to buy AI-powered coding tool, Windsurf. This just highlights the trend towards using AI tools for coding.

🎥 Guidde - Create how-to video guides quick and easy with AI

Tired of explaining the same thing over and over again to your colleagues?

It’s time to delegate that work to AI. Guidde is a GPT-powered tool that helps you explain the most complex tasks in seconds with AI-generated documentation.

1️⃣ Share or embed your guide anywhere

2️⃣ Turn boring documentation into stunning visual guides

3️⃣ Save valuable time by creating video documentation 11x faster

Simply select ‘capture’ on the browser extension and the app will automatically generate step-by-step video guides complete with visuals, voiceover and clear instructions on what to do.

The good bit? The extension will cost you nothing.

🚨 Our Report

Google has released a technical report that reveals its newest model (which is still in preview)—Gemini 2.5 Flash—scored worse than its predecessor— Gemini 2.0 Flash—on safety benchmarking tests.

🔓 Key Points

Although Gemini 2.5 Flash follows instructions better than Gemini 2.0 Flash, it ”performs worse” on two key safety metrics: “text-to-text safety” and “image-to-text safety,” and is more likely to violate its safety guidelines.
2.5 Flash regressed by 4.1% on the “text-to-safety” metric (measures when a model violates guidelines), and 9.6% on the “image-to-text safety” metric (measures how well it follows boundaries when processing images).
Google said the regressions are the result of false positives in training data, but also admitted that the new model is prone to sometimes generating “violative content” when explicitly asked.

🔐 Relevance

This demonstrates a wider issue within the AI industry: Tech companies are trying to find a balance between getting their AI models to provide helpful, comprehensive responses, with multiple perspectives, to all queries, including controversial or sensitive ones, while also making sure they don’t violate safety policies and cause harm.

Which should come first with AI models: Safety or helpfulness?

Last week, OpenAI had to revert changes made to its latest AI model, GPT-4o, due to complaints of sycophancy, being insincere and overly agreeable, and validating dangerous ideas and decisions.
CEO Sam Altman has confirmed it willl allow opt-in users to test and give feedback on the models before they launch, and it will also adjust its safety review process to consider “model behavior issues,” like this.
OpenAI will also “communicate updates” they’re making to the models, whether ‘subtle’ or not, and will “commit to blocking launches based on proxy measurements, even when metrics like A/B testing look good.”

🧰 The Tools, Templates & Playbook for Your AI Consultancy

The AI consulting market is about to grow by a factor of 8X – from $6.9B to $54.7B in 2032.

But how does an AI enthusiast become an AI consultant?

How well you answer that question makes the difference between just “having AI ideas” and being handsomely compensated for your contribution to an organization’s AI transformation.

Thankfully, you don’t have to go it alone – our friends at Innovating with AI have welcomed 700 new students into The AI Consultancy Project, their new program that trains you to build a business as an AI consultant

Some of the highlights current students are excited about:

The tools and frameworks to find clients and deliver top-notch services
A 6-month plan to build a 6-figure AI consulting business
Students getting their first AI client in as little as 3 days

And as a reader of The AI Report, you get early access to the next enrollment cycle.

Prompt Inspiration

After typing this prompt, you will get some effective networking strategies that will enable you to create valuable connections and grow your professional network.

What are some effective networking strategies for building professional connections?

P.S. Use the Prompt Engineer GPT by The AI Report to 10x your prompts.

STARTUPS

Name: Observe AI
Value: $1.3B
Funding raised: $222M (Series C)

Observe is an AI-powered contact center that uses AI algorithms to analyze customer interactions and provide real-time insights and feedback on customer calls. Observe’s suite of AI tools can use historical analysis to develop and personalize live on-call guidance and coaching.

PODCASTS

Balancing AI productivity and human intelligence

This podcast explores the question: Are we trading our brains for convenience with AI? Data strategist, Sumit Gupta, who has built data strategies for Notion, Snowflake, and Dropbox, discusses how AI is both supercharging productivity and quietly eroding our skills.

QUICK HITS

We read your emails, comments, and poll replies daily.

Hit reply and tell us what you want more of!

Got a friend who needs to learn more about AI?

Sign them up for The AI Report here.

Until next time, Martin, Liam, and Amanda.

P.S. Unsubscribe if you don’t want us in your inbox anymore.

Anthropic joins Apple for coding revolution?

⚠️ Google’s AI safety regression?

Anthropic joins Apple for coding revolution?

🚨 Our Report

🔓 Key Points

🔐 Relevance

🎥 Guidde - Create how-to video guides quick and easy with AI

🚨 Our Report

🔓 Key Points

🔐 Relevance

Which should come first with AI models: Safety or helpfulness?

🧰 The Tools, Templates & Playbook for Your AI Consultancy

STARTUPS

PODCASTS

QUICK HITS

We read your emails, comments, and poll replies daily.

What did you think of this edition?