Microsoft slams AI coding models

💡 Google AI to fix electricity shortages?

Friday’s AI Report

• 1. 🧩 Microsoft slams AI coding models
• 2. 📈 Boost outbound sales with Artisan
• 3. 💡 Google AI to fix electricity shortages?
• 4. 🧠 ChatGPT gets personalized memory
• 5. 💼 Become an AI consultant with Innovating with AI
• 6. ⚙️ Trending AI Tools
• 7. 🏗️ Practical AI Applications
• 8. 📑 Recommended Resources

Read Time: 5 minutes

NEW AI Report podcast episode: Liam talks to co-founder of Mindstream (the daily AI newsletter that was acquired by HubSpot) and HubSpot’s Head of Brand, Adam Biddlecombe, about building trust, at scale, in the era of AI.

Refer your friends and unlock rewards. Scroll to the bottom to find out more!

Microsoft slams AI coding models

🚨 Our Report 

Microsoft’s R&D division has released a new study showing that even top AI models—from the likes of OpenAI and Anthropic, for example—still struggle to debug minor software issues.

🔓 Key Points

  • The study tested 9 AI models (including Anthropic’s Claude 3.7 Sonnet and OpenAI’s o1 and o3-mini models) using the development benchmark, SWE-bench Lite, which gave each model 300 software debugging tasks to fix.

  • All the models failed to fix half the debugging tasks: Claude 3.7 Sonnet—although the highest scorer—successfully fixed just 48.4%, OpenAI’s o1 fixed 30.2% and its o3-mini fixed 22.1%.

  • The study revealed that AI models still struggle to work out which tool to use to fix certain types of bugs, and there’s a lack of training data that represents the iterative, step-by-step human bug-fixing process.

🔐 Relevance 

This comes as more and more tech companies are turning to AI-powered code generators to reduce headcount and improve efficiency and productivity (including Google, which recently announced that 25% of their new code is generated by AI), but serves as a stark reminder that, although AI is improving, it still has major limitations and can’t match human programmers—a sentiment which Microsoft co-founder, Bill Gates, Replit’s CEO, Amjad Masad, and IBM’s CEO, Arvind Krishna have all echoed, believing that “programming as a profession is here to stay”

Hire Ava, the Industry-Leading AI BDR

Your BDR team is wasting time on things AI can automate.

Our AI BDR Ava automates your entire outbound demand generation so you can get leads delivered to your inbox on autopilot.

She operates within the Artisan platform, which consolidates every tool you need for outbound:

  • 300M+ High-Quality B2B Prospects, including E-Commerce and Local Business Leads

  • Automated Lead Enrichment With 10+ Data Sources

  • Full Email Deliverability Management

  • Multi-Channel Outreach Across Email & LinkedIn

  • Human-Level Personalization

🚨 Our Report

Google and PJM Interconnection—North America’s largest electricity grid operator—are working together to develop and deploy a set of AI tools that will help manage and optimize power generation to the PJM electric grid, which serves over 67M people, and other electricity grid operators, to help resolve the looming electricity shortage crisis and mounting pressure to use renewable energy sources to hit net zero targets.

🔓 Key Points

  • This is Google’s “biggest step to using AI for a stronger, more resilient electricity system” and will help PJM connect different energy sources to its grid quickly, making electricity more reliable and affordable.

  • Google is also partnering with other, global grid operators—like the non-profit operator, AES, for example—to use its AI tools to forecast electricity supply and demand in advance, more accurately.

  • AES is piloting an AI system that accurately predicts electricity load up to one week in advance, which has reduced forecasting errors by 20%, enabling it to “make better decisions about where and when to send electricity.”

🔐 Relevance

This comes as energy experts are desperately calling for smart systems that can manage electricity load without compromising reliability, price, and climate goals, as electricity consumption is expected to surge in the coming years, largely thanks to increased demand for electricity to fuel electric vehicles and AI data centers.

  • OpenAI has introduced a new ChatGPT memory feature which will allow the chatbot to answer questions using a user's “past chats” which will enable it to “provide more personalized responses.”

  • The feature will make conversations more fluid, relevant, and conversational and will roll out to all Pro and Plus subscribers, except those in the UK, EU, Iceland, Liechtenstein, Norway, and Switzerland.

  • Although OpenAI is committed to launching the “reference saved memories” feature in those countries, it still needs to complete additional external reviews to comply with local regulations.

Want to build a 6-figure business as an AI consultant?

The AI consulting market is about to grow by a factor of 8X – from $6.9B to $54.7B in 2032.

But how does an AI enthusiast become an AI consultant?

How well you answer that question makes the difference between just “having AI ideas” and being handsomely compensated for your contribution to an organization’s AI transformation. 

Thankfully, you don’t have to go it alone – our friends at Innovating with AI have welcomed 300 new students into The AI Consultancy Project, their new program that trains you to build a business as an AI consultant.

Some of the highlights current students are excited about:

  • The tools and frameworks to find clients and deliver top-notch services

  • A 6-month plan to build a 6-figure AI consulting business

  • Students getting their first AI client in as little as 3 days

And as a reader of The AI Report, you get early access to the next enrollment cycle.

Prompt Inspiration

After typing this prompt, you will get a user testing and quality assurance plan to ensure your virtual shopping assistant is effective.

Generate a plan for conducting user testing and quality assurance to ensure the effectiveness of our virtual shopping assistant

P.S. Use the Prompt Engineer GPT by The AI Report to 10x your prompts.

STARTUPS

Name: PlanGrid
Funding raised: $69M

PlanGrid (which was acquired by Autodesk) is an AI-driven, cloud-based platform that allows construction contractors and owners to collaborate and manage blueprints, specs, photos, RFIs, field reports, and punch lists and has been used on over 1M projects in 84 countries, securing funding from Sequoia Capital, Tenaya Capital, and other top firms.

PODCASTS

From 0 to HubSpot: Scaling a Newsletter to 150K Subscribers

In this episode of The AI Report podcast, Liam sits down with Adam Biddlecombe, co-founder of Mindstream—the AI newsletter that scaled to 150K+ subscribers and was acquired by HubSpot in just 17 months—and Hubspot’s Head of Brand to discuss how he builds trust at scale in the AI era.

QUICK HITS

We read your emails, comments, and poll replies daily.

Hit reply and tell us what you want more of!

Got a friend who needs to learn more about AI?

Sign them up for The AI Report here.

Until next time, Martin, Liam, and Amanda.

P.S. Unsubscribe if you don’t want us in your inbox anymore.

What did you think of this edition?

Login or Subscribe to participate in polls.