Apple's new AI model outperforms GPT-4

PLUS: OpenAI adds image editing

Welcome, AI enthusiasts.

Apple researchers just revealed a new AI model that can ‘see’ and understand screen context — and they say it can ‘substantially outperform‘ GPT-4 on benchmarks.

If research papers are any indications of future launches, we’re gearing up for a wild ride at Apple WWDC 2024. Let’s get into it.

In today’s AI rundown:

  • Apple’s ReALM ‘outperforms‘ GPT-4

  • OpenAI adds image editing to DALL-E 3

  • Generate green screen product mockups with AI

  • 6 new AI tools & 4 new AI jobs

  • More AI & tech news

Read time: 3 minutes

LATEST DEVELOPMENTS

APPLE

The Rundown: In a new research paper, Apple researchers introduced ReALM, a new AI system that can understand on-screen tasks, conversational context, and background processes.

The details:

  • ReALM uses a new approach of converting screen info to text — allowing it to bypass bulky image recognition parameters for more efficient on-device AI.

  • The model takes into account both what's on the user's screen and what tasks are active.

  • According to the paper, Apple's larger ReALM models substantially outperformed GPT-4, despite having fewer parameters.

Example use case: If scrolling through a website and you want to call a business, a user could tell Siri to “call the business“, and Siri would be able to “see“ the phone number on the website and call it directly.

Why it matters: ReALM is a big step forward in making voice assistants more context-aware. By understanding on-screen info and additional context, the next Siri update could provide a more seamless and hands-free user experience.

SPONSORED BY MASTERWORKS

The Rundown: You’ve probably heard the Masterworks pitch. Ya know, the platform where you can invest in shares of blue-chip art like Banksy and Basquiat. But how can you know it's legit?

Here are the facts:

  • All investable offerings are SEC-qualified

  • Over $55 million total proceeds distributed back to investors

  • Median returns of 14.6%, 16.4%, and 17.6%, among their 21 exits

Just use this exclusive link to unlock VIP access to limited-time offerings.

Sponsor disclosure: Past performance is not indicative of future returns. Investing involves risk. See Important disclosures at www.masterworks.com/cd

OPENAI DALL-E

The Rundown: OpenAI has introduced a new feature that allows users to edit images generated by DALL-E 3 directly within ChatGPT, providing a more streamlined way to customize AI-generated images.

The details:

  • The DALL-E editor enables users to select specific areas of an image and prompt changes.

  • Users can add, remove, or modify objects and characteristics within a selected region of an AI-generated image.

  • The editor is accessible via the web interface and the ChatGPT mobile app, with slight variations in the editing process between platforms.

Why it matters: Integrating image editing capabilities directly into DALL-E 3 is a big step forward in expanding the image-generation possibilities using ChatGPT. However, with Midjourney having similar features for a while, it looks like OpenAI is finally the one playing catch-up.

AI TRAINING

The Rundown: This Midjourney or DALL-E 3 prompt can create stunningly realistic computer, phone, or product mockups, complete with a green screen to customize for your business or product easily.

Step-by-step:

  1. Open DALL-E 3 or Midjourney and type /imagine.

  2. Prompt to generate an image of someone holding/using a device in front of a solid green background. For example: “Over the shoulder shot of a person in front of an entirely <color> computer screen”

  3. Customize and tweak the prompt as necessary to get the image just right — then export and open in an editing app (Photoshop, Canva, etc.) to replace the colored screen background with your own imagery.

  4. Note — be mindful of reflections, such as green screens bleeding into other aspects of the image.

NEW TOOLS & JOBS

  • 🐇 CodeRabbit- Automated code review tool with contextual feedback

  • 🖼️ Living Images- Optimize your website images with generative A/B testing

  • 📊 RAFA- AI Agents for personalized investment insights

  • 🗃️ fynk- Create, review, track, sign, and analyze contracts

  • 🎨 DoDoBoo- Transforms doodles into AI-generated art

  • 👩‍💻 IntelSwift- Customer service automation with AI technology

  • 💰 Anthropic - Pricing Lead

  • 💻 Fiddler AI - Staff Backend Engineer

  • 🧾 OpenAI - Billing Operations Manager

  • 🛡️ Shield AI - Principal Software Engineer, Architecture and Integration

QUICK HITS

OpenAI is rolling out a new ChatGPT feature that allows new users to start using ChatGPT instantly without needing to sign up.

Google for Developers sent an email to all Gemini API Developers that they will update their terms of service to pay-as-you-go pricing for the Gemini API starting May 2nd.

Stanford Medicine researchers devise SyntheMol, a new AI model that develops potential new drugs for antibiotic-resistant bacteria.

Samsung is redefining its voice assistant Bixby with generative AI to enable more natural conversation and support across its ecosystem of products.

Perplexity plans to start selling native ads in the form of brand-influenced related questions.

Google AI just introduced AutoBNN, an open-source machine learning framework that combines the interpretability of traditional Bayesian approaches with the scalability of neural networks.

MultiOn released its Agent API in public beta, allowing developers to embed AI agents in applications to automate tasks and workflows on the web.

THAT’S A WRAP

SPONSOR US

Get your product in front of over 500k+ AI enthusiasts

Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world. Get in touch today.

FEEDBACK

How would you rate today's newsletter?

Vote below to help us improve the newsletter for you.

Login or Subscribe to participate in polls.

If you have specific feedback or anything interesting you’d like to share, please let us know by replying to this email.

Join the conversation

or to participate.