- The Rundown AI
- Posts
- ChatGPT's new voice
ChatGPT's new voice
PLUS: Meta's AI-powered 'camerabuds'
Sign Up | Advertise | Tools | AI University
Welcome, AI enthusiasts.
A new future of how we interact with AI is here, and it's speaking loud and clear.
OpenAI just dropped its impressive new ‘GPT-4o’ model, alongside new voice capabilities you have to see (and hear) to believe. Let’s break it down…
In today’s AI rundown:
OpenAI unveils GPT-4o and new voice capabilities
Meta developing AI-powered ‘Camerabuds’
Create a website in seconds with AI
7 new AI tools & 4 new AI jobs
More AI & tech news
Read time: 4 minutes
LATEST DEVELOPMENTS
OPENAI
Image source: OpenAI
The Rundown: OpenAI just unveiled GPT-4o, a new advanced multimodal model that integrates text, vision and audio processing, setting new benchmarks for performance – alongside a slew of new features.
The new model:
GPT-4o provides improved performance across text, vision, audio, coding, and non-English generations, smashing GPT-4T’s performance.
The new model is 50% cheaper to use, has 5x higher rate limits than GPT-4T, and boasts 2x the generation speed of previous models.
The new model was also revealed to be the mysterious ‘im-also-a-good-gpt2-chatbot’ found in the Lmsys Arena last week.
Voice and other upgrades:
New voice capabilities include real-time responses, detecting and responding with emotion, and combining voice with text and vision.
The demo showcased feats like real-time translation, two AI models analyzing a live video, and using voice and vision for tutoring and coding assistance.
OpenAI’s blog also detailed advances like 3D generation, font creation, huge improvements to text generation within images, sound effect synthesis, and more.
OpenAI also announced a new ChatGPT desktop app for macOS with a refreshed UI, integrating directly into computer workflows.
Free for everyone:
GPT-4o, GPTs, and features like memory and data analysis are now available to all users, bringing advanced capabilities to the free tier for the first time.
The GPT-4o model is currently rolling out to all users in ChatGPT and via the API, with the new voice capabilities expected to arrive over the coming weeks.
Why it matters: Real-time voice and multimodal capabilities are shifting AI from a tool, to an intelligence we collaborate, learn, and grow with. Additionally, a whole new group of free users (who might’ve been stuck with a lackluster GPT 3.5) are about to get the biggest upgrade of their lives in the form of GPT-4o.
If you missed it, you can rewatch OpenAI’s full demo here.
TOGETHER WITH AE STUDIO
The Rundown: While 85% of AI projects fail, AE Studio excels in turning ambitious ideas into reality. The secret to success? Superpowers. That, and treating each project like their own startup.
Partner with AE Studio for:
Proven expertise in AI, blockchain, BCI, and more
End-to-end strategic development, from ideation to deployment
Customized solutions tailored to building your unique product
A track record of success with clients like Berkshire Hathaway, Global Shop Solutions, IncomeMax, and Protocol Labs
Share your big idea today and let AE Studio make it a reality.
META
Image source: Midjourney
The Rundown: Meta is reportedly in the early stages of developing AI-powered earphones, known internally as "Camerabuds,” — aiming to compete with OpenAI and Apple as tech giants rush to infuse AI into wearable devices.
The details:
‘Camerabuds’ would map user surroundings, capable of identifying objects and translating foreign languages using built-in cameras.
Meta already has its AI-powered Ray Ban smart glasses, while OpenAI and Apple are also exploring similar AI wearable earbud tech.
Potential challenges include bulkiness, heat generation, and privacy concerns, especially for users with long hair that might obstruct the cameras.
Why it matters: Despite Meta’s shaky track record with hardware ventures, Mark Zuckerberg is investing heavily in a future that he believes includes AI embedded into every device. But will standalone devices like this be able to win over users if and when a fully AI-integrated phone hits the market?
AI TRAINING
The Rundown: Gamma’s free AI website builder helps users create websites in seconds by simply describing what they want – no coding skills required.
Step-by-step:
Head over to Gamma and sign up for free.
On the left-hand menu, select “Sites” and then press “Create Site”
You’ll be directed to a menu where you can either choose a given prompt or describe your website in detail.
Pick your theme, edit it as much as you want with its no-code interface, and hit “Publish”.
Hot tip: Gamma can also generate images for your website!
PRESENTED BY GALILEO
The Rundown: Stop AI hallucinations in real-time with Galileo Protect – a revolutionary firewall that intercepts hallucinations, prompt attacks, security threats, and more.
Attend Galileo’s upcoming webinar to learn:
How the firewall works
The key features of Protect
What makes it the first-of-its-kind
Register now to stop hallucinations and prompt attacks in your AI applications.
NEW TOOLS & JOBS
📊 Osum - Instant, deep market research on any product or business with AI (use code RUNDOWN for 25% off for life)*
🎙️ Voicenote - AI Note-Taker that transcribes your thoughts instantly
💻 Julius AI - GPT-4o is live in Julius to analyze datasets, visualize, solve math and more
🎨 ComfyUI - The most powerful Stable Diffusion GUI
🌍 Globe Explorer - Explore knowledge efficiently and systematically
🤖 Zapier Central Beta - New feature that now supports web browsing and instant chat actions
🛡️ Inspect - Open-source framework for testing the safety of LLMs
🔍 Perplexity AI - Search Engineer
📋 Shield AI - Project Manager
🌍 C3 AI - General Manager, ESG
☁️ Scale AI - Software Engineer, Cloud Infra
*Sponsored listing
QUICK HITS
Google teased a new camera-powered AI feature ahead of its I/O Developer Conference, showing off real-time object recognition and voice command responses.
NVIDIA launched nine new Grace Hopper-powered supercomputers, delivering 200 exaflops of energy-efficient AI processing power to accelerate scientific research.
The UAE’s Technology Innovation Institute released Falcon 2, a family of open-source AI text and vision models that compete with other top open-source competitors.
The U.S. and China are set to begin high-level talks on AI risks this week in Geneva, aiming to prevent disastrous accidents and unintended war amid the ongoing AI arms race.
Microsoft announced a $4.3B investment in France to enhance AI and cloud infrastructure, aiming to train 1M people and support 2,500 AI startups by 2027.
Anthropic’s Claude model is now available in Europe for both individuals and businesses, offering advanced AI capabilities through a web-based version, an iOS app, and a team plan subscription.
NASA appointed David Salvagnini as its first chief AI officer, who will guide the agency’s responsible use of the tech.
THAT’S A WRAP
SPONSOR US
Get your product in front of over 600k+ AI enthusiasts
Our newsletter is read by thousands of tech professionals, investors, engineers, managers, and business owners around the world. Get in touch today.
FEEDBACK
How would you rate today's newsletter?Vote below to help us improve the newsletter for you. |
If you have specific feedback or anything interesting you’d like to share, please let us know by replying to this email.
Reply