- The Rundown AI
- Posts
- Exclusive: Microsoft AI launches Copilot Vision
Exclusive: Microsoft AI launches Copilot Vision
PLUS: Infinite memory, AI companions, AI agents, and more
Welcome, AI enthusiasts.
We have an exclusive for you today.
On Thursday, Microsoft launched Copilot Vision in its Edge browser — a new AI that can see your screen and talk with you in real-time as you navigate the internet.
Long story short: it’s one of the most insane products we’ve tried this year.
So we partnered up with Microsoft and Mustafa Suleyman (CEO of Microsoft AI) to chat about his unique insights, infinite memory, AI companions, AI agents, and more.
In today’s AI rundown:
Copilot Vision: A new era of human-computer interaction
How Microsoft AI is differentiating from OpenAI
User data privacy with Copilot Vision
Living amongst a co-intelligence in 10+ years
Memory, learning, gaming, and AI agents
EXCLUSIVE Q&A MUSTAFA SULEYMAN
MICROSOFT COPILOT
Image credits: Kiki Wu / The Rundown
The Rundown: Microsoft just launched its next-generation AI assistant, Copilot Vision, which can see everything on your screen, and speak back to you in real time on its Edge browser, marking a fundamental shift in how we interact with computers.
Cheung: "Can you give us a quick rundown of everything released and why this is an important moment for AI?"
Suleyman: “We’re launching Vision… and it’s really a magical experience that is quite different than any kind of AI or even general kind of computer interaction experiences that we’ve seen before.”
Suleyman added: "We are on a mission to create a true AI companion. And to me, an AI companion is one that can hear what you hear and see what you see and live life essentially alongside you."
Cheung: “When is Copilot Vision being rolled out?"
Suleyman: "It’s going to be available in Copilot Labs to paying Copilot subscribers, who will get special access to trial it, experiment with it, and give us feedback… Sometime in the early part of next year is when it will go into GA [general availability].”
Why it matters: Copilot Vision isn't just another AI feature—it's Microsoft's attempt to fundamentally transform how we interact with computers. By replacing traditional clicking and typing with voice and real-time screen understanding, Microsoft is betting that the future AI will be more like talking to a friend than operating a machine.
COPILOT VS CHATGPT
Image credits: Kiki Wu / The Rundown
The Rundown: Microsoft is emphasizing its focus on creating a true AI companion that feels more personal and interactive, with Copilot Vision featuring emotional intelligence, Edge browser integration, and the ability to push back like a real friend.
Cheung: "Microsoft is a major investor in OpenAI, and ChatGPT and has a yet-to-be-released version of their vision-like product. How is Microsoft AI differentiating itself from other competitors?"
Suleyman: "The main thing is that we're really leaning into the idea of it being a proper companion. So just the fluency of our voice and how smooth it is, how fast it is, it's very interruptible, very easy to talk to."
Suleyman added: "Putting vision inside of the browser is the next step. Edge having it [Vision], and being there with you all the time able to watch, learn, and talk to you is a really big differentiator."
Cheung: "Something that really stood out to me as well, talking with Copilot Vision, was how personable it really was. It even gave me like some sass at some points."
Suleyman: "When it occasionally pushes back on you, that's a profound moment because a true friend would do that. No one wants a sycophantic AI that just always mirrors you and always obeys you. That's not going to be interesting for very long."
Suleyman added: “If you’re really dour and sad and you slow down the pace of your words, it will bring an appropriate vibe for that. But if you’re super fast and excited and enthusiastic, it will mirror that energy.“
Why it matters: Microsoft's approach focuses on creating an AI that feels more like a friend than a tool. By building an AI that can match your energy and emotion, push back with sass occasionally, and live inside your browser, Microsoft isn't just creating another chatbot—it's reimagining AI as a digital companion that truly understands you.
PRIVACY
Image credits: Kiki Wu / The Rundown
The Rundown: Microsoft is addressing privacy concerns around Copilot Vision by implementing session-based data deletion, with plans to develop more sophisticated privacy infrastructure as the technology continues to evolve.
Cheung: "With any sort of powerful AI application, such as Copilot Vision, it needs copious amounts of data to kind of really be accurate and helpful. But with this amount of personal data, there's always a new set of privacy concerns for users. How is Microsoft tackling this right now? How do users know that their data is safe?"
Suleyman: "We're keeping a very open mind on this. Some users will want to keep their ephemeral session. So at the moment, Copilot Vision throws away the contents of what it has seen at the end of the session."
Suleyman added: "If we are to [add memory], it's going to need a new privacy and security infrastructure to be able to store that kind of content because it's going to be very rich. It's going to describe in immense detail, not just moments in time, but strings of activity over hours and days."
Why it matters: Microsoft is taking a privacy-first approach with Copilot Vision—defaulting to session-based data deletion after every chat. This strategy lets Microsoft test Vision’s capabilities with users now, while building the secure infrastructure needed for persistent memory features in early 2025.
AI COMPANIONS
Image credits: Kiki Wu / The Rundown
The Rundown: Suleyman predicts a future where AI companions become deeply integrated into our daily lives, understanding our emotions, preferences, and daily needs — potentially becoming "a new digital species."
Cheung: "Looking forward 10 years from now, what do you think these personal AI assistants will have in our lives?"
Suleyman: "I think of it as outsourcing a lot of the mental processing to a very reliable, highly accurate, completely interactive thought partner and companion that is going to help make me much smarter, more productive, feel more supported... it's very, very different to just using a computer in the way that we do today."
Suleyman added: "Your computer, or your AI, your Copilot, is clearly going to understand everything that you're bringing to the table—your emotional state, your intellectual state, what you need to get done that day, your interests, your hobbies, your personal knowledge graph, your family, your dislikes."
Suleyman added: “It’s going to feel…like a new digital species. It is going to feel like a member of the family.“
Why it matters: Suleyman sees AI evolving from basic tools that boost productivity into digital "family members"—understanding emotions, remembering preferences, and living alongside us. In his vision, AI companions won't just assist us with tasks, they'll become a new form of intelligence that experiences life with us.
WHAT’S NEXT
Image credits: Kiki Wu / The Rundown
The Rundown: Suleyman revealed Microsoft AI’s ambitious roadmap for Copilot—expanding beyond today's screen understanding into memory features, learning assistance, gaming integration, and agentic capabilities.
Cheung: "How deeply does Copilot Vision understand you as a user?"
Suleyman: "Memory is the key thing that is coming soon... it's important that it remembers your preferences and is able to reason over them to give you advice based on knowing you."
Cheung: "Something I'm excited for is when AI can guide me through learning new apps by controlling my screen—like when learning Photoshop."
Suleyman: "The future of Copilot Vision is definitely Copilot help, step by step, taking you through troubleshooting when you're trying to fix your computer or you're trying to learn a new piece of software."
Suleyman added: "Imagine having your Copilot talk to you about the worlds that you're building in Minecraft or hang out with you in Call of Duty... It's probably going to feel like an ever-present companion in whatever setting you're in."
Cheung: "Are there any plans for Copilot to become agentic and have the ability to take control of your computer and do regular tasks just like a human could?"
Suleyman: "We're working hard on how it navigates the browser, fills in forms, calls APIs... [with online shopping] it will populate your basket in advance. It will ask you if you want this or that, or it will find a bunch of prices and be like, 'You know, there's a better opportunity.'"
Why it matters: Microsoft's long-term goals for Copilot extend far beyond the screen and browser understanding launched this week. Suleyman hints at a roadmap where Copilot evolves into a full-on autonomous companion—learning your preferences, guiding you through software, and even gaming alongside you.
GO DEEPER
INTERVIEW
In the full interview with Mustafa Suleyman & Rowan Cheung chat about:
Microsoft's vision for personalized AI companions
How Copilot adapts to your emotions in real-time
Why traditional computer interfaces will "get washed away"
Predictions for a future with billions of personalized AI agents
…and much more
Listen on YouTube, Twitter/X, Spotify, or Apple Music.
Reply