Home > Tech

OpenAI brings video to ChatGPT Advanced Voice Mode

AVM finally gets vision capabilities

By Cecily Mauran on December 12, 2024

ChatGPT now has vision capabilities for Advanced Voice Mode Credit: Jaque Silva / NurPhoto / Getty Images

ChatGPT's Advanced Voice Mode now has video and screenshare capabilities.

The feature was last May with the release of GPT-4o, but only the audio modality has been live. Now users can chat with ChatGPT using a phone camera and the model will "see" what you see.

SEE ALSO: OpenAI's Sora is officially here

In the livestream, CPO Kevin Weil and other OpenAI team members demoed ChatGPT assisting with how to make pour-over coffee. By pointing the camera at the action, AVM demonstrated that it understood the principle of the coffee maker and walked the team through the brewing of their beverage. The team also showed how ChatGPT supports screensharing by understanding an open message on a phone with Weil wearing a Santa beard.

Mashable Light Speed

Want more out-of-this world tech, space and science stories?

Sign up for Mashable's weekly Light Speed newsletter.

By signing up you agree to our Terms of Use and Privacy Policy.

Thanks for signing up!

The long-awaited announcement comes a day after Google unveiled the next generation of its flagship model, Gemini 2.0. The new Gemini 2.0 can also process visual and audio inputs and has more agentic capabilities, meaning it can perform multi-step tasks on the user's behalf. Gemini 2.0's agent features currently exist as a research prototype under three different names: Project Astra for a universal AI assistant, Project Mariner for specific AI tasks, and Project Jules for developers.

Not to be outdone, OpenAI's demo showcased how ChatGPT's vision modality accurately identified objects — and was even interruptible. And yes, part of this included a Santa voice option in Voice Mode, complete with a deep, jolly voice and lots of "ho-ho-hos." You can chat with OpenAI's version of Santa by tapping the snowflake icon in ChatGPT. No word yet on whether the real Santa Claus contributed his voice for AI training or OpenAI used his voice without prior consent.

Oddly, when selecting the Santa voice in the ChatGPT app, the user is warned that the voice is only for people 13 and older.

Tweet may have been deleted

Starting today, video and screenshare are available to ChatGPT Plus and Pro users, with Enterprise and Edu availability coming in Jan.

Topics ChatGPT OpenAI

Cecily Mauran

Cecily is a tech reporter at Mashable who covers AI, Apple, and emerging tech trends. Before getting her master's degree at Columbia Journalism School, she spent several years working with startups and social impact businesses for Unreasonable Group and B Lab. Before that, she co-founded a startup consulting business for emerging entrepreneurial hubs in South America, Europe, and Asia. You can find her on Twitter at @cecily_mauran.

Recommended For You

ChatGPT’s Advanced Voice Mode could get a new 'Live Camera' feature

Sleuths have spotted "Live camera" references in the ChatGPT code.

11/19/2024

By Cecily Mauran

Website of ChatGPT GPT-4o seen in an iPhone.

Website of ChatGPT GPT-4o seen in an iPhone.

OpenAI: No plans for ads in ChatGPT Search

No ads... for now? More details from OpenAI on how ChatGPT Search works.

10/31/2024

By Cecily Mauran

Why is ChatGPT's Santa Mode only for ages 13 and up?

So... who is this for exactly?

12/12/2024

By Cecily Mauran

Santa Voice mode in chatgpt on a smartphone

Santa Voice mode in chatgpt on a smartphone

OpenAI announces a ChatGPT organizing system called Projects

A mid-tier tool for the middle of "12 Days of OpenAI."

12/13/2024

By Cecily Mauran

ChatGPT was messaging users first — but OpenAI said this wasn’t supposed to happen

A freaky ChatGPT conversation went viral for being a little too proactive.

09/17/2024

By Cecily Mauran

Trending on Mashable

NYT Connections hints today: Clues, answers for December 15, 2024

Everything you need to solve 'Connections' #553.

12/15/2024

By Mashable Team

A phone displaying the New York Times game 'Connections.'

A phone displaying the New York Times game 'Connections.'

Wordle today: Answer, hints for December 15

Here are some tips and tricks to help you find the answer to "Wordle" #1275.

12/15/2024

By Mashable Team

a phone displaying Wordle

a phone displaying Wordle

Spacecraft makes daring approach of metal object in Earth's orbit

A "historic approach."

12/12/2024

By Mark Kaufman

A view of the large rocket debris captured by the Astroscale ADRAS-J spacecraft

A view of the large rocket debris captured by the Astroscale ADRAS-J spacecraft

NYT Strands hints, answers for December 15

Every hint, nudge and outright answer you need to complete today's NYT Strands puzzle.

12/15/2024

By Mashable Team

A game being played on a smartphone.

A game being played on a smartphone.

'SNL' cold open questions why the internet finds Luigi Mangione so hot

Emil Wakim, this is your moment.

12 hours ago

By Chase DiBenedetto

Sarah Sherman dressed up as Nancy Grace in front of a blue screen.

Sarah Sherman dressed up as Nancy Grace in front of a blue screen.

The biggest stories of the day delivered to your inbox.

This newsletter may contain advertising, deals, or affiliate links. Subscribing to a newsletter indicates your consent to our Terms of Use and Privacy Policy. You may unsubscribe from the newsletters at any time.

Thanks for signing up. See you at your inbox!