Menu

ChatGPT Learns to Listen, Speak, Recognize Images, and Search Online

By
Photo: ChatGPT Learns to Listen, Speak, Recognize Images, and Search Online. Source: Collage The Gaze \ by Leonid Lukashenko
Photo: ChatGPT Learns to Listen, Speak, Recognize Images, and Search Online. Source: Collage The Gaze \ by Leonid Lukashenko

The Chatbot ChatGPT has received a significant update. It now incorporates a neural network that understands voice commands, responds using synthesized speech, and can recognize the content of images. According to an OpenAI press release, these new features offer a more intuitive form of interaction, allowing users to engage in voice conversations or visually convey information to ChatGPT.

"For instance, while traveling, you can take a photo of a landmark and discuss its interesting aspects. When you're at home, you can photograph the contents of your refrigerator to plan dinner (and ask additional questions for recipes). After dinner, you can help your child solve a math problem: take a photo, annotate it, and ask for hints," describes OpenAI regarding the new interaction possibilities with ChatGPT.

Additionally, ChatGPT developers have enabled it to access the internet and provide links to the sources from which it derives information, allowing users to fact-check the chatbot's responses. This feature is currently available only through a paid subscription.

Back in March, the company announced that GPT-4 would operate based on multimodal models. This means the algorithm possesses a multimodal dictionary where some tokens correspond to text processing, while others handle images, audio, and more.

The voice capabilities were also made possible by using a new model. It requires only a short audio sample to generate a voice similar to a human's. Furthermore, OpenAI utilizes its Whisper algorithm to transcribe spoken words into text.

OpenAI acknowledges that these new capabilities come with potential risks. For example, a system that can create voices could be exploited by malicious actors for impersonation and fraud. Therefore, the company currently restricts the use of this technology to voice chats only.

Additionally, testing teams have scrutinized how the new algorithm interacts with images, paying special attention to photos that may contain misinformation or extremist messages.

You can see the functionality and capabilities of the updated ChatGPT in a video by Business Today.

Recommended

Latest news

US Warns Apple and Google to Remove TikTok from App Stores on 19 January

12.16.2024 16:22
Life

The Best Christmas Trees and Markets in Europe

12.14.2024 09:05
Economics

Cryptocurrency Market: Greed Above All

12.13.2024 15:30
Culture

Christmas Is All Around You

12.13.2024 13:07
Technology

Latest Gaming Releases of 2024

12.12.2024 16:05

Similar articles

We use cookies to personalize content and ads, to provide social media features and to analyze our traffic. We also share information about your use of our site with our social media, advertising and analytics partners who may combine it with other information that you've provided to them. Cookie Policy

Outdated Browser
Для комфортної роботи в Мережі потрібен сучасний браузер. Тут можна знайти останні версії.
Outdated Browser
Цей сайт призначений для комп'ютерів, але
ви можете вільно користуватися ним.
67.15%
людей використовує
цей браузер
Google Chrome
Доступно для
  • Windows
  • Mac OS
  • Linux
9.6%
людей використовує
цей браузер
Mozilla Firefox
Доступно для
  • Windows
  • Mac OS
  • Linux
4.5%
людей використовує
цей браузер
Microsoft Edge
Доступно для
  • Windows
  • Mac OS
3.15%
людей використовує
цей браузер
Доступно для
  • Windows
  • Mac OS
  • Linux