ChatGPT Voice Conversations Are Scarily Good...
Education
Introduction
In case you missed it, ChatGPT came out with a new voice feature a couple of months ago. I had no idea this existed until very recently. There's this new icon in the bottom right of the ChatGPT app; you tap it, and you can start talking to it with your voice. It will generate responses and talk back to you. So far, my experience has been mind-blowing. It has shifted my entire perception of what an AI assistant is capable of. This sounds dramatic, but as soon as I show you, you'll understand.
What makes this all possible are large language models (LLMs). These machine learning algorithms are trained on massive amounts of human text data, and now with a voice behind them, it's quite remarkable.
Interaction 1
User: How are you?
ChatGPT: I'm doing well, thanks for asking. How about you?
User: I'm good. I've been thinking about tech lately.
ChatGPT: Tech is always an interesting topic. What's been on your mind about it?
User: I've been thinking about AI assistants and large language models.
ChatGPT: That's a fascinating topic. What's specifically been on your mind about AI assistants and large language models?
User: It's crazy how fast the tech is moving. I wouldn't have thought this was possible even a couple of years ago.
ChatGPT: Absolutely, the pace of technological advancement can be mind-boggling. How do you think this tech is going to evolve in the next five years?
Initial Impressions
- Voice Quality: The voice sounds very natural, using different rhythms, intonations, and even emotion.
- Structured Responses: It provides follow-up questions to better understand your query, mimicking human interaction.
- Response Time: The conversation flows quickly and smoothly.
Interaction 2
For this scenario, I'm taking you back about two years when I quit my job to do YouTube full-time.
User: Hey, I want to talk about maybe quitting my job.
ChatGPT: Sure, I'm here to listen. What's making you consider quitting your job?
User: I've been uploading YouTube videos, and they started gaining traction. I'm thinking about leaving my full-time job for YouTube.
ChatGPT: It's great your YouTube channel is gaining traction. Have you thought about how you'd manage financially if you quit your job?
User: I have enough savings to last a couple of years if YouTube doesn't pan out. It's worth a shot, and I'd regret not giving it a try.
ChatGPT: It's smart to have a financial cushion. It sounds like you've thought this through and are willing to take the risk for something you're passionate about.
It's incredible to have a machine that feels like it's listening, not just hearing you. ChatGPT achieves this in a way that makes it feel very human.
Comparing ChatGPT to Google Gemini (Bard)
Google has also come out with Gemini, formerly known as Bard. Let's see how it compares.
User: Hey Google, where should I take my next vacation?
**Google Assistant:** Choosing the perfect vacation destination depends on your interests and travel style. Here are some ideas:
- Relax on the beach
- Adventure hiking
- Exploring a new city
User: I'm getting my scuba Open Water certification, so somewhere tropical with diving, good food, and culture would be nice.
Google Assistant: How about Thailand? You can dive, enjoy Thai cuisine, experience vibrant culture, and hike through jungles or explore stunning islands.
Differences
- Visual Appeal: Google's interface is more colorful with pictures and formatting.
- Voice Quality: The voice of Google Gemini feels more robotic compared to ChatGPT.
- Interaction Depth: ChatGPT feels more tailored, asking follow-up questions to understand the user better.
Specific Requests
User: Hey Google, create a 7-day itinerary for Iceland from September 20th to the 28th.
Google Assistant: Here is a detailed itinerary with travel tips, flight costs, food recommendations, and must-see attractions.
ChatGPT: The cost of flights to Iceland varies. Here’s your detailed itinerary with must-see attractions, food recommendations, and travel tips.
Google Extensions
Google also supports additional integrations called extensions for various functionalities, such as:
- Finding Documents: "Hey Google, find my Iceland planning document."
- YouTube Integration: "Hey Google, find me a YouTube video about buying AirPods."
ChatGPT can switch languages within the same conversation, adding another layer of convenience.
User: What should I eat?
ChatGPT: What are you in the mood for? Tacos? Burritos?
User: Can you say that in English?
ChatGPT: Sure. Would you like to go to a nearby Mexican restaurant or make them at home?
Final Thoughts
The age of truly smart AI assistants might be here. It's crucial to consider which companies we trust with our information, given that every interaction potentially leaves a paper trail. ChatGPT, in particular, has impressed me greatly in its conversational abilities. After trying it out, definitely come back and share your thoughts!
Keywords
- ChatGPT
- Voice Feature
- Large Language Models (LLMs)
- AI Assistants
- Google Gemini (Bard)
- Natural Language Processing
- Conversational AI
FAQ
Q: What is the new voice feature in ChatGPT? A: The new voice feature allows you to converse with ChatGPT using your voice, and it responds in kind.
Q: How does the ChatGPT voice quality compare to Google Gemini? A: ChatGPT's voice sounds more natural and human-like, whereas Google Gemini's voice feels more robotic.
Q: Can ChatGPT understand and speak multiple languages? A: Yes, ChatGPT can switch between multiple languages within the same conversation.
Q: Are there any visual differences between ChatGPT and Google Gemini? A: Yes, Google Gemini offers a more visually appealing interface with pictures and formatted text, while ChatGPT primarily provides text responses.
Q: Does Google Assistant have additional integrations? A: Yes, Google Assistant supports extensions for integrating Google Workspace, YouTube, and more.
Feel free to explore these new features and share your experiences!