In the last few weeks, OpenAI has been busy adding new features to ChatGPT. Meanwhile, Google’s trying to keep up.
OpenAI has just released DALL-E 3, the latest version of its text-to-image AI model.
According to the company, DALL-E 3 is far better at analyzing requests and understanding context than previous versions.
Two of the biggest concerns surrounding AI image generation are safety and copyright. OpenAI has addressed them by programming DALL-E to decline requests to create images of public figures or images in the style of living artists.
Artists can also prevent OpenAI from using their art to train its image generation models. This follows multiple lawsuits against the company by over a dozen authors for “flagrant and harmful” copyright infringements.
DALL-E also comes with a cool, new feature: It integrates with ChatGPT. Instead of worrying about creating the best prompt, users can now use ChatGPT’s help by telling the chatbot what they’re looking for. ChatGPT will then generate a detailed prompt optimized for the image model.
Speaking of the chatbot, it can now search the web in real time. Until this major change, ChatGPT’s knowledge was cut off at September 2021. Real-time information was one advantage Google’s Bard had over ChatGPT, but OpenAI has now closed that gap with a new feature called Browse with Bing.
And that’s not all: ChatGPT is also rolling out new voice and image capabilities, which will allow users to incorporate the chatbot more into their daily lives.
Users will soon be able to chat using images, voice prompts, or both, with requests like “Look at my fridge and tell me what recipes I can make” or “Tell me about the artist who designed this monument.”
What’s more, ChatGPT can now respond with voice notes: OpenAI partnered with voice actors to generate human-like audio.
As for Google: when the company first released Bard, the consensus was that it wasn’t as good as ChatGPT, so Google has been working to change that.
Bard is now integrated into Google apps and services, including YouTube, Gmail, and Google Workspace apps, which increases its usage potential.
Google will also soon release Gemini, its latest and greatest LLM, which is expected to power Bard.
One of Gemini’s main selling points was that it would be multimodal, but OpenAI beat Google to it by adding new multimodal features to ChatGPT. Google still has one major advantage: its vast data sets.
In the meantime, we’ll keep watch as the battle between the AI players continues.