Google has announced the launch of its most advanced AI model, Gemini, designed to compete with OpenAI's GPT models. Gemini is a multimodal model that can process and understand various types of information including text, audio, images, video, and programming code. The model is expected to be integrated into Google's consumer apps, Android smartphones, and widely used products like Google Search and Chrome.
Gemini has been developed in response to the threat posed by OpenAI and other competitors. The project was named Gemini to symbolize the twinning of Google's two major AI labs. The model comes in three versions: Pro, Nano, and Ultra. The Ultra version is currently being tested externally and is expected to be released publicly in early 2024.
Gemini has reportedly outperformed rival AI models in reading comprehension, mathematical ability, and reasoning skills. However, some experts have expressed skepticism, arguing that Gemini's capabilities are not substantially different from previous models and may signal peak AI hype.
Despite these criticisms, Gemini Ultra is the first AI model to outperform human experts on multitask language understanding. Gemini Pro will be available to developers and Google enterprise customers starting on Dec. 13. Google is also in discussions with the UK government about testing Gemini through the AI Safety Institute.