Elon Musk's AI company, xAI, recently released Grok-1.5, a large language model (LLM) that was expected to rival OpenAI's GPT-4, the model that powers ChatGPT. However, Grok-1.5 could process only text, not images, leaving it well behind OpenAI's multimodal capabilities.
Now, Musk has launched Grok-1.5 Vision, or Grok-1.5V, an AI chatbot that can process both text and visual information.
Citing results from the new RealWorldQA benchmark, which measures real-world understanding, Musk has revealed that Grok-1.5V outperformed its competitors, including OpenAI's GPT-4, Anthropic's Claude 3, and Google's Gemini Pro, in its ability to process a wide variety of visual information, including documents, diagrams, charts, screenshots, and photographs.
Grok-1.5V can reason through complex text, scientific diagrams, charts, screenshots, and photographs. It can perform tasks such as translating a diagram into code, generating bedtime stories from hand-drawn pictures, pinpointing the largest object in a group of objects, and, in the future, potentially even writing tweets for Premium users of X (formerly known as Twitter). That is Musk's end goal, anyway, much to the discomfort of Grok-1.5 developers, who are reportedly struggling with the slow xAI API, and of X employees, who worry that Grok, which has previously created and promoted dangerous fake stories, could do so again, damaging the brand and harming its users.
Although Grok-1.5V isn't currently available, it is "coming soon" to a handful of early testers and existing users as a preview. Musk remains tight-lipped about an official launch date, stating that xAI will make "significant improvements in both capabilities, across various modalities such as images, audio, and video" before the model becomes publicly available.