Elon Musk’s Grok AI can now ‘understand’ images

Elon Musk’s AI chatbot can now “understand” images, including information-packed charts and graphs. Sorry, doesn’t everyone use the platform formerly known as Twitter for multidisciplinary research and optimizing their workflows?

Introduced as Grok-1.5V (or Grok 1.5 “Vision”), xAI’s “first-generation multimodal model,” the bot will be able to not only respond to uploaded images and screenshots, but also reason through complex documents, scientific diagrams, graphs and photographs, the company says. Additionally, Grok-1.5V will gain “real-world spatial understanding” to better interpret the physical world depicted in images uploaded by its users.

“Advancing both our multimodal understanding and our generation capabilities is an important step in creating beneficial AGI capable of understanding the universe,” the company wrote in its statement. “In the coming months, we plan to make significant improvements to both capabilities, in various modalities such as images, audio and video.”

Example use cases include translating a diagram into Python code, turning a child’s drawing into a bedtime story, identifying the largest object in a group, and telling a driver whether there is enough space to get around an obstacle.

Grok-1.5V is being released alongside xAI’s RealWorldQA, a dataset of images and prompts designed to test other GenAI models against Grok’s real-world reasoning.

But competition is the least of Grok’s worries. Despite xAI’s continued investment, Grok has yet to win over early adopters and staff: a new report alleges that the company’s own developers are struggling to use the slow xAI API. The same report, published by Fortune this week, highlighted X employees’ concerns over Musk’s suggestion that Grok write paid users’ posts for them, despite warnings from developers and staff. Last week, Grok was criticized for generating fake news headlines about an alternate reality in which Iran had attacked Tel Aviv with a military arsenal, and it was not the first time.

As GenAI chatbots hallucinate realities and generate fake news, Grok’s gaffe is indicative of another site-wide problem. The bot, which is Musk’s answer to ChatGPT, fits into a platform that has slowly reduced its defenses against AI gone wrong. Combined with .

Grok-1.5V will soon be available to early testers and select users.
