Connect with us

Hi, what are you looking for?

Tech News

Elon Musk’s xAI is working on making Grok multimodal

Elon Musk grins in a photo illustration, lifting his arms over his head triumphantly
Illustration by Kristen Radtke / The Verge; Getty Images

Elon Musk’s AI company, xAI, is making progress on adding multimodal inputs to its Grok chatbot, according to public developer documents. What this means is that, soon, users may be able to upload photos to Grok and receive text-based answers.

This was first teased in a blog post last month from xAI which said Grok-1.5V will offer “multimodal models in a number of domains.” The latest update to the developer documents appear to show progress on shipping a new model.

In the developer documents, a sample Python script demonstrates how developers can use the xAI software development kit library to generate a response based on both text and images. This script reads an image file, sets up a text prompt, and uses the xAI SDK to generate a…

Continue reading…

    Get the daily email that makes reading the news actually enjoyable. Stay informed and entertained, for free.

    Your information is secure and your privacy is protected. By opting in you agree to receive emails from us. Remember that you can opt-out any time, we hate spam too!

    You May Also Like


    Collaboratively administrate turnkey channels whereas virtual e-tailers. Objectively seize scalable metrics whereas proactive e-services.


    Quickly coordinate e-business applications through revolutionary catalysts for change. Seamlessly underwhelm optimal testing procedures processes.

    Editor's Pick

    Jeffrey Miron and Jacob Winter Occupational licensing — for doctors, lawyers, plumbers, barbers, and innumerable other trades — claims to improve service quality. Much...

    Tech News

    Photo: Getty Images Gannett, the media company that owns hundreds of newspapers in the US, is launching a new program that adds AI-generated bullet...