Tag

Multimodal AI Tools

Explore AI tools tagged multimodal and filter them by pricing, API access, categories, and openness.

Platforms

Showing 1–10 of 10 platforms

ChatGPT

OpenAI

OpenAI's flagship assistant platform for writing, coding, research, image understanding, and multimodal workflows.

Text Generation
Coding
+3
APIClosedFree tierPaid plans

ChatGPT Image

OpenAI

OpenAI's native image generation inside ChatGPT — powered by GPT Image 1.5 / GPT Image 2.

Image Generation
API Platforms
APIClosedFree tierPaid plans

DeepSeek

DeepSeek

AI platform known for reasoning models, coding utility, open-weight releases, and cost-efficient APIs.

Text Generation
Coding
+3
APIOpenFree tierPaid plans

ERNIE Bot

Baidu Inc.

Baidu's large language model AI chatbot with multimodal generation capabilities.

Agents
Coding
+2
APIClosedFree tierPaid plans

Gemini

Google

Google's multimodal AI platform for search, Workspace, long-context analysis, and developer apps.

Image Generation
Research
+4
APIClosedFree tierPaid plans

Grok Imagine

xAI

xAI's creative image generation model integrated with Grok reasoning.

Image Generation
APIClosedFree tierPaid plans

HunyuanImage 3.0

Tencent

Large-scale open multimodal autoregressive image generation model.

Image Generation
APIOpenFree tierPaid plans

Kimi

Trending

Moonshot AI

AI assistant by Moonshot AI featuring industry-leading long-context windows, Mixture-of-Experts architecture, and advanced reasoning capabilities.

Text Generation
Coding
+3
APIOpenFree tierPaid plans

MiniMax

MiniMax

Multimodal AI platform with text, vision, audio, and video generation models, known for competitive pricing and strong video capabilities via Hailuo AI.

Text Generation
Video Generation
+3
APIOpenFree tierPaid plans

Operator

OpenAI

Autonomous web-browsing AI agent capable of performing real actions across websites and online workflows.

Agents
Automation
+1
No APIClosedPaid onlyPaid plans