Analyze uploaded images, videos, audio, and documents with specialized tools, powered by a lightweight language-only agent.
What It Does

This workflow enables multimodal file analysis using Google Gemini tools connected to a text-only LLM agent. Users can upload images, videos, audio files, or documents via a chat interface.

The workflow will:
Upload each file to Google Gemini and obtain an accessible URL.
Dynamically generate contextual prompts based on the file(s) and user message.
A
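
The prompt-generation step described above can be pictured with a minimal sketch. It assumes each upload has already produced a URL; the names `UploadedFile`, `kind_of`, and `build_prompt`, the prompt layout, and the fallback request text are hypothetical and are not taken from the workflow itself. No Gemini API call is made here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class UploadedFile:
    # Hypothetical record for one uploaded file; field names are assumptions.
    name: str
    mime_type: str
    url: str  # URL assumed to come back from the upload step


def kind_of(mime_type: str) -> str:
    """Map a MIME type to one of the four file kinds the listing mentions."""
    top_level = mime_type.split("/", 1)[0]
    if top_level in ("image", "video", "audio"):
        return top_level
    return "document"  # everything else is treated as a document


def build_prompt(files: List[UploadedFile], user_message: str) -> str:
    """Combine file descriptions and the user's message into one text prompt."""
    file_lines = [f"- {kind_of(f.mime_type)}: {f.name} ({f.url})" for f in files]
    # Fall back to a generic request when the user sends files with no text.
    request = user_message.strip() or "Describe the attached files."
    return "Files:\n" + "\n".join(file_lines) + "\n\nRequest: " + request


if __name__ == "__main__":
    uploads = [
        UploadedFile("cat.png", "image/png", "https://example.invalid/f/1"),
        UploadedFile("report.pdf", "application/pdf", "https://example.invalid/f/2"),
    ]
    print(build_prompt(uploads, "What is shown here?"))
```

Because the agent is described as text-only, the sketch passes files along as URLs plus a kind label, so a downstream tool can be chosen per file kind.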
Marketplace: Independent
Category: operations