This n8n workflow demonstrates how to automate image captioning with Gemini 1.5 Pro, a multimodal LLM that can accept and analyse images. It is a simple example of how easy it is to build powerful AI models into your repetitive tasks.

How it works

For this demo, we'll import a public image from a popular stock photography website, Pexels.com, into our workflow using the HTTP Request node. With multimodal LLMs, there is little to preprocess other than ensuring the ima
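
Outside n8n, the same idea (fetch an image, then hand it to a multimodal model with a text prompt) can be sketched in plain Python. This is only an illustration, not the workflow's own implementation: the helper names `build_caption_request` and `caption_image`, the default prompt, and the endpoint URL are assumptions made for this example, and the request and response shapes should be checked against the current Gemini API documentation before use.

```python
import base64
import json
import urllib.request

# Assumed endpoint for illustration; verify against the Gemini API docs.
GEMINI_URL = (
    "https://generativelanguage.googleapis.com/v1beta/"
    "models/gemini-1.5-pro:generateContent"
)


def build_caption_request(
    image_bytes: bytes,
    mime_type: str = "image/jpeg",
    prompt: str = "Write a short caption for this image.",
) -> dict:
    """Pair a text prompt with the image, sent inline as base64 (hypothetical helper)."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "contents": [
            {
                "parts": [
                    {"text": prompt},
                    {"inline_data": {"mime_type": mime_type, "data": encoded}},
                ]
            }
        ]
    }


def caption_image(image_url: str, api_key: str) -> str:
    """Download an image and ask the model for a caption (hypothetical helper)."""
    # Step 1: fetch the image, as the HTTP Request node does in the workflow.
    with urllib.request.urlopen(image_url) as resp:
        image_bytes = resp.read()

    # Step 2: send prompt + image to the model as a JSON POST.
    body = json.dumps(build_caption_request(image_bytes)).encode("utf-8")
    req = urllib.request.Request(
        GEMINI_URL,
        data=body,
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
    )
    with urllib.request.urlopen(req) as resp:
        result = json.load(resp)

    # Assumed response shape: the first candidate's first text part.
    return result["candidates"][0]["content"]["parts"][0]["text"]
```

Calling `caption_image(image_url, api_key)` with a public image URL and a valid API key would return the caption text; `build_caption_request` can be inspected on its own without any network access.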