Overview
This template is a multimodal WhatsApp assistant that understands text, images, and audio, aggregates media inputs, and returns intelligent replies using Google Gemini. It can fetch knowledge from Google Docs, log conversations into Google Sheets, and respond via WhatsApp, all orchestrated inside n8n.
Features
Multimodal input handling: Receives images and audio from WhatsApp, analyzes them, and sends contextual responses.
Audio transcription: Converts voice messages to
Marketplace: Independent
Category: operations