Autonomous AI agents with 90+ tools, browser automation, live vision, multi-model support, and Marathon Mode for multi-day execution.
/start
☁️ Welcome to Wispy!
Your autonomous AI agent. Just talk naturally!
Natural Language Control:
• "Build me a React dashboard" → I'll start working
• "How's it going?" → Check my progress
• "Yes" / "No" → Approve or reject actions
• "Pause" / "Continue" → Control the work
Voice Notes: 🎤
Send voice messages and I'll respond!
/voice - Toggle voice replies
Image Generation:
/image <description> - Generate AI images
Wallet:
/wallet - Check crypto wallet
Examples:
• "Create a landing page with Tailwind"
• "What's the status?"
• 🎤 Send a voice note
• /image A robot playing guitar
I work autonomously and keep you updated! 🚀
90+
Built-in Tools
1M
Context Tokens
4
Trust Levels
∞
Marathon Days
# Install globally with npm
$ npm install -g wispy-ai
$ wispy setup
$ wispy gateway
Works on macOS, Windows & Linux. The one-liner installs Node.js and everything else for you.
Dynamic thinking budget from minimal to high based on task complexity. Gemini 2.5 Pro for deep reasoning, Flash for speed. Multi-model routing with Claude, GPT, Groq, Ollama, and OpenRouter.
@google/genai, multi-provider
Multi-chain USDC payments on Base, SKALE (gasless), Solana, and Tempo. x402 HTTP-native micropayments, AP2 payment flows, BITE V2 conditional encryption, DeFi token swaps, ERC-8004 reputation, and on-chain audit reports.
x402, viem, ethers, @solana/web3.js
Google's Agent-to-Agent protocol for autonomous AI collaboration. Service discovery via /.well-known/agent.json, task delegation with streaming progress, capability querying, and cryptographically signed message exchange.
Google A2A Protocol
Playwright-powered browser engine with 39 tools. Navigate, click, type, screenshot, scroll, multi-tab sessions. 55 built-in scraper skills for Google, Amazon, LinkedIn, and more. Custom AI-generated automation.
playwright-core, 55 built-in skills
Semantic memory with Gemini text-embedding-004. SQLite-backed vector store with cosine similarity search. Store facts, conversations, and context that persist across sessions for long-term recall and personalization.
better-sqlite3, text-embedding-004
Deploy across Telegram, WhatsApp, Discord, Slack, Gmail, Zoho Mail, and REST API. Inline keyboards, voice notes, file sharing, image generation in-chat, cross-channel file access, and event broadcasting.
grammy, @whiskeysockets/baileys, discord.js
Webcam capture, video recording, and live vision streaming via ffmpeg. Imagen 3 image generation, screenshot analysis, and real-time frame analysis with Gemini vision.
ffmpeg, @google/genai (vision), Imagen 3
Schedule recurring tasks with cron expressions or natural language. Multi-day marathon research with automatic planning, execution, and progress tracking. Heartbeat monitoring for long-running tasks.
node-cron, marathon service
Deep web research, paper analysis, competitor intelligence with citation tracking.
wispy skill add researchFull-stack development with React, Next.js, Node.js, Python. Scaffolding and testing.
wispy skill add codegenProfessional PDFs with LaTeX. Reports, whitepapers, charts and flowcharts.
wispy skill add documentsMulti-chain wallets, x402 USDC payments, DeFi swaps, BITE encryption on Base, SKALE, and Solana.
wispy skill add web3The Wispy Platform lets non-technical founders set up AI agents, create workflows, and manage integrations -- all from a visual interface at app.wispy.cc. Get your agent live in under 2 minutes.
Create and fund dedicated wallets per agent. Connect Base, Coinbase, or MetaMask.
Connect Telegram, WhatsApp, and more with a few clicks. No terminal.
27+ integrations, 24+ skills. Drag, configure, deploy.
Use your own API keys and pay only hosting, or go fully managed.
Get updates on new features, integrations, and Wispy wisdom. No spam, unsubscribe anytime.