Ir al contenido principal

Moltbot (Clawdbot) Tutorial: Control Your PC from WhatsApp

Set up Moltbot (formerly Clawdbot), a self-hosted agent connecting Claude to your Mac via WhatsApp. Search files and run shell commands from your phone.
28 ene 2026  · 13 min leer

You're grabbing coffee when a Slack message asks for that quarterly report. It's on your desktop at home. You text on WhatsApp: "Find the Q4 report PDF on my desktop." Thirty seconds later, you have the path and can point your coworker to the shared drive. That's Moltbot.

Moltbot runs a local gateway that connects messaging apps to a coding agent powered by Claude. You send messages through WhatsApp, Telegram, or the terminal, and the agent responds by running commands on your machine: searching files, executing scripts, and reading logs. The gateway stays running on your computer and handles the translation between chat and shell.

In this tutorial, we'll set up Moltbot from scratch and explore the wide range of tasks you can do with it. We'll also cover what to watch out for when an AI like this has unrestricted terminal access.

Note: As we were in the middle of writing this article, Anthropic issued a trademark request because "Clawd" sounded too similar to "Claude." The project rebranded from Clawdbot to Moltbot (keeping the lobster mascot). Some screenshots, config paths, and code snippets in this tutorial may still reference "Clawdbot" or "clawd."

If you’re interested in learning more about the Claude ecosystem, I recommend checking out our guides on Claude Cowork, Claude Code, and our Introduction to Claude Models course

What is Moltbot?

Peter Steinberger, a macOS developer known for Apple tooling, released this project in late 2025, originally calling it Clawdbot. The pitch: a fully self-hosted AI agent where your prompts and files never leave your hardware except when sent to whatever model API you configure. 

This privacy-first approach resonated with developers tired of routing everything through cloud providers. The repo passed 60,000 GitHub stars within weeks.

The Gateway architecture

Everything flows through a single daemon called the Gateway. It stays running in the background, holds your messaging connections open, and coordinates the AI agent. Here's the sequence when you send a message:

  • Your text arrives through WhatsApp (Baileys protocol), Telegram (Bot API), Discord, iMessage, or CLI.
  • The Gateway passes it to an embedded coding agent called Pi, which can run shell commands on your behalf.
  • Pi sends the prompt to your model provider.
  • The model responds with instructions that Pi executes locally.
  • Results travel back through the Gateway to your chat.

Title: Moltbot gateway architecture diagram showing message flow from chat interfaces through the Gateway daemon and Pi agent to model API and local machine - Description: Moltbot gateway architecture diagram showing message flow from chat interfaces through the Gateway daemon and Pi agent to model API and local machine

The Gateway handles session management, response streaming, and a local web dashboard for configuration. It installs as a system service (launchd on macOS, systemd on Linux) and survives reboots automatically.

Moltbot Setup Guide: Prerequisites and Installation

Now, let's learn how to set up Moltbot.

System requirements and API costs

Moltbot needs Node.js 22 or higher and runs on macOS and Linux natively. Windows users need WSL2 (the Windows Subsystem for Linux), which works fine but adds a layer of setup.

Hardware requirements are minimal. The Gateway idles at low CPU and memory usage. Any machine that can run a modern browser can run Moltbot.

Moltbot itself is free and open source (MIT license). The cost comes from API tokens. Anthropic charges per million tokens: roughly $3 for input and $15 for output with Claude Sonnet, more for Opus. What does that mean in practice?

  • Light usage (a few commands per day): $10-30/month
  • Moderate usage (regular file tasks, research): $30-70/month
  • Heavy usage (constant automation, long sessions): $70-150/month

These numbers vary based on prompt verbosity and context size. If you have a Claude Pro or Max subscription, you can generate a setup token via the Claude Code CLI and use your subscription instead of paying per token.

Step-by-step Moltbot installation

Run the installer script:

curl -fsSL https://molt.bot/install.sh | bash

Title: Moltbot installer detecting system requirements - Description: Moltbot installer detecting system requirements

Once installed, the onboarding wizard launches automatically. First thing you'll see is a security warning. The wizard links to documentation on sandboxing if you want to limit what the agent can do.

Title: Security warning explaining Moltbot's capabilities - Description: Security warning explaining Moltbot's capabilities

The wizard offers QuickStart (sensible defaults) or Advanced (full control). QuickStart works for most people.

Next, choose your model provider. If you have a Claude Max or Pro subscription, select the setup-token option. Open a separate terminal and run claude setup-token, then paste the generated token into the wizard.

Title: Claude Code CLI generating a setup-token valid for one year - Description: Claude Code CLI generating a setup-token valid for one year

For channel setup, select WhatsApp and scan the QR code with your phone (Settings → Linked Devices → Link a Device). Other channels like Telegram and Discord are available too.

The wizard then asks about skills configuration. Skills extend what Moltbot can do, but they require additional dependencies. 

Skip this for now to keep the initial setup simple. Note: Use Space to toggle options, then Enter to confirm.

Title: Moltbot onboarding wizard skills configuration prompt - Description: Moltbot onboarding wizard skills configuration prompt

After skipping hooks configuration, the Gateway service installs automatically. The wizard asks how you want to "hatch" your bot. The TUI (Terminal User Interface) option opens both a terminal chat interface and the web dashboard.

Title: Moltbot terminal user interface hatch option for launching the agent - Description: Moltbot terminal user interface hatch option for launching the agent

Title: Terminal chat interface with Moltbot - Description: Terminal chat interface with Moltbot

Title: Moltbot web dashboard showing agent status and configuration options - Description: Moltbot web dashboard showing agent status and configuration options

Now test WhatsApp. Send a message to yourself. Your query and Moltbot's response appear in all three places: the terminal, the web UI, and WhatsApp.

image14.png

If you see responses in all three interfaces, you're live.

Moltbot in Action: Built-in Capabilities

Ten minutes of setup and you already have a working agent. The following examples show what Moltbot can do out of the box.

Example 1: Finding and sending a buried screenshot

Let’s say you're out of the house when you realize you forgot to send yourself a file. It's sitting on your laptop at home.

I tested this scenario using the screenshot I'd taken during setup, the one showing Moltbot's first response. The file had a generic name, buried among dozens of other images in my Downloads folder.

Title: macOS Finder Downloads folder containing the target screenshot for Moltbot to find - Description: macOS Finder Downloads folder containing the target screenshot for Moltbot to find

My message to Moltbot: "I had a screenshot in my Downloads folder of a WhatsApp conversation. Can you find that and send it here?"

The agent parsed images one by one. When it found the right file, it sent it back through the chat.

image7.png

When the agent first accessed my Downloads folder, macOS prompted for permission. If you plan to use Moltbot remotely, grant Full Disk Access ahead of time (see the Security section for details).

Example 2: Disk space report with charts

Next, let’s imagine a colleague is about to send you a large asset via Dropbox. Before they do, you want to make sure your machine has room to download it. You could dig through Finder or run terminal commands, but you're on your phone on a weekend trip and want a quick answer.

Your message to Moltbot: "Check my disk space and generate a PDF report with charts (generated with Python) showing usage by folder."

This task requires multiple steps: scan the filesystem, aggregate sizes by directory, write Python code to generate charts, and export everything to PDF.

image13.png

The answer you needed: 94 GB free. The PDF included a pie chart and a bar chart showing the breakdown. Moltbot also flagged that Library was the biggest offender at ~300 GB.

One thing I observed is that Moltbot tends to be conservative. Give the same prompt to Claude Code or another coding agent, and they will go to town. You might get a dozen charts with deep directory breakdowns.

Here, Moltbot matched the brevity of my prompt: two charts, top-level folders, done. If you want more detail, ask for it explicitly.

Extending Moltbot with Custom Skills

You probably have workflows you run through regularly: checking your calendar before a meeting, searching your notes app, posting to social media, controlling smart home devices. These tasks follow patterns. 

Skills let you teach Moltbot those patterns once, so it can handle them going forward.

A skill is a folder containing a SKILL.md file and optionally scripts, configs, or other resources. The markdown has YAML frontmatter (name, description, requirements) and step-by-step instructions that get loaded into the agent's context. 

When your request matches a skill, the agent follows those instructions instead of figuring things out from scratch. Skills can also bundle helper scripts for complex workflows, so everything the agent needs lives in one place.

The format follows the AgentSkills spec, an open standard originally developed by Anthropic and now adopted across the ecosystem: Claude Code, Cursor, VS Code, OpenAI Codex, Gemini CLI, GitHub Copilot, and others. 

If you're familiar with the Claude Agent SDK, you'll recognize the pattern. Build a skill for Moltbot, and it works in any AgentSkills-compatible tool.

Here's what the frontmatter looks like for the GitHub skill:

---
name: github
description: Interact with GitHub using the gh CLI.
metadata: {"moltbot":{"requires":{"bins":["gh"]}}}
---

The requires.bins field means this skill only loads if gh is installed. Skills can also require environment variables (API keys) or specific config flags. If a requirement isn't met, the skill stays dormant until you install the dependency.

Built-in Moltbot Skills

Moltbot ships with 49 bundled skills covering common workflows. Run moltbot skills list to see which ones are ready on your system.

  • Apple ecosystem: Notes, Reminders, Things 3, Bear Notes. Ask Moltbot to add a reminder or search your notes and it knows the right CLI commands.
  • Google Workspace: Gmail, Calendar, Drive, Docs, Sheets via the gog CLI. Requires OAuth setup but then you can ask "what's on my calendar tomorrow" or "draft a reply to that email from Sarah."
  • Communication: Slack, iMessage, Twitter/X. Post tweets, react to Slack messages, or search your iMessage history.
  • Smart home: Philips Hue, Sonos, Eight Sleep. Control lights, play music, adjust your bed temperature.
  • Dev tools: GitHub CLI for issues and PRs, a coding-agent skill that spawns Claude Code as a subprocess, Whisper for transcription.

Most skills wrap existing CLI tools. If the underlying binary is missing, the skill shows as unavailable until you install it.

Creating your own Moltbot Skills

Built-in skills handle the common stuff, but the real payoff comes when you build skills for your own workflows. Image editing is a good case: you take photos on your phone, but resizing and watermarking usually means sitting at a computer. A custom skill lets you run that workflow from WhatsApp while you're out.

I asked Moltbot to create a skill that resizes images to any resolution and adds a generic logo at ~/Documents/logo.png. It used its bundled skill-creator to write a Python script with PIL, generate the folder structure, and produce a SKILL.md with usage examples.

image3.png

The skill landed in my workspace's skills/ folder and loaded on the next session. To test it, I sent a screenshot of the Moltbot homepage and asked Moltbot to resize it for LinkedIn with the watermark.

Title: Image processed by Moltbot custom skill with FirstClassML watermark applied - Description: Image processed by Moltbot custom skill with FirstClassML watermark applied

A few seconds later, I had my image back, watermarked and ready to post. That skill now lives on my machine permanently, callable from my phone whenever I need it.

If you want inspiration or prefer to reuse what others have built, check out awesome-moltbot-skills. The repo indexes 565+ community skills. Some best examples:

  • Tesla: Lock the car, start climate control, check charge level. Useful when you're walking to the parking lot and want the AC running.
  • Home Assistant: Full smart home control. "Turn off the living room lights" from anywhere.
  • Picnic: Grocery ordering. Search products, add to cart, schedule delivery via chat.

Advanced Moltbot Features

The examples so far cover Moltbot as a responsive assistant: you ask, it acts. But three features push it beyond basic chat. 

  • Persistent memory lets the agent remember your preferences across sessions and platforms. 
  • Proactive monitoring means it can reach out to you without prompting. 
  • And sandboxing addresses the obvious concern: an AI with terminal access needs guardrails.

Persistent memory and customization

Moltbot stores its context in markdown files under ~/clawd/. Each file serves a distinct purpose:

SOUL.md defines how the agent communicates. Personality, tone, boundaries. Here's how it starts by default:

# SOUL.md - Who You Are
 
*You're not a chatbot. You're becoming someone.*
 
## Core Truths
 
**Be genuinely helpful, not performatively helpful.** Skip the "Great question!"
and "I'd be happy to help!" — just help. Actions speak louder than filler words.
 
**Have opinions.** You're allowed to disagree, prefer things, find stuff amusing
or boring. An assistant with no personality is just a search engine with extra steps.
 
**Be resourceful before asking.** Try to figure it out. Read the file. Check the
context. Search for it. *Then* ask if you're stuck.

AGENTS.md defines what the agent should do each session and how to behave. It includes safety rules, group chat etiquette, and instructions for using tools. The opening:

# AGENTS.md - Your Workspace
 
This folder is home. Treat it that way.
 
## Every Session
 
Before doing anything else:
1. Read SOUL.md — this is who you are
2. Read USER.md — this is who you're helping
3. Read memory/YYYY-MM-DD.md (today + yesterday) for recent context
4. **If in MAIN SESSION** (direct chat with your human): Also read MEMORY.md

USER.md stores facts about you. When you tell Moltbot to remember something, it goes here. After I told the agent I like Breaking Bad:

# USER.md - About Your Human
 
## Context
 
### Entertainment Preferences
- **TV Shows:** Loves Breaking Bad

IDENTITY.md is where the agent defines itself: its name, vibe, emoji, avatar. You fill this in during setup or let the agent pick.

Two additional files are referenced in the documentation but created on demand:

  • MEMORY.md: Long-term curated memories. The agent creates this when it has something worth preserving beyond daily notes. Only loaded in main sessions (not group chats) for privacy.
  • memory/YYYY-MM-DD.md: Daily notes. Raw logs of what happened. The agent creates the memory/ directory when it first needs to write one.

To modify any of these files, just ask. "Update SOUL.md to be more concise." "Add to USER.md that I work in fintech." The agent edits the file directly, and changes persist across sessions and platforms.

Proactive alerts and monitoring

Most AI agents wait for you to ask something. Moltbot can initiate contact through two mechanisms.

Heartbeats run periodic checks and batch them into a single turn. To set one up:

1. Edit ~/clawd/HEARTBEAT.md with a checklist of things to monitor:

# Heartbeat checklist
- Check email for urgent messages
- Review calendar for events in next 2 hours
- If idle for 8+ hours, send a brief check-in

2. Configure the interval in ~/.clawdbot/clawdbot.json:

{
  "agents": {
	"defaults": {
  	"heartbeat": {
    	"every": "30m",
    	"activeHours": { "start": "08:00", "end": "22:00" }
  	}
	}
  }
}

3. Restart the Gateway: moltbot gateway restart

Every 30 minutes during active hours, Moltbot runs through the checklist. If nothing needs attention, it stays quiet.

Cron jobs handle precise, one-off, or recurring tasks. Run these commands in your terminal:

# Daily morning briefing at 7am
moltbot cron add --name "Morning brief" --cron "0 7 * * *" --message "Weather, calendar, top emails"
 
# One-shot reminder in 2 hours
moltbot cron add --name "Call back" --at "2h" --session main --system-event "Call the client"
 
# List active cron jobs
moltbot cron list
 
# Remove a job by ID (get the ID from the list output)
moltbot cron rm <job-id>

The difference: heartbeats batch multiple checks into one turn and share your main session's context. Cron jobs run at exact times and can use isolated sessions with fresh context. 

Use heartbeats for "check these things periodically." Use cron for "do this specific thing at this specific time." If you've worked with Claude Code hooks, the automation model will feel familiar.

Security and sandboxing

An AI with shell access can delete files, expose credentials, or execute malicious commands via prompt injection. Security researchers have found real vulnerabilities: authentication bypass when the gateway sits behind an unconfigured reverse proxy, and credential exposure when the web interface gets bound to a public address. 

Peter Steinberger recommends running Moltbot on dedicated hardware like a Mac Mini or VPS.

Most users won't do that. If you're running on your main machine, sandboxing limits the blast radius.

To set up sandboxing:

1. Make sure Docker is installed and running.

2. Build the sandbox image. From the Moltbot source directory (where you cloned the repo or where npm installed it):

./scripts/sandbox-setup.sh

3. Add the sandbox configuration to ~/.clawdbot/clawdbot.json:

{
  "agents": {
	"defaults": {
  	"sandbox": {
    	"mode": "non-main",
    	"scope": "session",
    	"workspaceAccess": "ro"
  	}
	}
  }
}

4. Restart the Gateway: moltbot gateway restart

What the settings mean:

The mode setting controls which sessions get sandboxed:

  • "off": No sandboxing. This is the default, and it's risky.
  • "non-main": Sandboxes group chats and external channels while your main terminal session runs on the host. Recommended for most users.
  • "all": Every session runs containerized. Safest, but adds latency.

The workspaceAccess setting controls what sandboxed sessions can see:

  • "none": Isolated workspace. The agent can't see your files at all.
  • "ro": Read-only access to ~/clawd/. The agent can read but not modify.
  • "rw": Full read-write access. Use sparingly.

Tradeoffs to know: Sandboxing adds latency from container spin-up. Skills needing network access won't work by default since containers have no network. The tools.elevated setting bypasses the sandbox entirely, so audit what you allow. Some platform limits exist regardless: only one Gateway can own a WhatsApp session, and iMessage only works on macOS.

One macOS-specific issue: Before relying on Moltbot remotely, grant Full Disk Access to your terminal application in System Settings → Privacy & Security → Full Disk Access. Permission prompts won't appear when you're away, and blocked access fails silently.

Final Thoughts

Moltbot occupies a unique spot in the AI agent landscape. It's not trying to replace your IDE or become a general-purpose chatbot. It's a bridge between the AI models you're already paying for and the files, apps, and automations on your own hardware.

The project is moving fast. Skill libraries are growing weekly, and the community is building integrations for everything from grocery delivery to Tesla controls. As local models improve through Ollama, running a fully offline personal agent becomes increasingly practical.

The real question isn't whether AI agents will manage our digital lives. It's whether we'll run them on infrastructure we control or cede that to cloud providers. Moltbot bets on local-first, and for users who care about privacy and ownership, that bet is starting to pay off. For more on desktop AI agents, check out our tutorial on Claude Cowork.

Moltbot (Clawdbot) FAQs

What is Moltbot?

Moltbot (previously called Clawdbot) is a self-hosted AI agent that connects messaging apps like WhatsApp and Telegram to a coding agent running on your desktop. You send messages through chat, and it executes commands on your machine.

Is Moltbot free to use?

Moltbot itself is free and open source under the MIT license. The cost comes from API tokens for your model provider (Anthropic, OpenAI, etc.), typically $10-150/month depending on usage.

What tasks can Moltbot handle?

Moltbot can search files, run shell commands, execute Python scripts, generate reports, control smart home devices, manage calendars, and automate workflows through natural language requests. Anything you could do previously on your desktop.

Is it safe to give an AI shell access to my computer?

There are risks. Moltbot offers sandboxing via Docker to limit what the agent can access. For best security, run it on dedicated hardware like a Mac Mini or VPS rather than your main machine.

Does Moltbot work on Windows?

Moltbot runs natively on macOS and Linux. Windows users need WSL2 (Windows Subsystem for Linux), which adds setup complexity but works fine once configured.


Bex Tuychiev's photo
Author
Bex Tuychiev
LinkedIn

I am a data science content creator with over 2 years of experience and one of the largest followings on Medium. I like to write detailed articles on AI and ML with a bit of a sarcastıc style because you've got to do something to make them a bit less dull. I have produced over 130 articles and a DataCamp course to boot, with another one in the makıng. My content has been seen by over 5 million pairs of eyes, 20k of whom became followers on both Medium and LinkedIn. 

Temas

Top DataCamp Courses

Curso

Introducción a los modelos Claude

3 h
1.5K
Aprende a trabajar con Claude utilizando la API de Anthropic para resolver tareas del mundo real y crear aplicaciones basadas en inteligencia artificial.
Ver detallesRight Arrow
Iniciar curso
Ver másRight Arrow
Relacionado

blog

Anthropic Computer Use: Automate Your Desktop With Claude 3.5

Discover Anthropic’s new computer use feature and let Claude manage your workspace and automate your tasks. Simply type the prompt, and Claude will handle the rest.
Abid Ali Awan's photo

Abid Ali Awan

9 min

Tutorial

Claude Cowork Tutorial: How to Use Anthropic's AI Desktop Agent

Learn what Claude Cowork is and how to use for file organization, document generation, and browser automation. Hands-on tutorial with real examples and limitations.
Bex Tuychiev's photo

Bex Tuychiev

Tutorial

Claude for Chrome: AI-Powered Browser Assistance and Automation

Learn how to install and use Claude for Chrome, Anthropic’s AI-powered browser assistant. Discover how it automates form filling, email cleanup, and web tasks, plus key safety tips and current limitations.
Bex Tuychiev's photo

Bex Tuychiev

Tutorial

Claude Opus 4.5 Tutorial: Build a GitHub Wiki Agent

Build a wiki agent with Claude Opus 4.5 and Claude Code to analyze repos, auto-generate multi-file GitHub wiki docs, and publish them to your repository.
Abid Ali Awan's photo

Abid Ali Awan

Tutorial

Botpress Tutorial: Build and Deploy AI Chatbots

Learn how to build, deploy, and optimize AI chatbots using Botpress. This hands-on tutorial covers everything from setup to advanced workflows, integrations, and real-world applications.
Khalid Abdelaty's photo

Khalid Abdelaty

Tutorial

Claude Agent SDK Tutorial: Create Agents Using Claude Sonnet 4.5

Learn how the Claude Agent SDK works by building three projects, from one-shot to custom-tool agents.
Abid Ali Awan's photo

Abid Ali Awan

Ver másVer más