Skip to main content

Optional Skills Catalog

Official optional skills ship with the hermes-agent repository under optional-skills/ but are not active by default. Install them explicitly:

hermes skills install official/<category>/<skill>

For example:

hermes skills install official/blockchain/solana
hermes skills install official/mlops/flash-attention

Once installed, the skill appears in the agent's skill list and can be loaded automatically when relevant tasks are detected.

To uninstall:

hermes skills uninstall <skill-name>

Autonomous AI Agents

SkillDescription
blackboxDelegate coding tasks to Blackbox AI CLI agent. Multi-model agent with built-in judge that runs tasks through multiple LLMs and picks the best result.
honchoConfigure and use Honcho memory with Hermes — cross-session user modeling, multi-profile peer isolation, observation config, and dialectic reasoning.

Blockchain

SkillDescription
baseQuery Base (Ethereum L2) blockchain data with USD pricing — wallet balances, token info, transaction details, gas analysis, contract inspection, whale detection, and live network stats. No API key required.
solanaQuery Solana blockchain data with USD pricing — wallet balances, token portfolios, transaction details, NFTs, whale detection, and live network stats. No API key required.

Communication

SkillDescription
one-three-one-ruleStructured communication framework for proposals and decision-making.

Creative

SkillDescription
blender-mcpControl Blender directly from Hermes via socket connection to the blender-mcp addon. Create 3D objects, materials, animations, and run arbitrary Blender Python (bpy) code.
concept-diagramsGenerate flat, minimal light/dark-aware SVG diagrams as standalone HTML files, using a unified educational visual language (9 semantic color ramps, automatic dark mode). Best for physics setups, chemistry mechanisms, math curves, physical objects (aircraft, turbines, smartphones), floor plans, cross-sections, lifecycle/process narratives, and hub-spoke system diagrams. Ships with 15 example diagrams.
meme-generationGenerate real meme images by picking a template and overlaying text with Pillow. Produces actual .png meme files.
touchdesigner-mcpControl a running TouchDesigner instance via the twozero MCP plugin — create operators, set parameters, wire connections, execute Python, build real-time audio-reactive visuals and GLSL networks. 36 native tools.

Dogfood

SkillDescription
adversarial-ux-testRoleplay the most difficult, tech-resistant user for a product — browse in-persona, rant, then filter through a RED/YELLOW/WHITE/GREEN pragmatism layer so only real UX friction becomes tickets.

DevOps

SkillDescription
cliRun 150+ AI apps via inference.sh CLI (infsh) — image generation, video creation, LLMs, search, 3D, and social automation.
docker-managementManage Docker containers, images, volumes, networks, and Compose stacks — lifecycle ops, debugging, cleanup, and Dockerfile optimization.

Email

SkillDescription
agentmailGive the agent its own dedicated email inbox via AgentMail. Send, receive, and manage email autonomously using agent-owned email addresses.

Health

SkillDescription
fitness-nutritionGym workout planner and nutrition tracker. Search 690+ exercises by muscle, equipment, or category via wger. Look up macros and calories for 380,000+ foods via USDA FoodData Central. Computes BMI, TDEE, one-rep max, macro splits, and body fat — pure Python, no pip installs.
neuroskill-bciBrain-Computer Interface (BCI) integration for neuroscience research workflows.

MCP

SkillDescription
fastmcpBuild, test, inspect, install, and deploy MCP servers with FastMCP in Python. Covers wrapping APIs or databases as MCP tools, exposing resources or prompts, and deployment.
mcporterThe mcporter CLI — list, configure, auth, and call MCP servers/tools directly (HTTP or stdio) from the terminal. Useful for ad-hoc MCP interactions; for always-on tool discovery use the built-in native-mcp client instead.

Migration

SkillDescription
openclaw-migrationMigrate a user's OpenClaw customization footprint into Hermes Agent. Imports memories, SOUL.md, command allowlists, user skills, and selected workspace assets.

MLOps

The largest optional category — covers the full ML pipeline from data curation to production inference.

SkillDescription
accelerateSimplest distributed training API. 4 lines to add distributed support to any PyTorch script. Unified API for DeepSpeed/FSDP/Megatron/DDP.
chromaOpen-source embedding database. Store embeddings and metadata, perform vector and full-text search. Simple 4-function API for RAG and semantic search.
clipOpenAI's vision-language model connecting images and text. Zero-shot image classification, image-text matching, and cross-modal retrieval. Trained on 400M image-text pairs. Use for image search, content moderation, or vision-language tasks without fine-tuning.
faissFacebook's library for efficient similarity search and clustering of dense vectors. Supports billions of vectors, GPU acceleration, and various index types (Flat, IVF, HNSW).
flash-attentionOptimize transformer attention with Flash Attention for 2-4x speedup and 10-20x memory reduction. Supports PyTorch SDPA, flash-attn library, H100 FP8, and sliding window.
guidanceControl LLM output with regex and grammars, guarantee valid JSON/XML/code generation, enforce structured formats, and build multi-step workflows with Guidance — Microsoft Research's constrained generation framework.
hermes-atropos-environmentsBuild, test, and debug Hermes Agent RL environments for Atropos training. Covers the HermesAgentBaseEnv interface, reward functions, agent loop integration, and evaluation.
huggingface-tokenizersFast Rust-based tokenizers for research and production. Tokenizes 1GB in under 20 seconds. Supports BPE, WordPiece, and Unigram algorithms.
instructorExtract structured data from LLM responses with Pydantic validation, retry failed extractions automatically, and stream partial results.
lambda-labsReserved and on-demand GPU cloud instances for ML training and inference. SSH access, persistent filesystems, and multi-node clusters.
llavaLarge Language and Vision Assistant — visual instruction tuning and image-based conversations combining CLIP vision with LLaMA language models.
modalServerless GPU cloud platform for running ML workloads. On-demand GPU access without infrastructure management, ML model deployment as APIs, or batch jobs with automatic scaling.
nemo-curatorGPU-accelerated data curation for LLM training. Fuzzy deduplication (16x faster), quality filtering (30+ heuristics), semantic dedup, PII redaction. Scales with RAPIDS.
peft-fine-tuningParameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Train <1% of parameters with minimal accuracy loss for 7B–70B models on limited GPU memory. HuggingFace's official PEFT library.
pineconeManaged vector database for production AI. Auto-scaling, hybrid search (dense + sparse), metadata filtering, and low latency (under 100ms p95).
pytorch-fsdpExpert guidance for Fully Sharded Data Parallel training with PyTorch FSDP — parameter sharding, mixed precision, CPU offloading, FSDP2.
pytorch-lightningHigh-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks, and minimal boilerplate.
qdrantHigh-performance vector similarity search engine. Rust-powered with fast nearest neighbor search, hybrid search with filtering, and scalable vector storage.
saelensTrain and analyze Sparse Autoencoders (SAEs) using SAELens to decompose neural network activations into interpretable features.
simpoSimple Preference Optimization — reference-free alternative to DPO with better performance (+6.4 pts on AlpacaEval 2.0). No reference model needed.
slimeLLM post-training with RL using Megatron+SGLang framework. Custom data generation workflows and tight Megatron-LM integration for RL scaling.
stable-diffusion-image-generationState-of-the-art text-to-image generation with Stable Diffusion via HuggingFace Diffusers. Text-to-image, image-to-image translation, inpainting, and custom diffusion pipelines.
tensorrt-llmOptimize LLM inference with NVIDIA TensorRT for maximum throughput. 10-100x faster than PyTorch on A100/H100 with quantization (FP8/INT4) and in-flight batching.
torchtitanPyTorch-native distributed LLM pretraining with 4D parallelism (FSDP2, TP, PP, CP). Scale from 8 to 512+ GPUs with Float8 and torch.compile.
whisperOpenAI's general-purpose speech recognition. 99 languages, transcription, translation to English, and language ID. Six model sizes from tiny (39M) to large (1550M). Best for robust multilingual ASR.

Productivity

SkillDescription
canvasCanvas LMS integration — fetch enrolled courses and assignments using API token authentication.
memento-flashcardsSpaced repetition flashcard system for learning and knowledge retention.
siyuanSiYuan Note API for searching, reading, creating, and managing blocks and documents in a self-hosted knowledge base.
telephonyGive Hermes phone capabilities — provision a Twilio number, send/receive SMS/MMS, make calls, and place AI-driven outbound calls through Bland.ai or Vapi.

Research

SkillDescription
bioinformaticsGateway to 400+ bioinformatics skills from bioSkills and ClawBio. Covers genomics, transcriptomics, single-cell, variant calling, pharmacogenomics, metagenomics, and structural biology.
domain-intelPassive domain reconnaissance using Python stdlib. Subdomain discovery, SSL certificate inspection, WHOIS lookups, DNS records, and bulk multi-domain analysis. No API keys required.
duckduckgo-searchFree web search via DuckDuckGo — text, news, images, videos. No API key needed.
gitnexus-explorerIndex a codebase with GitNexus and serve an interactive knowledge graph via web UI and Cloudflare tunnel.
parallel-cliVendor skill for Parallel CLI — agent-native web search, extraction, deep research, enrichment, and monitoring.
qmdSearch personal knowledge bases, notes, docs, and meeting transcripts locally using qmd — a hybrid retrieval engine with BM25, vector search, and LLM reranking.
scraplingWeb scraping with Scrapling — HTTP fetching, stealth browser automation, Cloudflare bypass, and spider crawling via CLI and Python.

Security

SkillDescription
1passwordSet up and use 1Password CLI (op). Install the CLI, enable desktop app integration, sign in, and read/inject secrets for commands.
oss-forensicsOpen-source software forensics — analyze packages, dependencies, and supply chain risks.
sherlockOSINT username search across 400+ social networks. Hunt down social media accounts by username.

Contributing Optional Skills

To add a new optional skill to the repository:

  1. Create a directory under optional-skills/<category>/<skill-name>/
  2. Add a SKILL.md with standard frontmatter (name, description, version, author)
  3. Include any supporting files in references/, templates/, or scripts/ subdirectories
  4. Submit a pull request — the skill will appear in this catalog once merged