Skip to content

Custom Skills

The two skills haiku.rag ships work against any LanceDB database. When you want a domain-specific skill that bundles its own data, prompt, and tool surface (for example, a "recipes" skill that knows about cooking and ships with a recipes database), generate one with haiku-rag create-skill.

The generated package is a regular pip-installable Python package that registers as a haiku.skills entry point. Any haiku.skills-aware host (haiku.skills CLI, your own agent, the AG-UI adapter) discovers it automatically.

When to use a custom skill

  • The model should consult a specific knowledge base for a specific kind of question, alongside other skills.
  • You want a different instruction prompt than the generic rag skill (different tone, refusal style, domain rules).
  • You want to ship a knowledge base plus its prompt as one distributable unit.
  • You're running multiple skills against different databases in the same agent.

If you just want to point a haiku.rag database at your own model and prompt, configure haiku.rag.yaml and use the built-in rag skill. No custom package needed.

Generate

haiku-rag create-skill \
  --name recipes \
  --db /path/to/recipes.lancedb \
  --tools search,cite \
  --description "Recipe and cooking knowledge base" \
  --preamble "You are a culinary expert helping with recipes and cooking techniques."

Then install and use:

uv pip install -e ./recipes-skill

haiku-skills list --use-entrypoints
# recipes — Recipe and cooking knowledge base

haiku-skills chat --use-entrypoints --skill recipes

Flags

Flag Description Default
--name Skill name (lowercase alphanumeric and hyphens). Required.
--db Path to the LanceDB database to embed. Required.
--description One-line skill description. The agent reads this to decide when to invoke. Standard RAG description
--tools Comma-separated tool subset, or all. all
--preamble Custom preamble for the skill's instructions. Standard RAG preamble
--config-file Path to a haiku.rag.yaml to embed alongside the database. None
--output / -o Output directory. Current directory

Available tools

cite, execute_code, get_document, list_documents, search.

Drop execute_code from --tools if the skill shouldn't run sandboxed Python. That gives you a search-and-cite-only skill with no analysis capabilities.

Anatomy of a generated skill

{name}-skill/
├── pyproject.toml
└── {name}_skill/
    ├── __init__.py    # create_skill() entry point
    ├── SKILL.md       # Skill metadata and instructions
    └── assets/
        ├── {name}.lancedb/   # The embedded database
        └── haiku.rag.yaml    # Optional config (only if --config-file passed)
  • SKILL.md carries the instruction prompt the agent will follow. The frontmatter includes the skill name and description. Everything below is the prompt body. Edit this to change behavior.
  • __init__.py exposes create_skill() (the entry point) and visualize_chunk() for rendering visual grounding.
  • assets/{name}.lancedb/ is the database, shipped inside the package.
  • assets/haiku.rag.yaml (optional) pins provider settings the skill needs.

The package can be installed locally with uv pip install -e . or published to PyPI.

Generating visual grounding from a custom skill

Each generated skill exposes a visualize_chunk() function that returns the chunk's bounding boxes rendered onto its source page:

from recipes_skill import visualize_chunk

images = await visualize_chunk(chunk_id)
# images is a list of PIL.Image objects, one per page the chunk covers
images[0].save("citation.png")

Pass chunk IDs from skill citations or search results. Same prerequisites as elsewhere in haiku.rag: documents need stored page images, and the chunk must come from a PDF or other docling-converted source.

Multi-skill agents

Each generated skill is self-contained with its own database and instructions. Compose multiple skills in one agent and the model routes between them via their descriptions:

from recipes_skill import create_skill as create_recipes_skill
from medic_skill import create_skill as create_medic_skill
from haiku.skills.agent import SkillToolset
from haiku.skills.prompts import build_system_prompt
from pydantic_ai import Agent

recipes = create_recipes_skill()
medic = create_medic_skill()
toolset = SkillToolset(skills=[recipes, medic])

agent = Agent(
    "openai-chat:gpt-4o",
    instructions=build_system_prompt(toolset.skill_catalog),
    toolsets=[toolset],
)

await agent.run("What's the optimal temperature for braising short ribs?")
# Routes to recipes

await agent.run("What's the field treatment for tension pneumothorax?")
# Routes to medic

Each skill maintains state under its own namespace (recipes, medic, …), so citations and searches don't collide.

Writing a skill from scratch

create-skill is the convenience path. If you need full control over the tools, state model, or instruction loading, write the skill against haiku.skills directly. The generated package in {name}_skill/__init__.py is a good reference. It composes haiku.rag's _tools factory with a haiku.skills.Skill and registers under the haiku.skills entry point group in pyproject.toml.

See the haiku.skills repository for the full Skill contract.