Skills

Name: SHIPIT Agent
Author: SHIPIT

Use packaged skills, attach prebuilt skills to Agent and DeepAgent, and create your own custom skills catalog.

8 min read

38 sections

SHIPIT Agent supports a skills layer on top of the normal runtime. A skill is a reusable instruction block plus metadata that can be:

loaded from a packaged JSON catalog
attached explicitly to an Agent or DeepAgent
auto-matched from the user prompt
created locally and saved into your own skill catalog

Skills do not replace tools. They shape how the agent approaches the task. Use tools for capabilities. Use skills for reusable behavior, workflow, and domain guidance.

What a skill contains

Each Skill can include:

Field group	Fields
Identity	`id`, `name`, `display_name`
Descriptions	`description`, `detailed_description`, `long_description`
Discovery	`tags`, `features`, `use_cases`, `how_to_use`
Matching	`trigger_phrases`
Prompting	`prompt_template`
Runtime hints	`tools`, `requirements`, `mcps`

If a marketplace-style skill has no explicit prompt_template, SHIPIT derives prompt text from the skill metadata so it still works at runtime.

Packaged skills catalog

The library now ships with a packaged skills catalog:

python

from shipit_agent import DEFAULT_SKILLS_PATH, FileSkillRegistry

registry = FileSkillRegistry(DEFAULT_SKILLS_PATH)

print(DEFAULT_SKILLS_PATH)
print(len(registry.list()))

Search it:

python

matches = registry.search("database")
for skill in matches[:5]:
    print(skill.id, "-", skill.description)

The packaged file lives at:

text

shipit_agent/skills/skills.json

Detailed discovery example:

python

from shipit_agent import DEFAULT_SKILLS_PATH, FileSkillRegistry

registry = FileSkillRegistry(DEFAULT_SKILLS_PATH)

print(f"Catalog path: {DEFAULT_SKILLS_PATH}")
print(f"Total skills: {len(registry.list())}")

print("\nTop matches for 'database':")
for skill in registry.search("database")[:5]:
    print(f"- {skill.id}: {skill.description}")

print("\nTop matches for 'workflow':")
for skill in registry.search("workflow")[:5]:
    print(f"- {skill.id}: {skill.description}")

Agent with skills

Agent supports four main skill entry points:

skills=[...] — always attach these skills
default_skill_ids=[...] — attach known skill ids from the active catalog
auto_use_skills=True — auto-match skills from the user prompt
skill_source=... — point the agent at a custom skill catalog file

`skills` vs `auto_use_skills`

These two settings are related, but they do different jobs:

Setting	Behavior
`skills=[...]`	Explicitly attach these skills for every run
`auto_use_skills=False`	Disable prompt-based skill matching and keep behavior deterministic
`auto_use_skills=True`	Let the agent auto-match additional skills from the prompt
`skill_match_limit`	Cap how many auto-matched skills can be added for a run

Example:

python

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skills=["code-workflow-assistant"],
    auto_use_skills=True,
    skill_match_limit=2,
)

What this means:

code-workflow-assistant is always active
the agent may auto-match more skills from the packaged catalog
the final run can use both the fixed skills and the auto-matched ones

If you want predictable behavior, set auto_use_skills=False. If you want a more adaptive agent, leave auto_use_skills=True.

Skills improve the approach, not the missing context

Skills help the agent respond with better structure, domain guidance, and workflow. They do not invent the missing details of your task.

Bad prompt:

python

result = agent.run(
    "Plan this feature, debug the slow path, and suggest database improvements."
)

That prompt does not tell the agent:

which feature
which service or codebase
which query or endpoint is slow
what the constraints are

Better prompt:

python

result = agent.run(
    """
    We are adding tenant-based billing alerts.

    Context:
    - Backend: FastAPI + PostgreSQL
    - Slow path: GET /api/billing/alerts takes 4.8s
    - Main query joins alerts, tenants, invoices, and users
    - Goal: reduce latency below 500ms

    Please:
    1. Plan the feature work
    2. Debug the likely slow path
    3. Suggest database improvements
    """
)

Use this mental model:

user prompt = the case
tools = the hands
skills = the playbook

Complete example:

python

from shipit_agent import Agent
from examples.run_multi_tool_agent import build_llm_from_env

llm = build_llm_from_env("bedrock")

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skills=["database-architect", "code-workflow-assistant"],
    auto_use_skills=True,
    skill_match_limit=2,
)

result = agent.run(
    """
    We are adding tenant-based billing alerts.

    Context:
    - Backend: FastAPI + PostgreSQL
    - Slow path: GET /api/billing/alerts takes 4.8s
    - Main query joins alerts, tenants, invoices, and users
    - Goal: reduce latency below 500ms

    Please:
    1. Plan the feature work
    2. Debug the likely slow path
    3. Suggest database improvements
    """
)
print(result.output)

More explicit end-to-end example:

python

from shipit_agent import Agent
from examples.run_multi_tool_agent import build_llm_from_env

llm = build_llm_from_env("bedrock")

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skills=["database-architect", "code-workflow-assistant"],
    auto_use_skills=True,
    skill_match_limit=2,
)

prompt = """
We are adding tenant-based billing alerts to an existing FastAPI service.

Project context:
- Backend: FastAPI
- Database: PostgreSQL 15
- Current endpoint: GET /api/billing/alerts
- Current performance: 4.8s p95
- Schema areas involved: alerts, tenants, invoices, users
- Constraint: no breaking API changes this sprint
- Goal: reduce the endpoint below 500ms and produce a safe implementation plan

What I want:
1. Summarize the likely problem areas
2. Propose a concrete implementation plan
3. Suggest database/index/query improvements
4. Call out testing and rollout risks
"""

result = agent.run(prompt)
print(result.output)

Attach more skills later:

python

agent.add_skill("portfolio-website-builder")

for skill in agent.skills:
    print(skill.id)

Search the active catalog from the live agent:

python

for skill in agent.search_skills("workflow")[:5]:
    print(skill.id)

Runtime-management example:

python

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp",
    auto_use_skills=False,
    skills=["database-architect"],
)

print("Initially attached:")
for skill in agent.skills:
    print("-", skill.id)

print("\nSearch for a writing skill:")
for skill in agent.search_skills("brand")[:3]:
    print("-", skill.id)

agent.add_skill("brand-voice-guide")

print("\nAfter add_skill:")
for skill in agent.skills:
    print("-", skill.id)

Agent constructor parameters

Parameter	What it does
`skill_registry`	Supply an already constructed registry
`skill_source`	Load skills from a JSON file path
`skills`	Attach explicit skills by id or `Skill` object
`default_skill_ids`	Attach skill ids from the catalog
`auto_use_skills`	Enable prompt-based skill matching
`skill_match_limit`	Limit how many auto-matched skills are applied
`project_root`	Root directory used by skill-linked file and shell tools. Defaults to `/tmp`

Skills can also attach tools automatically

Selected skills do not only affect the prompt. They can also attach stronger built-in tools for that run.

For example, web-scraper-pro can bring in tools like:

web_search
open_url
playwright_browse
bash
read_file
edit_file
write_file
glob_files
grep_files

This means skills now shape both:

how the agent thinks
which tools the agent can use

Example:

python

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skills=["web-scraper-pro", "code-workflow-assistant"],
    auto_use_skills=False,
)

result = agent.run(
    """
    Scrape the target pricing page, inspect the local parser under /tmp,
    patch the output shape, and save the cleaned data.
    """
)

print(result.metadata["used_skills"])
print(result.metadata["used_skill_tools"])

AgentResult.metadata now exposes:

Field	Meaning
`used_skills`	Final list of skill ids used for the run
`used_skill_tools`	Extra built-in tools injected because of those skills

Automatic iteration boost

When skills are active and the agent's max_iterations is at the default value (4), the runtime automatically boosts it to 8. This gives the agent enough turns to use the extra tools that skills inject — without the caller having to tune iteration counts manually.

If you set max_iterations explicitly, your value is always respected:

python

# Auto-boost: default 4 → becomes 8 because skills are active
agent = Agent.with_builtins(llm=llm, skills=["code-workflow-assistant"])

# Explicit override: stays at 20 regardless of skills
agent = Agent.with_builtins(llm=llm, skills=["code-workflow-assistant"], max_iterations=20)

Tool bundle validation

You can verify that every tool name referenced in the skill bundles actually exists:

python

from shipit_agent.builtins import get_builtin_tool_map
from shipit_agent.skills.tool_bundles import validate_tool_bundles

errors = validate_tool_bundles(set(get_builtin_tool_map(llm=llm).keys()))
assert errors == []  # no unknown tool references

DeepAgent with skills

DeepAgent supports the same skill API and passes it through to its inner Agent.

python

from shipit_agent import DeepAgent
from examples.run_multi_tool_agent import build_llm_from_env

llm = build_llm_from_env("bedrock")

deep_agent = DeepAgent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skills=["database-architect"],
    default_skill_ids=["code-workflow-assistant"],
    auto_use_skills=True,
)

result = deep_agent.run(
    """
    Investigate a slow database-backed feature.

    Context:
    - Service: billing-api
    - Endpoint: GET /api/billing/alerts
    - Current latency: 4.8s p95
    - Database: PostgreSQL
    - Known issue: query joins alerts, tenants, invoices, and users

    Please:
    1. Investigate the likely bottlenecks
    2. Plan a concrete fix
    3. Review the implementation approach
    """
)
print(getattr(result, "output", str(result)))

Detailed DeepAgent example:

python

from shipit_agent import DeepAgent
from examples.run_multi_tool_agent import build_llm_from_env

llm = build_llm_from_env("bedrock")

deep_agent = DeepAgent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skills=["database-architect"],
    default_skill_ids=["code-workflow-assistant"],
    auto_use_skills=True,
    max_iterations=10,
)

prompt = """
Investigate a slow database-backed feature in the billing-api service.

Context:
- Endpoint: GET /api/billing/alerts
- Current latency: 4.8s p95
- PostgreSQL backend
- Main join path: alerts -> tenants -> invoices -> users
- Requirement: no breaking API changes
- Requirement: keep the rollout safe for multi-tenant data

Please:
1. Investigate likely bottlenecks
2. Produce a step-by-step fix plan
3. Suggest query/index/schema improvements
4. Review the implementation and rollout risks
"""

result = deep_agent.run(prompt)
print(getattr(result, "output", str(result)))

Runtime management works the same way:

python

deep_agent.add_skill("incident-summary-writer")

for skill in deep_agent.skills:
    print(skill.id)

Search the catalog:

python

for skill in deep_agent.search_skills("brand")[:5]:
    print(skill.id)

Streaming with skills

Both Agent and DeepAgent support stream() for real-time event output. Skills work the same way in streaming mode — the agent gets skill prompts and tools, and you see events as they happen.

Agent streaming

python

from shipit_agent import Agent
from examples.run_multi_tool_agent import build_llm_from_env

llm = build_llm_from_env("bedrock")

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skills=["code-workflow-assistant", "database-architect"],
    auto_use_skills=True,
    max_iterations=10,
)

for event in agent.stream("Debug the slow billing query and suggest index improvements."):
    if event.type == "tool_called":
        print(f"  [tool] {event.message}")
    elif event.type == "tool_completed":
        print(f"  [done] {event.message}")
    elif event.type == "step_started":
        print(f"  [step] iteration {event.payload.get('iteration', '?')}")
    elif event.type == "run_completed":
        print(f"\n--- Final output ---")
        print(event.payload.get("output", ""))

DeepAgent streaming

python

from shipit_agent import DeepAgent

deep_agent = DeepAgent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skills=["devops-automation"],
    auto_use_skills=False,
    max_iterations=8,
)

final_output = ""
for event in deep_agent.stream(
    "Create a CI/CD pipeline plan for a FastAPI + PostgreSQL service on AWS ECS."
):
    if event.type == "tool_called":
        print(f"  [tool] {event.message}")
    elif event.type == "tool_completed":
        print(f"  [done] {event.message}")
    elif event.type == "run_completed":
        final_output = event.payload.get("output", "")
        print(f"  [finished]")

print(final_output)

Event types you will see:

Event type	When it fires
`run_started`	Agent loop begins
`step_started`	Each LLM iteration starts
`tool_called`	A tool is invoked
`tool_completed`	A tool finishes
`tool_failed`	A tool errors out
`reasoning_started` / `reasoning_completed`	Model thinking blocks (when supported)
`run_completed`	Final output ready

Multi-turn chat with skills

Use chat_session() (Agent) or chat() (DeepAgent) for persistent multi-turn conversations. The agent retains full message history across turns — no clarifying questions needed.

Agent chat session

python

from shipit_agent import Agent
from shipit_agent import InMemoryMemoryStore

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skills=["database-architect", "code-workflow-assistant"],
    auto_use_skills=True,
    memory_store=InMemoryMemoryStore(),
    max_iterations=10,
)

chat = agent.chat_session(session_id="billing-debug")

# Turn 1: Provide context
r1 = chat.send(
    """
    I'm working on billing-api (FastAPI + PostgreSQL 15).
    The GET /api/billing/alerts endpoint joins alerts, tenants,
    invoices, and users. Currently at 4.8s p95.
    Help me diagnose the root cause.
    """
)
print(r1.output)

# Turn 2: Follow up — agent remembers everything from turn 1.
r2 = chat.send("Give me the exact CREATE INDEX statements I should run.")
print(r2.output)

# Turn 3: Another follow-up.
r3 = chat.send("How do I deploy these index changes with zero downtime?")
print(r3.output)

DeepAgent chat

python

from shipit_agent import DeepAgent
from shipit_agent import InMemoryMemoryStore

deep = DeepAgent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skills=["full-stack-developer", "database-architect"],
    auto_use_skills=True,
    memory_store=InMemoryMemoryStore(),
    max_iterations=10,
)

chat = deep.chat(session_id="deep-debug-session")

# Turn 1
r1 = chat.send("Our billing-api alerts endpoint is slow (4.8s). PostgreSQL 15, joins 4 tables.")
print(r1.output)

# Turn 2 — agent retains context, no re-asking.
r2 = chat.send("Now show me the query rewrite with the indexes you recommended.")
print(r2.output)

Key differences:

Feature	`Agent.chat_session()`	`DeepAgent.chat()`
Method	`chat_session(session_id=...)`	`chat(session_id=...)`
Returns	`AgentChatSession`	`AgentChatSession`
Send	`chat.send(prompt)`	`chat.send(prompt)`
Stream	`chat.stream(prompt)`	`chat.stream(prompt)`
History	`chat.history()`	`chat.history()`
Session store	Auto-created or pass explicitly	Auto-created or pass explicitly

Create your own skills

Use the authoring helpers:

skill_id_from_name(...)
create_skill(...)
SkillCatalog(...)

Create a skill object

python

from shipit_agent import create_skill, skill_id_from_name

print(skill_id_from_name("Release Notes Writer"))

skill = create_skill(
    name="Release Notes Writer",
    description="Write concise release notes from code changes.",
    category="Development",
    tags=["release", "changelog", "docs"],
    trigger_phrases=["write release notes", "create a changelog"],
    prompt_template=(
        "Organize the answer into Highlights, Fixes, Known Issues, and Upgrade Notes. "
        "Prefer concise bullets and call out breaking changes clearly."
    ),
)

Save it into a local catalog

python

from pathlib import Path
from shipit_agent import SkillCatalog

catalog_path = Path("custom_skills.json")
catalog = SkillCatalog(catalog_path)

catalog.add(skill)

Or create-and-save in one step:

python

catalog.create(
    name="Incident Summary Writer",
    description="Turn outage notes into a concise incident summary.",
    category="Operations",
    trigger_phrases=["summarize this incident", "write an incident report"],
    prompt_template="Structure the answer as Summary, Timeline, Root Cause, Impact, and Follow-ups.",
)

Detailed custom-skill authoring example:

python

from pathlib import Path
from shipit_agent import SkillCatalog, create_skill

catalog_path = Path("custom_skills.json")
catalog = SkillCatalog(catalog_path)

release_notes_skill = create_skill(
    name="Release Notes Writer",
    description="Write concise release notes from code changes.",
    category="Development",
    tags=["release", "changelog", "docs"],
    use_cases=["Summarize merged pull requests into release notes",
        "Write internal release notes for engineering and support teams",],
    how_to_use=["Provide merged PR summaries or a change list",
        "Ask for highlights, fixes, known issues, and upgrade notes",],
    trigger_phrases=["write release notes",
        "create a changelog",
        "summarize this release",],
    prompt_template=(
        "Organize the answer into Highlights, Fixes, Known Issues, and Upgrade Notes. "
        "Prefer concise bullets and call out breaking changes clearly."
    ),
)

catalog.add(release_notes_skill)

print("Saved skill ids:")
for skill in catalog.list():
    print("-", skill.id)

Use a custom skill catalog in Agent

Point the agent at your custom file and attach the new skill by id:

python

from shipit_agent import Agent

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skill_source="custom_skills.json",
    auto_use_skills=False,
    skills=["release-notes-writer"],
)

result = agent.run(
    "Write release notes for these changes: fixed login redirect loop and improved dashboard load time."
)
print(result.output)

Detailed custom-catalog Agent example:

python

from shipit_agent import Agent
from examples.run_multi_tool_agent import build_llm_from_env

llm = build_llm_from_env("bedrock")

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skill_source="custom_skills.json",
    auto_use_skills=False,
    skills=["release-notes-writer"],
)

prompt = """
Write release notes for version 2.4.0.

Changes:
- Fixed login redirect loop for expired sessions
- Improved dashboard load time by reducing duplicate invoice queries
- Renamed Billing Settings to Billing Preferences
- Added tenant-based billing alerts
- Known issue: export jobs may still time out for large tenants

Audience:
- customer-facing release notes
- concise wording
- call out anything users need to know before upgrading
"""

result = agent.run(prompt)
print(result.output)

Use the same custom catalog in DeepAgent

python

from shipit_agent import DeepAgent

deep_agent = DeepAgent.with_builtins(
    llm=llm,
    project_root="/tmp",
    skill_source="custom_skills.json",
    auto_use_skills=False,
    skills=["release-notes-writer"],
)

result = deep_agent.run(
    "Write release notes for these changes: fixed login redirect loop and improved dashboard load time."
)
print(getattr(result, "output", str(result)))

Detailed custom-catalog DeepAgent example:

python

from shipit_agent import DeepAgent
from examples.run_multi_tool_agent import build_llm_from_env

llm = build_llm_from_env("bedrock")

deep_agent = DeepAgent.with_builtins(
    llm=llm,
    skill_source="custom_skills.json",
    auto_use_skills=False,
    skills=["release-notes-writer"],
    max_iterations=8,
)

prompt = """
Prepare release notes for version 2.4.0.

Changes:
- Fixed login redirect loop for expired sessions
- Improved dashboard load time by reducing duplicate invoice queries
- Renamed Billing Settings to Billing Preferences
- Added tenant-based billing alerts
- Known issue: export jobs may still time out for large tenants

Please:
1. Organize the release notes cleanly
2. Highlight user-visible changes first
3. Call out the known issue clearly
4. Add upgrade notes only if needed
"""

result = deep_agent.run(prompt)
print(getattr(result, "output", str(result)))

When to use skills vs tools

Use skills when...	Use tools when...
You want a reusable workflow or style	You need a real capability like web search or file IO
You want domain-specific guidance	You need to call an external system
You want to preload how the agent should think	You want the agent to do work outside the model

In practice:

pair skills with built-in tools
use skills to steer behavior
use tools to execute

Real-world examples

These examples show skills + tools working together on practical tasks. Each skill auto-attaches the right tools — the agent can read/write files, run code, search the web, and more without manual tool wiring.

Build a full project (full-stack-developer)

The full-stack-developer skill brings 13 tools including write_file, edit_file, workspace_files, bash, run_code, and plan_task. The agent can scaffold an entire project from scratch.

python

from shipit_agent import Agent
from shipit_agent import InMemoryMemoryStore
from examples.run_multi_tool_agent import build_llm_from_env

llm = build_llm_from_env("bedrock")

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp/my-project",
    skills=["full-stack-developer"],
    auto_use_skills=False,
    memory_store=InMemoryMemoryStore(),
    max_iterations=15,
)

result = agent.run(
    """
    Create a FastAPI REST API project with the following:

    1. Project structure:
       - app/main.py (FastAPI app with CORS)
       - app/models.py (SQLAlchemy models: User, Task)
       - app/routes/tasks.py (CRUD endpoints for tasks)
       - app/database.py (SQLite connection)
       - requirements.txt
       - README.md

    2. The Task model should have: id, title, description, status, created_at, user_id
    3. Endpoints: GET /tasks, POST /tasks, GET /tasks/{id}, PUT /tasks/{id}, DELETE /tasks/{id}
    4. Include proper error handling and status codes

    Create all the files and verify the structure.
    """
)

print(f"Skills used: {result.metadata['used_skills']}")
print(f"Tools injected: {result.metadata['used_skill_tools']}")
print(result.output)

Scrape and analyse data (web-scraper-pro)

The web-scraper-pro skill brings web_search, open_url, playwright_browse, plus file tools to save scraped data.

python

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp/scrape-output",
    skills=["web-scraper-pro"],
    auto_use_skills=False,
    memory_store=InMemoryMemoryStore(),
    max_iterations=12,
)

result = agent.run(
    """
    Research the top 5 Python web frameworks in 2024.
    For each framework, find:
    - GitHub stars count
    - Latest version
    - Key differentiator

    Save the results to /tmp/scrape-output/frameworks.json as structured JSON,
    then create a markdown summary in /tmp/scrape-output/summary.md.
    """
)

print(result.output)

Build a static website (portfolio-website-builder)

The portfolio-website-builder skill brings write_file, edit_file, workspace_files, bash, and run_code.

python

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp/portfolio-site",
    skills=["portfolio-website-builder"],
    auto_use_skills=False,
    max_iterations=12,
)

result = agent.run(
    """
    Create a personal portfolio website with:
    - index.html (responsive landing page with hero section, about, projects, contact)
    - styles.css (modern design, dark theme, CSS grid layout)
    - script.js (smooth scrolling, dark/light toggle)

    Use semantic HTML, modern CSS (no frameworks), and vanilla JS.
    Include 3 sample project cards with placeholder content.
    """
)

print(result.output)

Security audit (security-engineer)

The security-engineer skill brings bash, web_search, grep_files, read_file, run_code, and verify_output for a thorough audit.

python

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp/my-project",
    skills=["security-engineer"],
    auto_use_skills=False,
    memory_store=InMemoryMemoryStore(),
    max_iterations=10,
)

result = agent.run(
    """
    Audit the project at /tmp/my-project for common security issues:

    1. Check for hardcoded secrets, API keys, or credentials in source files
    2. Look for SQL injection vulnerabilities in database queries
    3. Check for missing input validation on API endpoints
    4. Review CORS configuration
    5. Check dependency versions for known CVEs

    Save a security report to /tmp/my-project/SECURITY_REPORT.md with
    findings ranked by severity (Critical, High, Medium, Low).
    """
)

print(result.output)

DevOps pipeline (devops-automation)

The devops-automation skill brings 12 tools including plan_task, verify_output, web tools for docs lookup, and file tools for generating configs.

python

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp/my-project",
    skills=["devops-automation"],
    auto_use_skills=False,
    memory_store=InMemoryMemoryStore(),
    max_iterations=12,
)

result = agent.run(
    """
    Set up a CI/CD pipeline for a FastAPI + PostgreSQL project:

    1. Create a Dockerfile (multi-stage build, non-root user)
    2. Create docker-compose.yml (app + postgres + redis)
    3. Create .github/workflows/ci.yml (lint, test, build, push)
    4. Create .github/workflows/deploy.yml (deploy to AWS ECS)
    5. Create a Makefile with common dev commands

    Write all files and verify the Docker build would work.
    """
)

print(result.output)

DeepAgent streaming with skills

Stream events in real time while the agent works through a complex task.

python

from shipit_agent import DeepAgent
from shipit_agent import InMemoryMemoryStore

deep = DeepAgent.with_builtins(
    llm=llm,
    project_root="/tmp/stream-demo",
    skills=["full-stack-developer"],
    auto_use_skills=False,
    memory_store=InMemoryMemoryStore(),
    verify=True,
    max_iterations=12,
)

final_output = ""
for event in deep.stream(
    """
    Create a Python CLI tool that takes a GitHub repo URL and generates
    a project summary. Write it to /tmp/stream-demo/repo_summary.py.
    Include argument parsing, error handling, and a README.md.
    """
):
    if event.type == "step_started":
        print(f"  [step {event.payload.get('iteration', '?')}]")
    elif event.type == "tool_called":
        print(f"  [tool] {event.message}")
    elif event.type == "tool_completed":
        print(f"  [done] {event.message}")
    elif event.type == "run_completed":
        final_output = event.payload.get("output", "")

print("\n--- Final output ---")
print(final_output)

Multi-turn chat: iterative project building

Use chat sessions to build a project iteratively, turn by turn.

python

from shipit_agent import Agent
from shipit_agent import InMemoryMemoryStore

agent = Agent.with_builtins(
    llm=llm,
    project_root="/tmp/chat-project",
    skills=["full-stack-developer", "database-architect"],
    auto_use_skills=True,
    memory_store=InMemoryMemoryStore(),
    max_iterations=12,
)

chat = agent.chat_session(session_id="project-build")

# Turn 1: Scaffold
r1 = chat.send("Create a FastAPI project with a User model and auth endpoints at /tmp/chat-project.")
print("--- Turn 1: Scaffold ---")
print(r1.output[:500])

# Turn 2: Add features (agent remembers the project structure)
r2 = chat.send("Now add a Task model with CRUD endpoints. Tasks belong to users.")
print("\n--- Turn 2: Add features ---")
print(r2.output[:500])

# Turn 3: Optimize (agent remembers both models)
r3 = chat.send("Add database indexes for the queries and create a migration script.")
print("\n--- Turn 3: Optimize ---")
print(r3.output[:500])

# Turn 4: Deploy config
r4 = chat.send("Create a Dockerfile and docker-compose.yml for this project.")
print("\n--- Turn 4: Deploy ---")
print(r4.output[:500])

Examples and notebooks

Relevant notebooks in this repo:

notebooks/27_skills_catalog_and_usage.ipynb — catalog browse, tool bundles, memory, multi-turn chat
notebooks/28_create_and_use_custom_skills.ipynb — custom skill authoring and catalogs
notebooks/29_deep_agent_with_skills.ipynb — DeepAgent + skills + memory + verification + reflection + sub-agents

These cover:

fetching all packaged skills
attaching prebuilt skills to Agent with memory
attaching prebuilt skills to DeepAgent
multi-turn chat with context retention
verification and reflection loops
sub-agent delegation with skills
creating a new custom skill
saving a custom skill catalog
using that custom catalog from both agent types

API quick reference

python

from shipit_agent import (
    Agent,
    DeepAgent,
    Skill,
    SkillCatalog,
    SkillRegistry,
    FileSkillRegistry,
    create_skill,
    skill_id_from_name,
)

Main runtime methods:

agent.available_skills()
agent.search_skills(query)
agent.add_skill(skill_id_or_skill)
deep_agent.search_skills(query)
deep_agent.add_skill(skill_id_or_skill)

Skill Pipeline — How it works

The Skill Pipeline is a multi-stage process that ensures the right domain knowledge is applied to every request. It works identically for both Agent and DeepAgent.

Execution Flow

mermaid

graph TD
    A[User Prompt] --> B[Registry Lookup]
    B --> C{Match Found?}
    C -- Yes --> D[Load Skill Prompt]
    C -- Yes --> E[Attach Required Tools]
    C -- No --> F[Standard Prompt]
    D --> G[Agent Thinking]
    E --> G
    F --> G
    G --> H[Final Output]

1. Skill Resolution (Agent)

For a standard Agent, the pipeline resolves skills once at the start of the run().

python

from shipit_agent import Agent

agent = Agent.with_builtins(
    llm=llm,
    auto_use_skills=True  # Enables the matching pipeline
)

# Pipeline triggers: "database" matches the 'database-architect' skill
result = agent.run("Optimize this SQL query: SELECT * FROM users")

print(result.metadata["used_skills"]) # ['database-architect']

2. Deep Agent Integration

For a DeepAgent, the Skill Pipeline is persistent. The selected skills shape the initial plan and remain active across every iteration of the loop.

python

from shipit_agent import DeepAgent

# Skills are fanned out to all sub-agents in the DeepAgent factory
deep_agent = DeepAgent.with_builtins(
    llm=llm,
    skills=["code-workflow-assistant"] # Fixed skill in the pipeline
)

result = deep_agent.run("Refactor the auth module")

Summary of the Pipeline

Explicit Skills: Always bypass matching and stay in the pipeline.
Auto-Matched Skills: Dynamically enter the pipeline based on the prompt.
Tool Injection: Skills can automatically upgrade the agent's "hands" by injecting specialized tools (e.g., a SQL skill injecting a db_query tool).

Recommended pattern

Start with the packaged catalog.
Search for a skill that already matches your use case.
Attach it explicitly with skills=[...] for deterministic behavior.
Leave auto_use_skills=True if you also want prompt-based matching.
Create a custom catalog only when your workflow is specific to your team or product.

What a skill contains

Packaged skills catalog

Agent with skills

skills vs auto_use_skills

Skills improve the approach, not the missing context

Agent constructor parameters

Skills can also attach tools automatically

Automatic iteration boost

Tool bundle validation

DeepAgent with skills

Streaming with skills

Agent streaming

DeepAgent streaming

Multi-turn chat with skills

Agent chat session

DeepAgent chat

Create your own skills

Create a skill object

Save it into a local catalog

Use a custom skill catalog in Agent

Use the same custom catalog in DeepAgent

When to use skills vs tools

Real-world examples

Build a full project (full-stack-developer)

Scrape and analyse data (web-scraper-pro)

Build a static website (portfolio-website-builder)

Security audit (security-engineer)

DevOps pipeline (devops-automation)

DeepAgent streaming with skills

Multi-turn chat: iterative project building

Examples and notebooks

API quick reference

Skill Pipeline — How it works

Execution Flow

1. Skill Resolution (Agent)

2. Deep Agent Integration

Summary of the Pipeline

Recommended pattern

`skills` vs `auto_use_skills`