Quick Start¶

This guide will help you get started with ArbiterOS-alpha in minutes.

Basic Workflow¶

Create an ArbiterOSAlpha instance
Define your policies (checkers and routers)
Decorate your functions with @instruction
Use LangGraph normally with added governance

Step-by-Step Example¶

1. Define Your State¶

from typing import TypedDict

class State(TypedDict):
    query: str
    response: str
    confidence: float

2. Create ArbiterOS Instance and Policies¶

from arbiteros_alpha import ArbiterOSAlpha
from arbiteros_alpha.policy import HistoryPolicyChecker, MetricThresholdPolicyRouter
from arbiteros_alpha.instructions import GENERATE, EVALUATE

# Create instance
arbiter_os = ArbiterOSAlpha()

# Add policy checker to prevent direct toolcall after generate
arbiter_os.add_policy_checker(
    HistoryPolicyChecker(
        name="no_direct_toolcall",
        bad_sequence=[GENERATE, TOOL_CALL]
    )
)

# Add policy router to regenerate when confidence is low
arbiter_os.add_policy_router(
    MetricThresholdPolicyRouter(
        name="regenerate_on_low_confidence",
        key="confidence",
        threshold=0.6,
        target="generate"
    )
)

3. Define and Decorate Functions¶

@arbiter_os.instruction(GENERATE)
def generate(state: State) -> dict:
    """Generate AI response."""
    response = "AI generated response"
    return {"response": response}

@arbiter_os.instruction(EVALUATE)
def evaluate(state: State) -> dict:
    """Evaluate response quality."""
    confidence = 0.8  # Calculate confidence
    return {"confidence": confidence}

4. Use with LangGraph¶

from langgraph.graph import StateGraph, END

# Create graph
graph = StateGraph(State)

# Add nodes (decorated functions)
graph.add_node("generate", generate)
graph.add_node("evaluate", evaluate)

# Define edges
graph.set_entry_point("generate")
graph.add_edge("generate", "evaluate")
graph.add_edge("evaluate", END)

# Compile and run
app = graph.compile()
result = app.invoke({"query": "What is AI?", "response": "", "confidence": 0.0})

# Print execution history
arbiter_os.history.pprint()

Understanding the Output¶

When you run the code, you'll see:

Debug logs: Showing policy checks and routing decisions
Execution history: Formatted display with:
Timestamps
Input/Output states (in YAML format)
Policy check results (✓ pass, ✗ fail)
Policy routing decisions (→ arrow indicates routing)

Policy Types¶

HistoryPolicyChecker¶

Validates instruction sequences to prevent forbidden patterns:

checker = HistoryPolicyChecker(
    name="my_checker",
    bad_sequence=[GENERATE, TOOL_CALL]  # Prevents GENERATE→TOOL_CALL
)

MetricThresholdPolicyRouter¶

Routes based on metric thresholds:

router = MetricThresholdPolicyRouter(
    name="my_router",
    key="confidence",      # Metric key in output
    threshold=0.6,         # Minimum acceptable value
    target="retry_node"    # Where to route if below threshold
)

Next Steps¶

Explore the API Reference for detailed documentation