Activity Buffer API¶

The Activity Buffer provides in-memory and persistent message tracking for SW4RM agents. It integrates with the Three-ID model for deduplication and workflow correlation, and tracks ACK stage progression for reliable message delivery.

Overview¶

The Activity Buffer serves as the durable log for message lifecycle management:

Message Recording: Track all incoming and outgoing messages
ACK Tracking: Monitor acknowledgment progression through stages
Deduplication: Prevent duplicate processing using idempotency tokens
Persistence: Survive restarts with optional persistent storage
Pruning: Manage memory with configurable eviction strategies

Source: sdks/py_sdk/sw4rm/activity_buffer.py

Three-ID Model Integration¶

The Activity Buffer works with the Three-ID model defined in the SW4RM protocol:

ID	Purpose	Buffer Usage
`message_id`	Per-attempt unique identifier	Primary lookup key
`idempotency_token`	Deduplication across retries	Checked via `get_by_idempotency_token()`
`correlation_id`	Workflow/conversation linking	Stored in envelope, accessible via record

ActivityRecord¶

Each recorded message is stored as an ActivityRecord:

from dataclasses import dataclass, field
from typing import Dict, Any

@dataclass
class ActivityRecord:
    message_id: str
    direction: str  # "in" or "out"
    envelope: Dict[str, Any]
    ts_ms: int = field(default_factory=lambda: int(time.time() * 1000))
    ack_stage: int = ACK_STAGE_UNSPECIFIED
    error_code: int = ERROR_CODE_UNSPECIFIED
    ack_note: str = ""

    def ack(self, stage: int, error_code: int = 0, note: str = "") -> None:
        """Update ACK stage with optional error code and note."""
        self.ack_stage = stage
        self.error_code = error_code
        self.ack_note = note

    @property
    def state(self) -> int:
        """Get envelope state from the envelope dict."""
        return self.envelope.get("state", ENVELOPE_STATE_UNSPECIFIED)

    @property
    def idempotency_token(self) -> str:
        """Get idempotency token from the envelope dict."""
        return self.envelope.get("idempotency_token", "")

    @property
    def correlation_id(self) -> str:
        """Get correlation ID from the envelope dict."""
        return self.envelope.get("correlation_id", "")

Fields¶

Field	Type	Description
`message_id`	`str`	Unique identifier for this message attempt
`direction`	`str`	`"in"` for incoming, `"out"` for outgoing
`envelope`	`Dict[str, Any]`	Full message envelope
`ts_ms`	`int`	Timestamp in milliseconds since epoch
`ack_stage`	`int`	Current ACK stage (see ACK Stage Progression)
`error_code`	`int`	Error code if failed (from `constants.py`)
`ack_note`	`str`	Human-readable note about the ACK

ACK Stage Progression¶

Messages progress through ACK stages as they are processed:

UNSPECIFIED (0) → RECEIVED (1) → READ (2) → FULFILLED (3)
                                          ↘ REJECTED (4)
                                          ↘ FAILED (5)
                                          ↘ TIMED_OUT (6)

Stage	Value	Description
`ACK_STAGE_UNSPECIFIED`	0	Not yet acknowledged
`RECEIVED`	1	Router durably accepted the message
`READ`	2	Target validated and accepted for processing
`FULFILLED`	3	Target completed processing successfully
`REJECTED`	4	Target rejected the message (validation failure)
`FAILED`	5	Processing failed (internal error)
`TIMED_OUT`	6	Processing exceeded timeout

Terminal States¶

Use is_terminal_state() to check if an envelope has reached a final state:

from sw4rm.activity_buffer import is_terminal_state
from sw4rm import constants as C

# Terminal states: FULFILLED, REJECTED, FAILED, TIMED_OUT
is_terminal_state(C.FULFILLED)    # True
is_terminal_state(C.REJECTED)     # True
is_terminal_state(C.READ)         # False - still processing

ActivityBuffer (In-Memory)¶

The in-memory ActivityBuffer is suitable for development and short-lived agents:

class ActivityBuffer:
    def __init__(
        self,
        *,
        max_items: int = 1000,
        strategy: Optional[BufferStrategy] = None,
        dedup_window_s: int = 3600
    ) -> None:
        """Create an in-memory activity buffer.

        Args:
            max_items: Maximum records before pruning (default: 1000)
            strategy: Pruning strategy (default: FIFOBufferStrategy)
            dedup_window_s: Deduplication window in seconds (default: 3600)
        """

Methods¶

record_incoming(envelope)¶

def record_incoming(self, envelope: Dict[str, Any]) -> ActivityRecord:
    """Record an incoming message.

    Args:
        envelope: The message envelope dictionary

    Returns:
        The created ActivityRecord
    """

record_outgoing(envelope)¶

def record_outgoing(self, envelope: Dict[str, Any]) -> ActivityRecord:
    """Record an outgoing message.

    Args:
        envelope: The message envelope dictionary

    Returns:
        The created ActivityRecord
    """

ack(ack)¶

def ack(self, ack: Dict[str, Any]) -> Optional[ActivityRecord]:
    """Process an ACK message.

    Args:
        ack: ACK message with 'ack_for_message_id', 'ack_stage',
             optional 'error_code' and 'note'

    Returns:
        Updated ActivityRecord if found, None otherwise
    """

get(message_id)¶

def get(self, message_id: str) -> Optional[ActivityRecord]:
    """Retrieve a record by message_id.

    Args:
        message_id: The message identifier

    Returns:
        ActivityRecord if found, None otherwise
    """

unacked()¶

def unacked(self) -> List[ActivityRecord]:
    """Get all records awaiting ACK.

    Returns:
        List of records with ack_stage in {UNSPECIFIED, RECEIVED, READ}
    """

recent(n)¶

def recent(self, n: int = 50) -> List[ActivityRecord]:
    """Get n most recent records.

    Args:
        n: Number of records to return (default: 50)

    Returns:
        List of most recent ActivityRecords
    """

update_state(message_id, new_state)¶

def update_state(self, message_id: str, new_state: int) -> Optional[ActivityRecord]:
    """Update record state.

    Args:
        message_id: The message identifier
        new_state: New envelope state from constants.EnvelopeState

    Returns:
        Updated ActivityRecord if found, None otherwise
    """

get_by_idempotency_token(token)¶

def get_by_idempotency_token(self, token: str) -> Optional[ActivityRecord]:
    """Find record by idempotency token for deduplication.

    Used to check if an operation with this token has already been
    processed within the deduplication window.

    Args:
        token: Idempotency token to look up

    Returns:
        ActivityRecord if found and within dedup window, None otherwise
    """

PersistentActivityBuffer¶

For production use, PersistentActivityBuffer provides disk persistence:

class PersistentActivityBuffer:
    def __init__(
        self,
        *,
        max_items: int = 1000,
        persistence: Optional[PersistenceBackend] = None,
        strategy: Optional[BufferStrategy] = None,
        dedup_window_s: int = 3600
    ) -> None:
        """Create a persistent activity buffer.

        Args:
            max_items: Maximum records before pruning (default: 1000)
            persistence: Storage backend (default: JSONFilePersistence)
            strategy: Pruning strategy (default: FIFOBufferStrategy)
            dedup_window_s: Deduplication window in seconds (default: 3600)
        """

Additional Methods¶

flush()¶

def flush(self) -> None:
    """Force save to persistent storage.

    Call this to ensure all changes are persisted immediately.
    """

reconcile()¶

def reconcile(self) -> List[PersistentActivityRecord]:
    """Return unacked outgoing messages that may need retry/reconciliation.

    Call this on startup to find messages that were sent but not
    acknowledged before shutdown.

    Returns:
        List of unacked outgoing records
    """

clear()¶

def clear(self) -> None:
    """Clear all activity records and persistent storage."""

Context Manager Support¶

PersistentActivityBuffer supports the context manager protocol for automatic flushing:

with PersistentActivityBuffer(persistence=JSONFilePersistence("./data")) as buffer:
    buffer.record_outgoing(envelope)
    # ... process messages ...
# Buffer is automatically flushed on exit

Buffer Strategies¶

Buffer strategies determine which records to evict when capacity is exceeded:

Strategy	Class	Behavior	Use Case
FIFO	`FIFOBufferStrategy`	Remove oldest first	Default, most predictable
LIFO	`LIFOBufferStrategy`	Remove newest first	Prioritize older messages
Random	`RandomBufferStrategy`	Random eviction	Load balancing

Using Custom Strategies¶

from sw4rm.buffer_strategy import LIFOBufferStrategy, RandomBufferStrategy

# Prioritize older messages
buffer = ActivityBuffer(max_items=500, strategy=LIFOBufferStrategy())

# Random eviction with reproducible seed
buffer = ActivityBuffer(
    max_items=500,
    strategy=RandomBufferStrategy(seed=42)
)

Creating Custom Strategies¶

Implement the BufferStrategy protocol:

from sw4rm.buffer_strategy import BufferStrategy, ItemId
from typing import Sequence, Iterable

class PriorityBufferStrategy:
    """Remove low-priority items first."""

    def __init__(self, get_priority):
        self._get_priority = get_priority

    def victims(self, order: Sequence[ItemId], excess: int) -> Iterable[ItemId]:
        if excess <= 0:
            return ()

        # Sort by priority (lowest first)
        sorted_items = sorted(order, key=self._get_priority)
        return sorted_items[:excess]

Usage Examples¶

Basic Recording and ACK Processing¶

from sw4rm.activity_buffer import ActivityBuffer
from sw4rm.envelope import build_envelope
from sw4rm import constants as C

# Create buffer
buffer = ActivityBuffer(max_items=1000)

# Record outgoing message
envelope = build_envelope(
    producer_id="agent-1",
    message_type=C.DATA,
    content_type="application/json",
    payload=b'{"action": "process"}'
)
record = buffer.record_outgoing(envelope)
print(f"Recorded message: {record.message_id}")

# Process incoming ACK
ack = {
    "ack_for_message_id": record.message_id,
    "ack_stage": C.RECEIVED,
}
updated = buffer.ack(ack)
print(f"Updated ACK stage: {updated.ack_stage}")  # 1 (RECEIVED)

# Later, fulfilled
ack["ack_stage"] = C.FULFILLED
buffer.ack(ack)
print(f"Final ACK stage: {updated.ack_stage}")  # 3 (FULFILLED)

Deduplication with Idempotency Tokens¶

from sw4rm.activity_buffer import ActivityBuffer, is_terminal_state

buffer = ActivityBuffer(dedup_window_s=3600)  # 1-hour window

def process_message(envelope):
    token = envelope.get("idempotency_token")

    if token:
        # Check for duplicate
        existing = buffer.get_by_idempotency_token(token)
        if existing:
            if is_terminal_state(existing.state):
                # Already processed, return cached result
                return {"status": "duplicate", "original": existing.message_id}
            else:
                # Still processing, reject duplicate
                return {"status": "in_progress", "original": existing.message_id}

    # Record and process new message
    record = buffer.record_incoming(envelope)
    result = do_actual_processing(envelope)

    # Update state based on result
    if result.success:
        buffer.update_state(record.message_id, C.FULFILLED_ENVELOPE)
    else:
        buffer.update_state(record.message_id, C.FAILED_ENVELOPE)

    return result

Reconciliation Pattern for Unacked Outgoing Messages¶

from sw4rm.activity_buffer import PersistentActivityBuffer
from sw4rm.persistence import JSONFilePersistence

def startup_reconciliation():
    """Retry unacked messages from previous run."""

    buffer = PersistentActivityBuffer(
        persistence=JSONFilePersistence("./agent_data/activity.json")
    )

    # Find messages that need retry
    unacked = buffer.reconcile()
    print(f"Found {len(unacked)} unacked outgoing messages")

    for record in unacked:
        print(f"Retrying message: {record.message_id}")

        # Re-send the message
        try:
            send_message(record.envelope)
        except Exception as e:
            print(f"Retry failed: {e}")
            buffer.update_state(record.message_id, C.FAILED_ENVELOPE)

    return buffer

Persistent Buffer with Recovery¶

from sw4rm.activity_buffer import PersistentActivityBuffer
from sw4rm.persistence import JSONFilePersistence
import signal
import sys

class AgentWithPersistence:
    def __init__(self, data_dir: str):
        self.buffer = PersistentActivityBuffer(
            persistence=JSONFilePersistence(f"{data_dir}/activity.json"),
            max_items=5000,
            dedup_window_s=7200,  # 2 hours
        )

        # Handle graceful shutdown
        signal.signal(signal.SIGINT, self._shutdown_handler)
        signal.signal(signal.SIGTERM, self._shutdown_handler)

    def _shutdown_handler(self, signum, frame):
        print("Shutting down, flushing buffer...")
        self.buffer.flush()
        sys.exit(0)

    def run(self):
        # Reconcile on startup
        unacked = self.buffer.reconcile()
        for record in unacked:
            self._retry_message(record)

        # Main processing loop
        while True:
            message = self._receive_message()
            self._process(message)

    def _process(self, message):
        record = self.buffer.record_incoming(message)

        try:
            result = self._handle_message(message)
            self.buffer.update_state(record.message_id, C.FULFILLED_ENVELOPE)
        except Exception as e:
            self.buffer.update_state(record.message_id, C.FAILED_ENVELOPE)
            raise
        finally:
            # Periodically flush
            self.buffer.flush()

Thread Safety¶

Not Thread-Safe

ActivityBuffer and PersistentActivityBuffer are not thread-safe. If using across threads, callers must synchronize access externally.

import threading

class ThreadSafeBuffer:
    def __init__(self):
        self._buffer = ActivityBuffer()
        self._lock = threading.Lock()

    def record_incoming(self, envelope):
        with self._lock:
            return self._buffer.record_incoming(envelope)

    def ack(self, ack):
        with self._lock:
            return self._buffer.ack(ack)