@graphorin/provider

Vendor-neutral LLM provider layer for the Graphorin framework.

The package owns four moving parts:

  1. createProvider(...) — wraps any adapter in a stable Provider shape with sensitivity, capability, and reasoning-retention defaults.
  2. Adapters — vercelAdapter (default cloud path; wraps the Vercel AI SDK), ollamaAdapter (direct Ollama HTTP), llamaCppServerAdapter (the upstream llama-server binary from llama.cpp), and openAICompatibleAdapter (LMStudio / LocalAI / vLLM / Together-style self-host endpoints).
  3. Middleware — composeProviderMiddleware([...]) enforces a canonical order at startup and throws MiddlewareOrderingError on violation. Built-ins: withTracing, withRetry, withRateLimit, withCostLimit, withCostTracking, withFallback, and withRedaction (mandatory in production).
  4. Token counting — pluggable TokenCounter dispatcher (sketched below). Default JsTiktokenCounter for OpenAI-compatible models; per-vendor native counters for Anthropic, Google, and Bedrock; heuristic fallback for unknown providers with a one-time WARN.
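
To make the dispatch concrete, here is a minimal sketch of the pattern. The TokenCounter shape, the resolveCounter name, and the 4-characters-per-token heuristic are illustrative assumptions, not the actual exports of @graphorin/provider/counters:

```ts
// Hypothetical shape of the counter dispatch described above; the real
// interface in '@graphorin/provider/counters' may differ.
interface TokenCounter {
  countTokens(text: string, model: string): number;
}

let warnedHeuristic = false;

function resolveCounter(
  vendor: string,
  counters: Map<string, TokenCounter>,
): TokenCounter {
  // Native counters are registered per vendor, e.g. 'anthropic', 'google', 'bedrock'.
  const native = counters.get(vendor);
  if (native) return native;
  if (!warnedHeuristic) {
    console.warn(`[provider] no native token counter for '${vendor}', using heuristic`);
    warnedHeuristic = true; // one-time WARN, per the contract above
  }
  // Crude ~4-chars-per-token heuristic fallback for unknown vendors.
  return { countTokens: (text) => Math.ceil(text.length / 4) };
}
```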

Installation

```bash
pnpm add @graphorin/provider
# Optional peer for the cloud adapter:
pnpm add ai
# Optional peer for the default token counter:
pnpm add js-tiktoken
```

For the in-process llama.cpp companion adapter, install the separate package:

```bash
pnpm add @graphorin/provider-llamacpp-node node-llama-cpp
```

Quick start

```ts
import { composeProviderMiddleware, createProvider } from '@graphorin/provider';
import { vercelAdapter } from '@graphorin/provider/adapters/vercel';
import {
  withCostLimit,
  withFallback,
  withRateLimit,
  withRedaction,
  withRetry,
  withTracing,
} from '@graphorin/provider/middleware';
import { BUILT_IN_PATTERNS } from '@graphorin/observability/redaction/patterns';
import { openai } from '@ai-sdk/openai';

const provider = createProvider(vercelAdapter(openai('gpt-4o')));
// Any secondary provider works as the fallback target:
const fallbackProvider = createProvider(vercelAdapter(openai('gpt-4o-mini')));

const safeProvider = composeProviderMiddleware([
  withTracing(),
  withRetry({ maxRetries: 3 }),
  withRateLimit({ requestsPerMinute: 60 }),
  withCostLimit({ maxPerSession: 1.0 }),
  withFallback([fallbackProvider]),
  withRedaction({ patterns: BUILT_IN_PATTERNS }), // INNERMOST
])(provider);
```
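
Because the composer enforces the canonical order at startup, a mis-ordered stack fails before any request is made. A minimal sketch, assuming MiddlewareOrderingError is exported from the errors module and that composition itself throws:

```ts
import { MiddlewareOrderingError } from '@graphorin/provider/errors';

try {
  // Invalid: withRedaction must sit in the innermost (last) slot.
  composeProviderMiddleware([
    withRedaction({ patterns: BUILT_IN_PATTERNS }),
    withTracing(),
  ]);
} catch (err) {
  if (err instanceof MiddlewareOrderingError) {
    // Rejected at startup, before any LLM traffic is sent.
  }
}
```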

Local-first stack

```ts
import { ollamaAdapter } from '@graphorin/provider/adapters/ollama';

// Auto-classified as 'loopback' — no warning, no first-run prompt.
const local = createProvider(
  ollamaAdapter({ model: 'llama3.1:8b', baseUrl: 'http://127.0.0.1:11434' }),
);
```

The same LocalProviderTrust classifier ('loopback' | 'private' | 'public-tls' | 'public-cleartext') drives trust auto-detection, the sensitivity-tier defaults, and the withRedaction policy table for every baseUrl-driven adapter: ollamaAdapter, llamaCppServerAdapter, and openAICompatibleAdapter. The classifier lives at @graphorin/provider/trust. Adapters configured with a public-cleartext baseUrl refuse to start and throw LocalProviderInsecureTransportError.
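
The tiering rules can be pictured roughly as follows. Only the LocalProviderTrust union and the module path come from the package; the function name and the exact host checks are illustrative assumptions:

```ts
import type { LocalProviderTrust } from '@graphorin/provider/trust';

// Illustrative only: the real classifier export inside
// '@graphorin/provider/trust' may use different host rules.
function classifyBaseUrl(baseUrl: string): LocalProviderTrust {
  const { protocol, hostname } = new URL(baseUrl);
  if (hostname === 'localhost' || hostname === '127.0.0.1' || hostname === '[::1]') {
    return 'loopback';
  }
  // RFC 1918 private ranges: 10/8, 172.16/12, 192.168/16.
  if (/^(10\.|192\.168\.|172\.(1[6-9]|2\d|3[01])\.)/.test(hostname)) {
    return 'private';
  }
  return protocol === 'https:' ? 'public-tls' : 'public-cleartext';
}

classifyBaseUrl('http://127.0.0.1:11434'); // ⇒ 'loopback'
classifyBaseUrl('http://llm.example.com'); // ⇒ 'public-cleartext' (refused at startup)
```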

Cost-tier vocabulary

```ts
import type { ModelHint } from '@graphorin/core';
import { classifyModelTier } from '@graphorin/provider/model-tier';

classifyModelTier(provider); // ⇒ 'fast' | 'balanced' | 'smart' | undefined
```

The classifier is consumed by the agent runtime (Phase 12) to validate operator-supplied per-tier mappings and to surface tier-not-mapped recommendations.
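
As a hedged sketch of how that consumption might look, assuming a plain object as the operator-supplied mapping (the runtime's actual Phase 12 API may differ; only classifyModelTier is a real export here):

```ts
// Hypothetical operator-supplied per-tier mapping, reusing providers
// constructed in the quick-start snippets above.
const tierMap = {
  fast: createProvider(vercelAdapter(openai('gpt-4o-mini'))),
  smart: safeProvider,
};

for (const [tier, p] of Object.entries(tierMap)) {
  const detected = classifyModelTier(p);
  if (detected !== undefined && detected !== tier) {
    // Surfaced by the runtime as a tier-not-mapped recommendation.
    console.warn(`tier '${tier}' maps to a provider classified as '${detected}'`);
  }
}
```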

Modules

| Module | Description |
| --- | --- |
| @graphorin/provider | Vendor-neutral LLM provider layer for the Graphorin framework. |
| adapters/llamacpp-server | Direct adapter for the upstream llama-server binary from the llama.cpp project. The binary speaks the OpenAI-compatible REST contract end-to-end (POST /v1/chat/completions, POST /v1/completions, POST /v1/embeddings); streaming is via text/event-stream chunks terminated by data: [DONE], exactly as in the upstream OpenAI shape. |
| adapters/ollama | Direct adapter for the Ollama HTTP API. Speaks the native Ollama streaming JSON protocol (POST /api/chat returning newline-delimited JSON objects). For operators who prefer the OpenAI-compatible variant exposed by recent Ollama releases, the generic openAICompatibleAdapter is the better choice; both adapters share the same LocalProviderTrust classifier and LocalProviderInsecureTransportError startup behaviour. |
| adapters/openai-compatible | Generic OpenAI-compatible adapter that works against any HTTP server speaking the /v1/chat/completions REST contract. Tested deployments include LMStudio (default port 1234), LocalAI (default port 8080), vLLM (python -m vllm.entrypoints.openai.api_server, default port 8000), Together-style self-host endpoints, and any other server in the OpenAI-compatible ecosystem (a usage sketch follows this table). |
| adapters/vercel | vercelAdapter — wraps a Vercel AI SDK LanguageModel-shaped value into a Graphorin Provider. The adapter is the default cloud path: it speaks the AI SDK's streamText / generateText API and maps the resulting events onto the canonical import('@graphorin/core').ProviderEvent discriminated union. |
| counters | Token-counter dispatcher and per-vendor strategies. |
| errors | Public error surface for @graphorin/provider. |
| middleware | Middleware barrel — the canonical-order composer plus seven built-in middlewares. |
| model-tier | Per-provider model-tier auto-classifier. |
| reasoning | Reasoning-content lifecycle helpers. |
| trust | Trust subsystem barrel — exports the shared classifier and the per-tier sensitivity defaults. |
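
Finally, a usage sketch for the generic adapter. The import path comes from the module list above; the option names mirror the ollamaAdapter example, and the model identifier is a placeholder for whatever the local server has loaded — treat all three as assumptions:

```ts
import { openAICompatibleAdapter } from '@graphorin/provider/adapters/openai-compatible';

// LMStudio serves the OpenAI-compatible contract on port 1234 by default;
// loopback URLs classify as 'loopback', so no warning and no first-run prompt.
const lmstudio = createProvider(
  openAICompatibleAdapter({
    model: 'qwen2.5-7b-instruct', // placeholder: use the model your server exposes
    baseUrl: 'http://127.0.0.1:1234/v1',
  }),
);
```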