ctx.llm

Class: Llm

class Llm:
    def generate(
        self,
        messages: list[dict[str, str]],
        *,
        model: str | None = None,
        max_tokens: int | None = None,
        temperature: float | None = None,
        provider_options: dict | None = None,
    ) -> LlmResponse: ...

    def generate_object(
        self,
        messages: list[dict[str, str]],
        schema: dict,
        *,
        model: str | None = None,
        max_tokens: int | None = None,
        temperature: float | None = None,
        provider_options: dict | None = None,
    ) -> LlmResponse: ...

Methods

generate()

Generate text from an LLM.

Parameters:

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| `messages` | `list[dict[str, str]]` | Yes | Conversation messages with `role` and `content` |
| `model` | `str \| None` | No | Model identifier (resolution cascade applies) |
| `max_tokens` | `int \| None` | No | Maximum tokens to generate |
| `temperature` | `float \| None` | No | Sampling temperature (0.0 - 2.0) |
| `provider_options` | `dict \| None` | No | Provider-specific options passthrough |

Returns: LlmResponse

Raises: LlmError on generation failure

Example:

result = ctx.llm.generate(
    messages=[{"role": "user", "content": "Summarise this article"}],
    model="anthropic:claude-sonnet-4-6",
    max_tokens=1000,
    temperature=0.7,
)
print(result.text)

generate_object()

Generate structured output conforming to a JSON Schema.

Parameters:

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| `messages` | `list[dict[str, str]]` | Yes | Conversation messages |
| `schema` | `dict` | Yes | JSON Schema for output structure |
| `model` | `str \| None` | No | Model identifier |
| `max_tokens` | `int \| None` | No | Maximum tokens |
| `temperature` | `float \| None` | No | Sampling temperature |
| `provider_options` | `dict \| None` | No | Provider-specific options |

Returns: LlmResponse with .object populated

Raises: LlmError on generation failure

Example:

schema = {
    "type": "object",
    "properties": {
        "summary": {"type": "string"},
        "tags": {"type": "array", "items": {"type": "string"}},
    },
    "required": ["summary"],
}

result = ctx.llm.generate_object(
    messages=[{"role": "user", "content": "Analyse this"}],
    schema=schema,
    model="anthropic:claude-haiku-4-5",
)

data = result.object  # Parsed JSON object
print(data["summary"])
print(data.get("tags", []))
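
The example above reads `summary` directly because the schema marks it as required. This page does not state whether the host validates the output against the schema, so a defensive check may be worth having. The sketch below assumes a hypothetical helper, `missing_required`, which is not part of the SDK. It only inspects the top-level `required` keys and is not full JSON Schema validation.

```python
from __future__ import annotations


def missing_required(data: dict, schema: dict) -> list[str]:
    """Return the top-level keys listed in schema["required"] that data lacks."""
    return [key for key in schema.get("required", []) if key not in data]


schema = {
    "type": "object",
    "properties": {
        "summary": {"type": "string"},
        "tags": {"type": "array", "items": {"type": "string"}},
    },
    "required": ["summary"],
}

# Stand-ins for result.object; a real agent would pass the value
# returned by ctx.llm.generate_object().
complete = {"summary": "Short recap", "tags": ["news"]}
incomplete = {"tags": ["news"]}

print(missing_required(complete, schema))
print(missing_required(incomplete, schema))
```

If the returned list is non-empty, an agent could return an error instead of indexing into a key that is absent.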

Model Resolution

Resolution cascade (first match wins):

1. Fully qualified per-call: model="anthropic:claude-sonnet-4-6" is used directly.
2. Bare per-call + decorator default: model="claude-sonnet-4-6" combined with @agent(llm={"provider": "anthropic"}) is resolved to the full identifier.
3. Decorator default only: @agent(llm={"provider": "anthropic", "model": "claude-sonnet-4-6"}) is used when no model is specified per call.
4. Error: no model specified and no decorator default.
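
The cascade can be read as a small pure function. The following is an illustrative re-implementation, not the SDK's actual code. `resolve_model` is a hypothetical name, the `provider:model` separator is inferred from the identifiers shown on this page, and raising an error for a bare model name with no decorator provider is an assumption, since that case is not described here.

```python
from __future__ import annotations


def resolve_model(per_call: str | None, decorator_llm: dict | None = None) -> str:
    """Resolve a model identifier following the documented cascade (sketch)."""
    defaults = decorator_llm or {}
    provider = defaults.get("provider")

    if per_call is not None:
        if ":" in per_call:
            # 1. Fully qualified per-call identifier: used directly.
            return per_call
        if provider:
            # 2. Bare per-call name + decorator provider: combine them.
            return f"{provider}:{per_call}"
        # Assumption: a bare name without a provider cannot be resolved.
        raise ValueError(f"Cannot resolve bare model {per_call!r} without a provider")

    if provider and defaults.get("model"):
        # 3. Decorator default only.
        return f"{provider}:{defaults['model']}"

    # 4. Nothing to resolve from.
    raise ValueError("No model specified and no decorator default")


print(resolve_model("anthropic:claude-sonnet-4-6"))
print(resolve_model("claude-sonnet-4-6", {"provider": "anthropic"}))
print(resolve_model(None, {"provider": "anthropic", "model": "claude-sonnet-4-6"}))
```

All three calls resolve to the same fully qualified identifier, which is the point of the cascade: the per-call argument overrides the decorator default only as far as it is specified.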

LlmResponse

@dataclass
class LlmResponse:
    text: str | None           # Generated text (None for generate_object)
    object: dict | None        # Structured output dict (None for generate)
    model: str                 # Model identifier used (e.g., "anthropic:claude-sonnet-4-6")
    usage: dict                # {"prompt_tokens": 120, "completion_tokens": 250}
    finish_reason: str         # "stop", "length", "content_filter", etc.
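
The `usage` and `finish_reason` fields are useful for logging and for detecting cut-off output. The sketch below uses a local stand-in dataclass that mirrors the fields above so it runs on its own. `summarise_response` is a hypothetical helper, and treating `"length"` as "the output hit the token limit" is an assumption based on the usual convention, not something this page states.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class LlmResponse:
    """Local stand-in mirroring the documented fields; not the SDK class."""

    text: str | None
    object: dict | None
    model: str
    usage: dict
    finish_reason: str


def summarise_response(resp: LlmResponse) -> dict:
    """Collect total token usage and whether the output looks truncated."""
    usage = resp.usage
    total = usage.get("prompt_tokens", 0) + usage.get("completion_tokens", 0)
    # Assumption: "length" means generation stopped at the token limit.
    return {"total_tokens": total, "truncated": resp.finish_reason == "length"}


resp = LlmResponse(
    text="Partial answer",
    object=None,
    model="anthropic:claude-sonnet-4-6",
    usage={"prompt_tokens": 120, "completion_tokens": 250},
    finish_reason="length",
)
print(summarise_response(resp))  # {'total_tokens': 370, 'truncated': True}
```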

Error Handling

from friday_agent_sdk import LlmError, agent, err, ok

@agent(id="resilient", version="1.0.0", description="Handles LLM failures")
def execute(prompt, ctx):
    try:
        result = ctx.llm.generate(..., model="expensive-model")
    except LlmError as e:
        # Error message from host (e.g., "Rate limit exceeded", "Invalid API key")
        return err(f"Primary model failed: {e}")

    return ok({"output": result.text})
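
The handler above reports the failure immediately. One possible extension is to try a cheaper or alternative model before giving up. The sketch below is one way that could look, assuming only that `generate()` raises `LlmError` as documented. `generate_with_fallback` and `FakeLlm` are hypothetical names, and `LlmError` is redefined locally as a stand-in so the block runs without the SDK installed.

```python
class LlmError(Exception):
    """Local stand-in for friday_agent_sdk.LlmError so the sketch runs on its own."""


def generate_with_fallback(llm, messages, models):
    """Try each model in order and return the first successful response.

    Raises LlmError carrying the last failure message if every model fails.
    """
    last_error = None
    for model in models:
        try:
            return llm.generate(messages, model=model)
        except LlmError as e:
            last_error = e  # remember why this model failed, then try the next
    raise LlmError(f"All models failed: {last_error}")


class FakeLlm:
    """Test double that fails for the listed models; not part of the SDK."""

    def __init__(self, failing):
        self.failing = set(failing)
        self.calls = []

    def generate(self, messages, *, model=None):
        self.calls.append(model)
        if model in self.failing:
            raise LlmError("Rate limit exceeded")
        return f"response from {model}"


llm = FakeLlm(failing={"anthropic:claude-sonnet-4-6"})
result = generate_with_fallback(
    llm,
    [{"role": "user", "content": "Hello!"}],
    ["anthropic:claude-sonnet-4-6", "anthropic:claude-haiku-4-5"],
)
print(result)
```

Inside an agent, `ctx.llm` would take the place of `FakeLlm`, and the final `LlmError` could be caught and turned into `err(...)` as in the handler above.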

Provider Options

Pass provider-specific configuration:

result = ctx.llm.generate(
    messages=[...],
    model="claude-code:sonnet",
    provider_options={
        "systemPrompt": {
            "type": "preset",
            "preset": "claude_code",
        },
        "effort": "high",
        "repo": "owner/repo",
    },
)

Options vary by provider. Common patterns:

Claude Code provider:

systemPrompt — Either {"type": "preset", "preset": "..."} or {"type": "custom", "content": "..."}
effort — "low", "medium", "high"
fallbackModel — Model to use if primary fails
repo — Repository to clone and work in
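
Because `provider_options` is a plain dict, a small builder can keep the keys consistent across calls. The sketch below uses only the keys and values listed above. `claude_code_options` is a hypothetical helper, and the `"medium"` default for `effort` is an assumption, not a documented default.

```python
from __future__ import annotations

VALID_EFFORT = {"low", "medium", "high"}


def claude_code_options(
    system_prompt: str,
    effort: str = "medium",  # assumption: not a documented default
    repo: str | None = None,
) -> dict:
    """Build a provider_options dict with a custom system prompt."""
    if effort not in VALID_EFFORT:
        raise ValueError(f"effort must be one of {sorted(VALID_EFFORT)}, got {effort!r}")
    options: dict = {
        "systemPrompt": {"type": "custom", "content": system_prompt},
        "effort": effort,
    }
    if repo is not None:
        options["repo"] = repo  # only include the key when a repository is given
    return options


print(claude_code_options("You review pull requests.", effort="high", repo="owner/repo"))
```

The returned dict would be passed as `provider_options=` to `generate()` or `generate_object()`.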

Message Format

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
    {"role": "assistant", "content": "Hi there!"},
    {"role": "user", "content": "Analyse this code..."},
]

Valid roles: system, user, assistant
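
Checking the message list before calling the host can catch malformed input early. The sketch below assumes only the format described above: each message is a dict with a `role` from the valid set and a string `content`. `validate_messages` is a hypothetical helper, not part of the SDK.

```python
VALID_ROLES = {"system", "user", "assistant"}


def validate_messages(messages):
    """Raise ValueError on the first malformed message; return the list otherwise."""
    for i, msg in enumerate(messages):
        role = msg.get("role")
        if role not in VALID_ROLES:
            raise ValueError(f"messages[{i}]: invalid role {role!r}")
        if not isinstance(msg.get("content"), str):
            raise ValueError(f"messages[{i}]: content must be a string")
    return messages


messages = validate_messages([
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
])
print(len(messages))  # 2
```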

Limitations

No streaming responses: the full response is returned at once; streaming is not yet supported.
5 MB implicit limit: platform constraints on response size impose this cap.

Why Host-Managed?

Friday agents run as Python subprocesses with only the standard library and friday_agent_sdk available — packages like openai and anthropic are not installed in the agent environment. Host capabilities provide the same functionality while Friday manages API keys, rate limits, and provider routing centrally.

See Also

How to Call LLMs: Task-oriented guide
AgentContext: Parent context object
