Skip to content

Concepts

This section covers the core concepts and architecture of Oryn.

Overview

Oryn is built on several key principles:

  1. Intent Over Implementation — Agents express what they want to do, not how to do it
  2. Semantic Abstraction — Web pages are presented as structured, labeled interactive surfaces
  3. Consistency by Design — Shared scanner/runtime model across deployment modes
  4. Progressive Intelligence — From atomic commands to high-level intents

Core Components

graph TD A[AI Agent] -->|Intent Commands| B[Intent Parser] B --> C[Intent Engine] C -->|Atomic Commands| D[Scanner Interface] D --> E[Universal Scanner] E -->|JSON Protocol| F[Browser Backend] F --> G[oryn-h: Chromium] F --> H[oryn-e: WebKit] F --> I[oryn-r: Extension]

Key Concepts

The Universal Scanner

At the heart of Oryn is the Universal Scanner—a JavaScript module that runs inside web pages and understands them the way agents need to.

The scanner: - Identifies all interactive elements - Labels them with simple numeric IDs - Classifies their types (input, button, link, etc.) - Infers their roles (email, password, search, submit) - Reports their states (required, disabled, checked) - Detects common patterns (login forms, search boxes, pagination)

This same scanner code runs in all environments, which improves cross-mode consistency while still allowing backend-specific differences.

Multi-Level Abstraction

Oryn supports commands at different levels of abstraction:

Level Description Example
Level 1: Direct Operate on element IDs click 5
Level 2: Semantic Operate on roles/text click "Sign in"
Level 3: Intent High-level workflows login "user" "pass"
Level 4: Goal Natural language goals (Agent/LLM layer)

Agents can mix levels as needed, using direct commands for precision and intents for common workflows.