Skip to main content

Xenon V2: The Next Generation of Autonomous Mobile Infrastructure

· 3 min read
Rabindra Biswal
Creator of Xenon

We're thrilled to announce Xenon V2 — a ground-up evolution of the autonomous mobile infrastructure platform. This release brings AI-powered diagnostics, self-healing locators, visual intelligence, and enterprise-grade operational features.

Note (current state)

This post documents Xenon V2 as it shipped on 2026-02-17 and is preserved as a historical record. Some details have evolved since:

  • Tier 4 of the self-healing engine no longer uses Florence-2; it now runs OCR (Tesseract.js) with the configured AI provider as the visual fallback. See Self-Healing and Omni-Vision.
  • Hub↔node communication is HTTP REST + Socket.IO (not gRPC). See Architecture and Remote Execution.
  • Hub-node authentication uses per-node (accessKey, token) pair auth (not a shared secret). See Security.

AI Root-Cause Diagnosis

Xenon V2 introduces multimodal AI failure analysis. When a session fails, the AI engine reasons over screenshots, device logs, and command sequences to produce actionable root-cause reports — no manual triage required.

Supported LLM Providers:

ProviderDefault ModelMultimodal
Google Geminigemini-3-flash-preview[Yes]
OpenAIgpt-4o[Yes]
Anthropicclaude-3-5-sonnet[Yes]
Ollamallama3[No] (text-only)

Configure via environment variables (GEMINI_API_KEY, OPENAI_API_KEY, ANTHROPIC_API_KEY) or the Dashboard Settings UI.


🩺 5-Tier Self-Healing Engine

When a locator breaks, Xenon's HealingOrchestrator cascades through five increasingly powerful strategies:

  1. ResilioTree — Structural tree-diff matching (free, instant)
  2. FuzzyXML — Attribute similarity scoring (free, instant)
  3. OCR — Text-based visual matching (free, local)
  4. Visual AI — Florence-2 element detection (local GPU)
  5. LLM Deep Healing — Full reasoning with GPT/Claude (API call)

Successful healings are autonomously learned via the Etalon Service, preventing the same break from recurring.


Omni-Vision (Visual Intelligence)

Powered by Florence-2, Omni-Vision enables:

  • Coordinate-free element detection — Tap elements by description, not XPath
  • Visual assertions — Verify screen state without brittle selectors
  • AI-guided interaction — Natural language commands for exploratory testing

Network Conditioning

Simulate real-world network conditions during test execution:

ProfileLatencyDownloadUpload
4G20ms15 Mbps7.5 Mbps
3G100ms2 Mbps1 Mbps
Edge400ms250 Kbps150 Kbps
Offline00

Platform-specific: Android (ADB svc), iOS Simulator (simctl network), plus proxy-level latency injection.


Webhook Notifications

Configure real-time alerts via Slack or generic HTTP webhooks:

  • Events: device_offline, session_failed, device_new
  • Custom templates: Use {{variable}} syntax for payload customization
  • Managed entirely through the Dashboard Settings UI

Deployment Flexibility

Three supported topologies:

  1. Standalone — Single node with SQLite
  2. Hub-Node — Distributed grid with PostgreSQL + gRPC
  3. Cloud — BrowserStack, SauceLabs, LambdaTest, HeadSpin, pCloudy

Dashboard Settings

The V2 dashboard includes a full settings panel for:

  • AI Configuration — Provider selection, model, API key testing
  • Maintenance — Retention policies, cleanup schedules
  • Infrastructure — Device types, session limits, timeouts

Additional Improvements

  • Session Heartbeat Service — Automatic idle session detection and release
  • Request Logging — Full HTTP request/response audit trail
  • Metrics & Profiling — CPU, memory, and thermal monitoring
  • Resource Isolation — Per-session resource boundaries
  • Video Pipeline — MJPEG proxy with reconnection handling
  • OpenTelemetry — Distributed tracing via OTLP export

Upgrading

appium plugin update @xenon-device-management/xenon

See the Setup Guide and Configuration Reference for details.