Operations

Runbooks, scripts, and troubleshooting guides.

Guides

| Guide | Description |
| --- | --- |
| Runbook | Deployment, monitoring, runbook |
| Scripts | Seeding, diagnostics, geocoding, E2E |
| Troubleshooting | Common issues and resolutions |
| Carts & Inventory | Cart fleet and items inventory management |
| Admin Ops & Metrics | Health checks, queue monitoring, Prometheus metrics |

Key Metrics

| Metric | Description |
| --- | --- |
| teesheet_grid_load_time | Daily view load latency (p50, p95) |
| teesheet_ws_connect_rate | WebSocket connection success rate |
| teesheet_api_latency | API endpoint latency by operation |
| teesheet_error_rate | Error rate by operation |
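
As a rough illustration of what the p50 and p95 figures mean, the sketch below computes nearest-rank percentiles from a list of raw latency samples. The `percentile` helper and the sample values are hypothetical and only for illustration; in practice these percentiles would normally come from the metrics backend rather than from hand-rolled code.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample such that at least
    pct percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("at least one sample is required")
    ordered = sorted(samples)
    # Rank is 1-based; clamp to 1 so that pct=0 returns the minimum.
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


# Hypothetical daily-view load times in milliseconds (illustrative only).
load_times_ms = [120, 135, 150, 180, 210, 240, 300, 420, 610, 950]

p50 = percentile(load_times_ms, 50)
p95 = percentile(load_times_ms, 95)
print(f"p50={p50}ms p95={p95}ms")
```

With only ten samples, p95 lands on the slowest request, which is why p95 is far more sensitive to outliers than p50.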

Alerts

  • High API Latency: p95 > 500ms
  • WebSocket Failures: Connect rate < 95%
  • Weather Fetch Failures: > 5% error rate
  • Booking Conflicts: > 1% double-book attempts
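
The thresholds above can be sketched as a simple check. This is a minimal illustration, assuming rates are expressed as fractions between 0 and 1 and latency in milliseconds; the function name `firing_alerts` and its parameter names are hypothetical and do not reflect how the alerts are actually configured.

```python
def firing_alerts(api_latency_p95_ms, ws_connect_rate,
                  weather_error_rate, double_book_rate):
    """Return the names of the alerts whose thresholds are crossed.

    Rates are fractions (0.95 means 95%). Comparisons are strict,
    matching the '>' and '<' in the alert list, so a value sitting
    exactly on a threshold does not fire.
    """
    alerts = []
    if api_latency_p95_ms > 500:
        alerts.append("High API Latency")
    if ws_connect_rate < 0.95:
        alerts.append("WebSocket Failures")
    if weather_error_rate > 0.05:
        alerts.append("Weather Fetch Failures")
    if double_book_rate > 0.01:
        alerts.append("Booking Conflicts")
    return alerts


# Hypothetical snapshot: slow API and elevated double-book attempts.
print(firing_alerts(
    api_latency_p95_ms=620,
    ws_connect_rate=0.97,
    weather_error_rate=0.02,
    double_book_rate=0.015,
))
```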

Environment Variables

See getting-started/configuration.md for the full list.