| KPI Registry and Lifecycle | KPI Management | /api/v1/kpi/defs | Impact and cause KPI definitions | Creates a stable monitoring and RCA foundation |
| Bulk KPI Import | KPI Management | /api/v1/kpi/defs/bulk-json, /bulk-csv | Batch KPI onboarding | Accelerates onboarding for large environments |
| Semantic KPI Search | KPI Management | /api/v1/kpi/search | Ranked KPI candidates | Speeds triage during incident response |
| Multi-Signal Failure Detection | Failures | /api/v1/unified/failures/detect | Incident records with confidence | Turns noisy telemetry into actionable failures |
| Transaction Failure Correlation | Failures | /api/v1/unified/failures/correlate | Cascade and sequence context | Improves blast-radius understanding |
| Statistical Cause Ranking | Correlation | /api/v1/unified/correlation | Suspicion-ranked cause candidates | Prioritizes root-cause investigation path |
| Temporal Ring Analysis | Correlation | /api/v1/unified/correlation | R1 to R4 timing evidence | Improves causal ordering and confidence |
| 5-WHY RCA Chains | RCA | /api/v1/unified/rca | Human-readable root-cause narrative | Reduces MTTR with explainable outcomes |
| Health and Metrics Endpoints | Operations | /health, /ready, /metrics | Service health and telemetry | Supports production reliability and observability |
| Deployment and Integration Options | Operations | Docker, Helm, Kubernetes | Environment-specific deployment path | Fits both local dev and enterprise clusters |
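
As a rough illustration of the Operations endpoints listed in the table, the sketch below probes `/health`, `/ready`, and `/metrics` using only the Python standard library. It assumes the endpoints answer plain unauthenticated HTTP GET requests, which the table does not state. The base URL `http://localhost:8000` and the helper names `build_urls` and `probe` are hypothetical and exist only for this example.

```python
from urllib.error import HTTPError, URLError
from urllib.request import urlopen

# Paths copied from the "Health and Metrics Endpoints" row of the table.
OPERATIONS_PATHS = ("/health", "/ready", "/metrics")


def build_urls(base_url, paths=OPERATIONS_PATHS):
    """Return {path: full URL} for each path appended to base_url."""
    base = base_url.rstrip("/")  # avoid a double slash when joining
    return {path: base + path for path in paths}


def probe(base_url, timeout=2.0):
    """Return {path: HTTP status code, or None if the service is unreachable}.

    Assumption: each endpoint accepts an unauthenticated GET request.
    """
    results = {}
    for path, url in build_urls(base_url).items():
        try:
            with urlopen(url, timeout=timeout) as response:
                results[path] = response.status
        except HTTPError as err:
            # The server answered, but with a 4xx/5xx status.
            results[path] = err.code
        except (URLError, OSError):
            # Connection refused, DNS failure, or timeout.
            results[path] = None
    return results
```

Calling `probe("http://localhost:8000")` against a running instance returns one entry per path. A `None` value means the request never reached the service, which is worth distinguishing from a non-200 status when wiring up readiness checks.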