AI Monitoring & Observability
Tools for monitoring AI model performance and costs
18 tools
Arize Phoenix
Open SourceOpen-source LLM observability and evaluation. Trace visualization, embedding analysis, and evals.
AI Monitoring & ObservabilityBraintrust
FreemiumEnterprise AI evaluation and observability platform. Prompt playground, scoring, and dataset management.
AI Monitoring & ObservabilityDatadog LLM Observability
PaidLLM monitoring within Datadog ecosystem. Trace prompts, tokens, latency alongside infrastructure metrics.
AI Monitoring & ObservabilityDeepEval
Open SourceOpen-source LLM evaluation framework. 14+ metrics including hallucination, relevancy, and bias detection.
AI Monitoring & ObservabilityGalileo
FreemiumLLM evaluation and hallucination detection platform. Automated metrics for RAG quality and safety.
AI Monitoring & ObservabilityHelicone
FreemiumOpen-source LLM observability proxy. One-line integration, request logging, caching, and rate limiting.
AI Monitoring & ObservabilityHumanloop
FreemiumPrompt management and evaluation platform. Version prompts, run evals, and optimize LLM performance.
AI Monitoring & ObservabilityLangfuse
FreemiumOpen-source LLM observability. Traces, metrics, prompt management, and evaluation. Self-hostable.
AI Monitoring & ObservabilityLangSmith
FreemiumLangChain's observability platform. Trace, debug, and evaluate LLM applications with detailed run analytics.
AI Monitoring & ObservabilityOpenLLMetry
Open SourceOpen-source observability for LLMs based on OpenTelemetry. Works with Datadog, Grafana, Honeycomb.
AI Monitoring & ObservabilityOpik (Comet)
Open SourceOpen-source LLM evaluation and tracing platform. Track experiments, evaluate outputs, and debug prompts.
AI Monitoring & ObservabilityPortkey
FreemiumAI gateway with observability. Load balancing, fallbacks, caching, and guardrails for LLM APIs.
AI Monitoring & ObservabilityPostHog
FreemiumOpen-source product analytics with AI feature tracking. Session replay, feature flags, A/B testing.
AI Monitoring & ObservabilityPromptLayer
FreemiumPrompt engineering platform. Version control, A/B testing, and analytics for prompts across providers.
AI Monitoring & ObservabilityRAGAS
Open SourceEvaluation framework for RAG pipelines. Measures faithfulness, relevancy, and context precision.
AI Monitoring & ObservabilitySentry
FreemiumError tracking with AI/LLM monitoring support. Track exceptions, performance, and LLM-specific errors.
AI Monitoring & ObservabilityTraceloop
FreemiumLLM monitoring built on OpenTelemetry. Auto-instrumentation for LangChain, LlamaIndex, and OpenAI SDK.
AI Monitoring & ObservabilityWeights & Biases
FreemiumML experiment tracking and model monitoring. LLM-specific features for prompt tracking and evaluation.
AI Monitoring & Observability