Braintrust Data 2024年公開LLM Eval Platform。Pro 業界Pro Mainstream Eval Dataset+Prompt Playground+Loop Workflow+OpenAI互換Proxy対応LLM Eval Platform先駆 + Pro Closed + Pro Free Trial + Pro 累計2023-2026年3年Heritage継承代表機。
Braintrust LLM Eval(ブレイントラスト エルエルエム イバル)はBraintrust Data(米国SF + 2023-Ankur Goyal(元Impira CEO + Figma買収組) + Manu Goyal創業 + 累計$36M Funding(A16Z)) 2024年公開v1.0 GA LLM Eval Platformで、Pro 業界Pro Mainstream Eval Dataset+Prompt Playground+Loop Workflow+OpenAI互換Proxy対応LLM Eval Platform先駆 + Pro Eval Dataset Versioning + Pro Prompt Playground Diff View + Pro Loop Workflow GA + Pro OpenAI互換Proxy + Pro 採用Notion/Stripe/Airbnb/Replit + 累計2023-2026年3年Heritage Pro Top独占代表機。Braintrust主要機能: (1)Braintrust Data主導(米国SF + 2023設立 + Ankur Goyal(元Impira CEO + Figma買収組) + Manu Goyal + 累計$36M Funding(Andreessen Horowitz/A16Z))、(2)Pro 業界Pro Mainstream Eval Dataset+Prompt Playground+Loop Workflow+OpenAI互換Proxy対応LLM Eval Platform先駆、(3)Pro Eval(Dataset + Score + Snapshot Versioning)、(4)Pro Prompt Playground(Live Prompt Test + Diff View)、(5)Pro Loop Workflow(2024 GA/Eval結果駆動Iterative改善Workflow)、(6)Pro Custom Score Function(JavaScript/Python)、(7)Pro Built-in Score(LLM-as-Judge + Heuristic)、(8)Pro Logging(LLM呼び出しAuto-Capture)、(9)Pro OpenAI互換Proxy(Braintrust Proxy/30+ Model Routing)、(10)Pro Cache(Response Cache)、(11)Pro Trace(Span Hierarchy)、(12)Pro Comparison(Eval Run比較UI)、(13)Pro Promotions(Prompt Production昇格)、(14)Pro Datasets(Versioning + Test/Train Split)、(15)Pro Notebook(Jupyter風Eval Code)、(16)Pro JavaScript/Python SDK + CLI(braintrust)、(17)Pro Hosted Cloud(Free Trial + Pro Subscription)、(18)Pro 採用: Notion/Stripe/Airbnb/Replit等数百社B2B、(19)累計2023-2026年3年Heritage Pro Top独占。
| LLM Eval Platform | 機能 | Workflow | Proxy | 採用 |
|---|---|---|---|---|
| Braintrust | Eval Dataset + Prompt Playground + Loop Workflow | Loop Iterative改善 GA 2024 | OpenAI互換Proxy 30+ Model Routing | Notion/Stripe/Airbnb/Replit |
| LangSmith | Trace + Dataset + Annotation Queue | Manual + A/B Testing | - | Replit/Datadog/Klarna |
| Arize Phoenix | OTel Trace + RAG Eval + Embedding Drift | Manual | - | Datadog/Slack/Reddit |
| Langfuse | Trace + Eval + Prompt Management | Manual |
Braintrust選択ポイント: (1)Pro Eval Dataset+Prompt Playground+Loop Workflow+OpenAI互換Proxy対応LLM Eval Platform先駆 + 3年Heritage、(2)Pro Braintrust Data米国SF + Pro 2023設立 + Pro Ankur Goyal(元Impira CEO + Figma買収組) + Manu Goyal + 累計$36M Funding(A16Z)、(3)Pro Eval(Dataset + Score + Snapshot Versioning) + Pro Prompt Playground(Live Prompt Test + Diff View) + Pro Loop Workflow(2024 GA Eval結果駆動Iterative改善) + Pro Custom Score Function(JavaScript/Python) + Pro Built-in Score(LLM-as-Judge + Heuristic)、(4)Pro Logging(LLM呼び出しAuto-Capture) + Pro OpenAI互換Proxy(30+ Model Routing) + Cache + Trace + Comparison + Promotions(Production昇格) + Datasets Versioning + Test/Train Split + Notebook、(5)Pro JavaScript/Python SDK + CLI + Pro Hosted Cloud(Free Trial + Pro Subscription) + Pro 採用Notion/Stripe/Airbnb/Replit等数百社B2B + Pro Eval Dataset/Prompt Playground/Loop Workflow/A16Z採用重視採用 真価発揮。
LangSmith (2024): LangChain + Pro LangChain統合Trace+Dataset+Annotation Queue+Custom Evaluator対応LLM Eval Platform先駆 + Pro Closed + 累計2023-2026年3年Heritage。Braintrust(2024 v1.0 GA + Braintrust Data + Pro Eval Dataset+Prompt Playground+Loop Workflow+OpenAI互換Proxy対応LLM Eval Platform先駆 + Pro Closed + 累計3年Heritage)競合 + 直接競合 + Pro LangSmith → Pro Braintrust + Pro LangChain Framework深統合 → Pro Multi Framework SDK + Pro Annotation Queue → Pro Loop Workflow + Pro Replit/Datadog/Klarna → Pro Notion/Stripe/Airbnb + 同3年Heritage 比較競合。
Arize Phoenix (2024): Arize AI + Pro OTel Trace+RAG Eval+Embedding Drift対応OSS LLM Eval Platform先駆 + Pro Apache 2.0 OSS + 累計2020-2026年6年Heritage。Braintrust(2024 v1.0 GA + Braintrust Data + Pro Eval Dataset+Prompt Playground+Loop Workflow+OpenAI互換Proxy対応LLM Eval Platform先駆 + Pro Closed + 累計3年Heritage)競合 + 直接競合 + Pro Phoenix → Pro Braintrust + Pro Apache 2.0 OSS → Pro Closed Hosted + Pro OTel Trace → Pro Loop Workflow + Pro RAG Eval特化 → Pro Eval Dataset+Prompt Playground特化 + 6年 vs 3年Heritage。
Q1: Pro Eval Dataset+Prompt Playground+Loop Workflow+OpenAI互換Proxy対応LLM Eval Platform先駆 + 3年Heritage 効果? A: 2024 Braintrust v1.0 GA + Loop GA + Promotions GA発売Pro Famous Story類無し + Pro 業界Pro Mainstream Eval Dataset+Prompt Playground+Loop Workflow+OpenAI互換Proxy対応LLM Eval Platform先駆 + 累計2023-2026年3年Pro Mainstream LLM Eval Platform業界Top独占Heritage Pro Reference。
Q2: Pro Eval Dataset Versioning + Pro Prompt Playground Diff View + Pro Loop Workflow + Pro OpenAI互換Proxy + Pro Notion/Stripe/Airbnb採用 効果? A: Pro Eval(Dataset + Score + Snapshot Versioning) + Pro Prompt Playground(Live Prompt Test + Diff View) + Pro Loop Workflow(2024 GA / Eval結果駆動Iterative改善Workflow) + Pro Custom Score Function(JavaScript/Python) + Pro Built-in Score(LLM-as-Judge + Heuristic) + Pro Logging(LLM呼び出しAuto-Capture) + Pro OpenAI互換Proxy(Braintrust Proxy/30+ Model Routing) + Pro Cache(Response Cache) + Pro Trace(Span Hierarchy) + Pro Comparison(Eval Run比較UI) + Pro Promotions(Prompt Production昇格) + Pro Datasets(Versioning + Test/Train Split) + Pro Notebook(Jupyter風Eval Code) + Pro JavaScript/Python SDK + CLI(braintrust) + Pro Hosted Cloud(Free Trial + Pro Subscription) + Pro 採用Notion/Stripe/Airbnb/Replit等数百社B2B Heritage Pro Reference。
Q3: Pro Eval Dataset/Prompt Playground/Loop Workflow/A16Z採用重視採用 効果? A: Pro 採用: Pro Eval Dataset/Prompt Playground/Loop Workflow/A16Z採用重視 + Pro Braintrust系譜(設立 2023 Ankur Goyal 元Impira CEO + Figma買収組 + Manu Goyal/A16Z $36M Funding 2024/v1.0 GA + Loop GA + Promotions GA 2024/Self-Hosted Enterprise 2025予定継承) + Pro Multi-Generation Heritage Pro Reference。
| - |
| OSS Self-Hosted中心 |
| Helicone | Proxy Monitoring | - | OpenAI互換Proxy 30+ | Sunrun/Together AI |