Adversarial Chat Deck
Deploy models in opposing pairings, run prompt benchmarking debates, and visually compare code implementations with built-in real-time cost trackers.
psycopg2.
aiopg.
Run identical tasks side-by-side on OpenAI, Anthropic, or local Ollama instances to evaluate quality and performance variations immediately.
Track input and output token consumption in real time. Configure automated cost thresholds to freeze sessions before runaway bills accumulate.
All dialogue contexts, prompt metrics, and loop alerts are compiled in a local database. Zero cloud dependencies, zero external telemetry.
Let a third model audit debates, score implementations, and automatically suggest consensus pipelines to insert into your codebase.