VTKL Evals Dashboard

Live
Loading...
Loading evaluation data...
Production Demos
Pipeline Status
Phase 1 — Intelligence Intake
Operational
Drive intake
Slack monitoringActive
Stakeholder files
Phase 2 — Shadow Review
Operational
Total runs
Judge modelGLM-5.1
Rubrics6
Phase 3 — Correlation Engine
Operational
Correlation runs
Decisions tracked
Intel items
Cron Schedule
JobScheduleDescriptionStatus
shadow-review0 3 * * *Nightly shadow review of agent outputsActive
memory-consolidation0 4 * * *Consolidate daily memory into long-term storageActive
drive-intake*/30 * * * *Sync Google Drive shared files for analysisActive
tony-task-capture0 8,12,17 * * 1-5Capture and triage Tony's DM task backlogActive
bd-daily0 9 * * 1-5Generate and post BD daily briefingActive
correlation-engine0 5 * * 0Weekly cross-domain correlation analysisActive
Memory Layer
Stakeholder Profiles
Individual intelligence files
Rubrics Registered
6
behavioral, discovery, effort, process, product, sales
MLflow Experiment
warren-evals
Experiment ID: 1 • MLflow 3.12.0