Production-grade references for AI engineering teams
Checklists, playbooks, and cheatsheets from real deployments. Built for teams shipping production AI systems.
Choose the asset that matches your situation
Use them to route a specific engineering decision.
Unsure whether this should even be agentic?
Start with the enterprise assessment kit to score suitability, autonomy level, and governance risk before build effort compounds.
Open resource → PrioritizeMultiple enterprise AI initiatives are competing for budget
Use the portfolio triage worksheet to classify what should be funded, redesigned, simplified, or stopped before the roadmap hardens.
Open resource → ProcureComparing vendors, platforms, or advisory partners
Use the vendor evaluation scorecard when procurement and technical stakeholders need one architecture-based comparison frame.
Open resource → StabilizeAgent pilot works, but production feels fragile
Start with the production AI audit to review failure modes, state, observability, and rollout risk before scale exposes the gaps.
Open resource → BuildShipping LangGraph workflows with state and approvals
Use the LangGraph cheatsheet when the implementation problem is checkpointing, HITL gates, and state design.
Open resource → DebugRAG answers are wrong, weak, or unstable
Use the RAG quality checklist to walk the pipeline from ingestion and chunking through retrieval and final grounding.
Open resource → GovernAI governance needs a board-ready evidence package
Use the Board Evidence Package to brief executives, procurement, or audit committees with structured artifacts: maturity snapshot, governance map, kill list.
Open resource → MapProduction AI systems need a governance control map
Start with the Governance Control Map Sample to see how deployed systems are mapped to autonomy levels, permission violations flagged, and sovereignty gaps identified.
Open resource →Additional implementation references
AI Agent Engineering Playbook
Internal Production Standards
The engineering standards we apply to every agent system we build. 12 failure modes with mitigation patterns, production gates, and deployment checklists.
View resource →Architecture Decision Record Kit for AI Systems
Document AI Design Choices Like a Principal Engineer
ADR template for AI/ML systems with 10 pre-filled examples covering the architecture decisions that matter most: single-agent vs multi-agent, RAG vs fine-tuning, checkpoint backends, vector DB selection, orchestration framework, and more.
View resource →From checklist to production
These resources cover what to check. We handle the engineering to get you there.
[ SUBMIT SPECS ]