ai-reliability
Here are 57 public repositories matching this topic...
The open-source MultiAgentOps evaluation and verification harness for business workflows in any industry.
Updated May 15, 2026 - Python
Open-source AI model evaluation and benchmarking framework for LLMs (OpenAI, Ollama, Claude, Gemini)
Updated May 16, 2026 - Python
The "Cloudflare for AI Agents". 7-layer security interceptor, real-time observability dashboard, and automated reliability testing for MCP and AI tool chains. Prevent hallucinations, prompt injection, and destructive tool calls.
Updated May 4, 2026 - Python
Production-grade TypeScript AI runtime focused on reliability, governance, and reproducible LLM systems. Multi-provider gateway, agents, RAG, workflows, policy engine, audit trails, and deterministic testing — built for teams shipping AI in production.
Updated May 13, 2026 - TypeScript
MCP server for the Ejentum Logic API. Exposes the four cognitive harnesses (reasoning, code, anti-deception, memory) as MCP tools any agentic client can call.
Updated May 17, 2026 - JavaScript
Architectural standards and best practices for building reliable AI Agents and LLM workflows. Defining the framework for AI Reliability Engineering (AIRE).
Updated Feb 14, 2026 - Dockerfile
Benchmark for evaluating advanced reasoning, recursive dependency resolution, and robustness capabilities of large language models in dynamic, noisy, and structurally challenging environments.
Updated May 15, 2026 - Python
Sheldon K. Salmon — AI Reliability Architect. Creator of the AION Constitutional Stack and the CERTUS certainty‑engineering methodology. He designed, directed, and red‑teamed VERITAS — applying epistemic scoring, Uncertainty Mass, and permanent STP seals to community crisis data. Code is open source. The judgment is not.
Updated May 16, 2026 - JavaScript
AION Scaffold — Intelligent tree-to-filesystem generator. Built by Sheldon K. Salmon, AI Reliability Architect. Part of the AION Constitutional Stack. Free forever. No tracking.
Updated May 6, 2026 - HTML
AI evaluation and reliability system for reasoning about trust, performance, and failure modes in AI systems | AditiKhare.com — AI Product Ecosystem
Updated Apr 20, 2026
Enterprise AI system for decision intelligence — transforming research into scalable, context-aware insights at production scale | AditiKhare.com — AI Product Ecosystem
Updated Apr 20, 2026
A behavioral framework opposing native fluency to authentic fluency — the structural tension RLHF creates and Claude Mythos Preview makes urgent.
Updated May 5, 2026
A multi-agent cognitive architecture that addresses the LLM state-dependency problem with persistent memory and a mandatory self-correction loop. It is built on a biologically resonant principle: memory is an active component of intelligence itself.
Updated Aug 23, 2025
Orchestration runtime for AI agent workflows that preserves task-state fidelity, prevents reasoning drift, and reduces wasted computation in long-horizon pipelines.
Updated Mar 19, 2026 - JavaScript
Developer-first prompt engineering patterns for grounded, testable, and reliable AI outputs.
Updated Mar 27, 2026
Lightweight benchmark harness for AI-driven business workflows.
Updated Apr 24, 2026 - HTML
Research archive — eight published papers, Mahdi Ledger, and empirical foundations of the LC-OS governance framework.
Updated May 9, 2026
Mine corrections from AI chat logs. Gate the next response before drift ships.
Updated Apr 30, 2026 - Python
UAICP (Universal Agentic Interoperability Control Protocol): open reliability contract for AI agent workflows with evidence gating, policy controls, and auditability.
Updated Feb 27, 2026 - TypeScript