adversarial-testing

Here are 82 public repositories matching this topic...

0xSanei / darwinia

The Self-Evolving Agent Ecosystem — Trading agents that evolve through Darwinian selection and adversarial self-play

bitcoin trading genetic-algorithm quantitative-finance autonomous-agents backtesting ai-agents multi-agent-system evolutionary-computing streamlit adversarial-testing openclaw darwinian-evolution

Updated Apr 13, 2026
Python

IBM / ares

Star

AI Robustness Evaluation System

security ai owasp owasp-top-10 red-teaming blue-teaming agentic-ai automated-red-teaming adversarial-testing

Updated Jun 10, 2026
Python

humanbound / humanbound

Star

Open-source AI agent red-team engine, SDK, and CLI. Run offline or against the Humanbound Platform.

Updated Jun 9, 2026
Python

sherifkozman / the-red-council

Star

LLM Adversarial Security Arena — Jailbreak → Detect → Defend → Verify

security gemini red-team llm langchain adversarial-testing

Updated May 9, 2026
Python

Open-source framework for building and testing LLM-powered applications: IRIS (single-agent orchestration), AETHER (declarative multi-agent systems), and AEGIS (adversarial security testing). Developed at MSU Denver's Community-Centered Computing (C3) Lab.

python open-source benchmarking research ai nsf multi-agent-systems security-testing red-teaming rag llm langchain langgraph agentic-ai adversarial-testing msu-denver c3-lab

Updated Jun 10, 2026
Python

audn-ai / skills

Star

Red-team your AI agents from any coding IDE. Adversarial security testing skills for Claude Code, Cursor, Codex, and 40+ agents.

skills jailbreak red-team ai-security prompt-injection llm-security claude-code adversarial-testing agent-skill

Updated Apr 13, 2026

howardpen9 / grok-mcp

Star

MCP server that wraps the xAI Grok CLI. Lets Claude Code, Cursor, Cline, and any MCP host use Grok as a peer code reviewer, adversary, and second-opinion consultant.

typescript mcp code-review cursor grok cline peer-review xai ai-tools ai-agent llm-tools agent-tools model-context-protocol mcp-server second-opinion claude-code adversarial-testing

Updated Jun 10, 2026
TypeScript

jhlee0409 / elenchus-mcp

Sponsor

Star

Elenchus MCP Server - Adversarial verification system for code review

nodejs typescript ai mcp static-analysis code-review claude code-verification llm anthropic model-context-protocol mcp-server adversarial-testing

Updated Jan 29, 2026
TypeScript

alejandrosaenz117 / bonfires-marketplace

Star

A marketplace of Claude Code plugins for adversarial security and architectural code review.

security architecture code-review threat-modeling security-review claude-code adversarial-testing plugin-marketplace

Updated Mar 30, 2026

CodedRichy / food-chain-ideation

Star

Claude Code skill that stress-tests startup ideas with adversarial AI agents — 68 animals, elimination rounds, blind scoring. Your idea either survives or you get 3 pivots

ideation ai-agents claude product-strategy prompt-engineering claude-code adversarial-testing claude-skills

Updated Jun 10, 2026
HTML

zakky8 / llm-jailbreak-taxonomy

Star

Mechanism-grounded taxonomy of 40 LLM jailbreak patterns across 10 categories. 8,000-trial bootstrap evaluation for the June 2026 frontier (Claude Opus 4-8, GPT-5.5, Gemini 3.5, DeepSeek V4). Every citation direct-WebFetch verified; refuted claims documented.

taxonomy jailbreak alignment ai-safety security-testing responsible-disclosure jailbreak-detection adversarial-attacks red-teaming ai-security model-robustness adversarial-ml prompt-injection red-teaming-tools llm-security llm-evaluation llm-jailbreaks ai-red-teaming adversarial-testing

Updated Jun 2, 2026
Jupyter Notebook

stchakwdev / Gaslight_EVAL

Star

AI safety evaluation framework testing LLM epistemic robustness under adversarial self-history manipulation

python ai-safety openrouter llm-evaluation adversarial-testing alignment-research epistemic-robustness

Updated Dec 18, 2025
Python

YaswanthGhanta / llm-logical-integrity-benchmark

Star

Adversarial testing of LLMs on constraint satisfaction deadlocks

reinforcement-learning gemini grok claude hallucination prompt-engineering chain-of-thought chatgpt rlhf qwen llm-evaluation sycophancy deepseek safety-alignment ai-red-teaming kimi-k2 adversarial-testing

Updated Jan 27, 2026

dr-gareth-roberts / context-engineering

Star

Context engineering toolkit for LLMs — pack, cache, debug, red-team, and orchestrate context windows. Council of Experts, adversarial testing, immune system, context compiler, drift detection, multi-agent entanglement. TypeScript + Python.

python typescript ai multi-agent rag llm prompt-engineering llm-tools context-window prefix-caching context-engineering adversarial-testing token-budget council-of-experts context-packing

Updated Jun 8, 2026
Python

craigtrim / persona-api

Star

API for generating LLM bot/agent personalities based on the Big Five personality model.

big-five-model adversarial-testing personality-api llm-agent-personas behavioral-profiles

Updated Jan 2, 2026
Python

vibheksoni / jailbench

Star

Benchmark LLM jailbreak resilience across providers with standardized tests, adversarial mode, rich analytics, and a clean Web UI.

Updated Aug 12, 2025
Python

tasumermaf / the-adversary

Star

Agent-driven adversarial paper audit framework

python ai-agents scientific-writing research-tools adversarial-testing paper-audit

Updated Mar 17, 2026
Python

Zandereins / hydra

Star

Multi-perspective code review council for Claude Code. 3 advisors by default, 10 agents in deep mode (Opus + Codex). Evidence chains, adversarial self-test, dual-path verdict. Based on Karpathy's LLM Council.

security-audit multi-agent code-review opus codex cross-model architecture-review prompt-engineering ai-code-review claude-code adversarial-testing claude-skill claude-code-skill llm-council evidence-chains dual-path-verdict

Updated Jun 3, 2026
Python

jhcdev / omc-codex

Star

Cross-model orchestration for Claude Code — Claude builds, Codex validates. Blind TDD, adversarial stress testing, mixed-model teams, and automatic fallback. Two AI models enter, better code leaves.

plugin tdd developer-tools code-review codex cross-model ai-orchestration claude-code adversarial-testing oh-my-claudecode

Updated Apr 3, 2026
JavaScript

audn-ai / audn-cli

Star

CLI for Audn.ai — CI/CD security gate and developer workflows for AI agent red-teaming

cli golang security cicd red-team ai-security voice-ai llm-testing adversarial-testing

Updated Apr 13, 2026
Go

Improve this page

Add a description, image, and links to the adversarial-testing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the adversarial-testing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adversarial-testing

Here are 82 public repositories matching this topic...

0xSanei / darwinia

IBM / ares

humanbound / humanbound

sherifkozman / the-red-council

msu-denver / bili-core

audn-ai / skills

howardpen9 / grok-mcp

jhlee0409 / elenchus-mcp

alejandrosaenz117 / bonfires-marketplace

CodedRichy / food-chain-ideation

zakky8 / llm-jailbreak-taxonomy

stchakwdev / Gaslight_EVAL

YaswanthGhanta / llm-logical-integrity-benchmark

dr-gareth-roberts / context-engineering

craigtrim / persona-api

vibheksoni / jailbench

tasumermaf / the-adversary

Zandereins / hydra

jhcdev / omc-codex

audn-ai / audn-cli

Improve this page

Add this topic to your repo