Clean code in. Clean PRs out. Hyrax audits your codebase, surfaces findings across security, correctness, maintainability, performance, architecture, and operations, then writes fixes and opens PRs. You review and merge every PR.

How does pricing work?

One flat price per workspace, not per seat. Free: 1 private repo, 1 mini-audit per month, verified fixes, no card. Pro: $30/mo with $30 of usage included, up to 3 repos, full audit pipeline, opt-in overage after. Team: $200/mo flat with $200 of usage included, unlimited repos and seats. Included usage does not roll over.

What do I get with Pro vs Team?

Pro: up to 3 repos, the full audit + scan pipeline, PR reviews on every opened PR, and auto-publish to Linear. Team: unlimited repos and seats, plus the self-improvement learn loop and public repos by URL.

What is the 13-step verification?

Every fix runs through 13 steps before a PR opens: test baseline, fix agent, diff size guard, test regression, build, auto-format, lint, cross-project test, scanner quality loop, review loop, post-fix audit, detection query verify, push and PR. A failure at any critical step aborts the run.

How is Hyrax different from Copilot or Cursor?

Copilot and Cursor help you write code faster. Hyrax ships clean code. It audits issues, fixes them, and opens PRs for you to review and merge. Different category, different outcome.

Scan profiles your entire codebase, your architecture, conventions, patterns, and creates an Agent Context stored in your .hyrax/ folder. Then it runs six agent groups plus a deterministic scanner. Scan produces findings and easy wins, each with a change plan ready for Fix.

Every change ships as a pull request with the [Hyrax] prefix. PR Review reviews every opened pull request automatically against your codebase conventions, leaving comments that update as your code changes. It can block merge on must-fix findings. Available on Pro and Team.

What languages are supported?

Hyrax works across 19 languages: Python, TypeScript, JavaScript, Go, Rust, Swift, Ruby, Java, Kotlin, C#, C++, C, PHP, Scala, Dart, Elixir, Shell, Lua, and MDX. It works with the frameworks built on them — React, Next.js, Vue, Svelte, Angular, Node.js, Django, Rails, Spring, FastAPI, Express, React Native, and Flutter.

What integrations are supported?

GitHub for source control. Linear for ticket management. Tickets are created on audit and closed automatically when fixes merge.

All inference runs in our AWS Bedrock account. We do not train on your code.

INDUSTRY · JUNE 8, 2026 · 4 MIN READ

GitHub's Copilot Code Review just admitted what the data already said

On June 2 GitHub shipped Agent Skills, MCP-connected review, and a Medium tier for Copilot Code Review. Each addition is a quiet admission that the previous version was insufficient.

GitHub shipped three changes to Copilot Code Review on June 2: Agent Skills, MCP server connections, and a Medium analysis tier. Each one is GitHub admitting something specific about the previous version.

What shipped#

Agent Skills. Teams now create a .github/skills/code-review/SKILL.md file in the repo. Copilot Code Review reads it on every PR and applies the team's naming conventions, architecture rules, and security patterns. The mechanism is a checked-in markdown file that Copilot treats as ground truth.

MCP server connections. Once configured, Copilot Code Review pulls context directly into the review from third-party platforms and internal systems, including issue trackers, documentation, service catalogs, and incident tooling.

Medium tier. A new analysis tier routes pull requests to a higher-reasoning model for "deeper analysis of complex logic, security-sensitive code, and cross-service changes." GitHub's stated reason: the standard tier produced too many false positives and missed subtle bugs.

All three are in public preview for Pro, Pro+, Business, and Enterprise subscribers.

What each change admits#

The existence of Agent Skills admits that generic AI review without team context produces output your team will ignore. SKILL.md is the patch for a tool that flagged style without knowing the style.

MCP integration admits that the PR diff is not enough signal. A reviewer needs the issue that motivated the change, the runbook that documents the service, and the postmortem that explained the last incident. The original Copilot Code Review tried to reason about a 200-line diff with none of that. The new version reads from the same systems a human reviewer would consult.

Medium tier admits that the default reviewer was insufficient on the categories that matter most: security and cross-service logic. Routing those PRs to a different model is GitHub's response to its own product missing things.

Copilot Code Review is from the same model family that wrote the PR it is reviewing. When a developer ships a hardcoded secret using Copilot Chat, the chance Copilot Code Review flags that secret on the PR is structurally lower than an independent reviewer's chance. The model has already internalized "this pattern is acceptable" because it produced the pattern.

Agent Skills does not solve this. SKILL.md tells the reviewer what to look for. It does not tell the reviewer that its own training data is the source of the problem.

MCP does not solve this either. Pulling in incident reports gives the reviewer richer context. It does not change which model is doing the reading.

The Medium tier moves to a higher-reasoning model from the same family. The blind spot persists at lower probability.

The only structural fix is a reviewer trained or configured independently of the author. Independence in this context is not a marketing word. It is a property of which weights produced the diff and which weights are reading it.

What this changes for engineering teams this week#

SKILL.md is now a checked-in artifact in your repository. It carries instructions the reviewer will follow. Treat it the way you treat CI configuration: code review the changes, restrict who can edit it, version it explicitly.

MCP-connected reviewers can pull from systems that contain customer data and credentials. Confirm what the configured MCP servers expose before turning the integration on. The default in GitHub's docs assumes a permissive environment.

The Medium tier costs more compute per review. Tracking which PR categories trigger it will surface where the standard tier is silently undershooting.

For PRs that pass Copilot Code Review and still ship with issues, the remaining gap is independence. A review by a different model family on the same diff catches a class of failure that same-family review cannot see.

Hyrax handles every PR, every commit, from a model independent of the editor that produced the change. More on the review-the-reviewer problem is at hyrax.dev.