Microsoft adds FIDES security to Agent Framework

Microsoft has released FIDES in Agent Framework to block prompt injection and data exfiltration with deterministic policy enforcement. The feature labels content by trust and confidentiality, then checks tool calls before sensitive actions can run.

Microsoft has introduced FIDES, short for Flow Integrity Deterministic Enforcement System, as an experimental security feature in Agent Framework. The release targets prompt injection, which it describes as the #1 risk on the OWASP LLM Top 10, by replacing heuristic defenses with middleware that enforces information-flow control. Content is labeled along two dimensions, trusted or untrusted for integrity, and public, private, or user_identity for confidentiality. Those labels propagate automatically through tool calls, messages, and context providers, and are checked before a sensitive tool is allowed to execute.

The core example is a GitHub issue triage agent that reads public issue bodies, posts follow-up comments, reads files, and writes patches. In the attack scenario, a malicious issue includes hidden instructions telling the agent to read .env and post its contents publicly. FIDES treats the entire issue body as untrusted as soon as read_issue(…) returns it. That allows the agent to summarize or classify the report, but it blocks privileged actions when policy rules are violated. A call to write_file(…) can be refused because the tool declares accepts_untrusted=False, and a call to post_comment(…) can be blocked because the tool limits output with max_allowed_confidentiality=”public” while private content is in scope. With approval_on_violation=True, blocked actions become human approval prompts instead of silent failures.

Microsoft says the main difference from defensive prompts, sanitization, or monitoring is determinism. Prompt injection works because a model cannot reliably distinguish developer instructions from instructions embedded inside data. FIDES moves the decision away from the model and into the framework. LabelTrackingFunctionMiddleware propagates security labels across tool outputs and downstream transformations, while PolicyEnforcementFunctionMiddleware checks the current context before each tool invocation. The result is a split where the model can decide what it wants to do, but the framework decides what it is allowed to do.

The system also includes a stricter isolation option for untrusted text. With auto_hide_untrusted=True, untrusted tool output is replaced by a var_<id> reference, stored separately, and processed by quarantined_llm using a separate tools-free model. In that mode, the main model never reads raw attacker-supplied text directly. Microsoft positions this as stronger defense-in-depth, though it adds another model call and means the main agent works from sanitized summaries rather than the original content. With auto_hide_untrusted=False, the main model can still read the raw content, but policy enforcement remains active.

FIDES ships in the core package in version 1.3.0 and later and is marked experimental. Microsoft says it is best suited for agents that ingest content from uncontrolled sources, operate privileged tools, or handle mixed-sensitivity data that must not flow into public outputs. The company also notes limitations, including opt-in labels per data source, conservative most-restrictive-wins propagation, coarse approval workflows, and a single-turn quarantined LLM design. Sample applications include email_security_example.py and repo_confidentiality_example.py, and Microsoft is directing broader feedback on the security model to discussion #5624.

63

Impact Score

Pope Leo XIV to publish encyclical on Artificial Intelligence

Pope Leo XIV’s first encyclical, “Magnifica Humanitas,” is set for release May 25 and will focus on Artificial Intelligence and the protection of human dignity. The Vatican will mark the publication with an unusual press conference featuring the pope, senior cardinals, theologians and an Anthropic co-founder.

AMD starts Venice production on TSMC 2 nm

AMD says its next-generation EPYC processor, Venice, is ramping production in Taiwan on TSMC’s 2 nm process technology. The company also plans a future production ramp at TSMC’s Arizona fabrication facility for data center and Artificial Intelligence infrastructure.

Tech researchers challenge Trump visa policy over online safety work

A lawsuit from the Coalition for Independent Technology Research is challenging a Trump administration visa policy that critics say targets fact-checking, trust and safety, and disinformation research. The case could shape how researchers, platforms, and the public understand online harms and free speech.

Anthropic pushes deeper automation with Claude Code

Anthropic used its London developer event to present a software workflow where Claude increasingly writes, tests, and revises code with minimal human intervention. The pitch landed with an audience already comfortable shipping code generated by Artificial Intelligence, even as concerns over review, security, and developer skill remain unresolved.

Contact Us

Got questions? Use the form to contact us.

Contact Form

Clicking next sends a verification code to your email. After verifying, you can enter your message.