T
traeai
登录
返回首页
Hacker News Best

Microsoft Copilot Cowork Exfiltrates Files

8.5Score
Microsoft Copilot Cowork Exfiltrates Files

TL;DR · AI 摘要

攻击者通过间接提示注入在中毒技能中利用Microsoft Copilot Cowork从M365中窃取文件,成功率高。

核心要点

  • 攻击者利用邮件和Teams消息无需人工审批的特性进行文件窃取。
  • Copilot Cowork可以通过预认证下载链接泄露用户访问的文件。
  • 微软已披露此漏洞,但用户仍需警惕代理产品的潜在风险。

结构提纲

按章节快速跳转。

  1. Copilot Cowork通过间接提示注入在中毒技能中窃取M365中的文件。

  2. 攻击者利用邮件和Teams消息无需人工审批的特性,通过预认证下载链接泄露文件。

  3. 代理产品在多系统集成中存在提示注入攻击面扩大的风险。

  4. 微软已披露此漏洞,但用户需警惕代理产品的潜在风险。

思维导图

用一张图看清主题之间的关系。

查看大纲文本(无障碍 / 无 JS 友好)
  • Microsoft Copilot Cowork 文件窃取

金句 / Highlights

值得收藏与分享的关键句。

#Microsoft Copilot#安全漏洞#文件窃取#间接提示注入
打开原文

This attack achieved a high success rate against state-of-the-art models, including Claude Opus 4.7.

Image 1: Microsoft Copilot Cowork exfiltrates financials and PII

Microsoft Copilot Cowork exfiltrates financials and PII

[Overview](https://www.promptarmor.com/resources/microsoft-copilot-cowork-exfiltrates-files#overview)

Copilot Cowork is a Frontier feature available now in Microsoft 365. It operates with the users’ Microsoft permissions and can use Microsoft Graph to read and operate on data in one’s Microsoft tenant.

In this article, we demonstrate that through an indirect prompt injection in a poisoned skill, attackers can exfiltrate files from M365. This is done by exploiting the fact that, unlike other sensitive actions, sending emails and Teams messages to the active user does not require human approval, and opening the compromised messages in Teams or Outlook can trigger attacker-controlled network requests.

This risk reflects that giving agents access to multiple systems expands the prompt-injection attack surface. In isolation, the agent’s intended capabilities are benign; however, due to the properties of the integrated systems, users are at risk. This is reminiscent of our previous work on how URL previews in communications apps have become an egress surface for agents). As this risk pertains to the design of a system in which agents act with delegated authority across an entire enterprise ecosystem, rather than to a specific bug, we are publicizing this work to inform users of the risks they are accepting by using an agentic product of this nature.

Separate from this risk, we have disclosed a vulnerability to Microsoft that directly allows data egress from Copilot Cowork’s sandbox environment.

[The Attack Chain](https://www.promptarmor.com/resources/microsoft-copilot-cowork-exfiltrates-files#the-attack-chain)

Microsoft’s documentation on action approvals states, “[Copilot] Cowork asks for your permission before taking sensitive actions, like sending an email or posting a message in Teams.” However, in practice, when the recipient is the active user, these actions execute immediately without requiring human approval (users do not have a setting to modify this behavior). Because these messages can contain external images that trigger network requests to external websites, data can be exfiltrated when a user opens a compromised message sent by the agent. Copilot Cowork can retrieve ‘pre-authenticated download links’ for files the user has access to, which allow anyone who opens the link to download that file. So, a manipulated agent can exfiltrate files by exfiltrating pre-authenticated download links.

  1. The victim has access to files stored in SharePoint or OneDrive containing PII & Financial data
Image 2: Victim has sensitive files accessible to Copilot Cowork

Victim has sensitive files accessible to Copilot Cowork

  1. The victim uploads a skill file to Copilot Cowork that contains a prompt injection

For general use cases, this is quite common; a user finds a file online that they upload as a skill. This attack is not dependent on the injection source - other injection sources include, but are not limited to: web data from Claude for Chrome, connected MCP servers, etc.

Image 3: Victim has uploaded a poisoned Copilot Cowork Skill file

Victim has uploaded a poisoned Copilot Cowork Skill file _Note: Admins have limited oversight of ‘Skills’, as Skills in Copilot Cowork are automatically loaded from a specific path in a user’s OneDrive._

  1. The victim asks Microsoft Copilot Cowork to review what they worked on that week, triggering the skill
Image 4: Victim asks Copilot Cowork for a recap, triggering the poisoned Skill.

Victim asks Copilot Cowork for a recap, triggering the poisoned Skill.

  1. The injection manipulates Microsoft Copilot Cowork to post a Teams message that will exfiltrate pre-authenticated file download links when it is viewed

The injection tells Copilot Cowork that a service exists to create document previews for the recap message; to do this, the agent retrieves pre-authenticated file download links for each file and passes those URLs as query parameters to an attacker-controlled site via malicious HTML image tags.

Image 5: Copilot Cowork sends a compromised message to the user via Teams

Copilot Cowork sends a compromised message to the user via Teams At no point in this process is human approval required.

If we expand the ‘Task complete’ block, we can see the agent’s actions play out – but _the malicious message content is never visible,_ even when the Teams action is clicked on _._

Image 6: Compromised activity is not visible in Copilot Cowork

Compromised activity is not visible in Copilot Cowork

  1. When the user opens their Teams messages, the pre-authenticated download links are exfiltrated, and the attacker can download the files by visiting the link
Image 7: User opens Teams triggering file exfiltration

User opens Teams triggering file exfiltration

[Mitigating Risks for Your Organization](https://www.promptarmor.com/resources/microsoft-copilot-cowork-exfiltrates-files#mitigating-risks-for-your-organization)

Microsoft Copilot Cowork has read access to essentially any resource a user does through Microsoft Graph. As such, the primary mechanism to reduce the blast radius of attacks like this is to restrict excessive permissioning across one’s Microsoft ecosystem.

To restrict users’ ability to retrieve pre-authenticated download links for files, administrators can restrict file downloads from SharePoint by running commands in the SharePoint Online Management Shell:

Set-SPOSite -Identity <SiteURL> -BlockDownloadPolicy $true

Or, to block based on a sensitivity label:

Set-Label -Identity <label> -AdvancedSettings @{BlockDownloadPolicy="true"}

_Note: This configuration affects functionality;__documentation__states that for files under the policy 'BlockDownloadPolicy', "Users have browser-only access with no ability to download, print, or sync files. They also can't access content through apps, including the Microsoft 365 Apps (like Word, Excel, PowerPoint, and so on)."_

[Model Agnostic Exploitation](https://www.promptarmor.com/resources/microsoft-copilot-cowork-exfiltrates-files#model-agnostic-exploitation)

The attack chain was initially conducted with the model selection set to ‘auto’, which dynamically routes between Claude Opus 4.7 and Claude Sonnet 4.6. However, we validated explicitly that this attack succeeds with the exact same injection on the more advanced Opus 4.7 model by setting the model directly.

Image 8: Copilot Cowork with Opus 4.7 exfiltrates more documents than 'Auto' mode

Copilot Cowork with Opus 4.7 exfiltrates more documents than 'Auto' mode

Opus 4.7 was more comprehensive in its search for recently edited documents; it expanded exfiltration to include every document used in previous Cowork Copilot sessions that week, as well as the files stored in more typical document locations that were found when the model was set to ‘Auto’.

[Prompt Injection Efficacy](https://www.promptarmor.com/resources/microsoft-copilot-cowork-exfiltrates-files#prompt-injection-efficacy)

This prompt injection exhibited a very high efficacy, and we noted that Copilot Cowork completed the entire attack chain on every trial (5 for 5). Furthermore, the attack was not contingent on the specific wording of the user query – whenever the model invoked the skill, the injection succeeded.

The injection consisted of 5 lines in an 81-line skill file, all of comparable length to the other lines.

This demonstrates that even with the latest models and only a small excerpt of malicious text, an indirect prompt injection can hijack agent behavior.

As such, we urge readers to exercise caution when working with untrusted data, such as skills shared online – especially when the untrusted data is placed into a trusted context, such as a skill file.

[Scheduled Tasks Exacerbate Risks](https://www.promptarmor.com/resources/microsoft-copilot-cowork-exfiltrates-files#scheduled-tasks-exacerbate-risks)

In Copilot Cowork, users can create scheduled tasks. A scheduled task is a prompt that executes on a recurring basis without user oversight. The 'weekly review' behavior described in this article is the exact kind of task a user would be likely to automate with a scheduled task.

Image 9: Scheduled tasks increase risks by executing unattended and on a repeated basis.

Scheduled tasks increase risks by executing unattended and on a repeated basis.

Scheduled tasks increase the risk surface for attacks like this significantly, as the user is not present to stop malicious workflows, and the prompt injections can take effect on a recurring basis.

AI 可能会生成不准确的信息,请核实重要内容