MCP Security

Tags

AI, artificial-intelligence, attack, attacks, cybersecurity, MCP, model context protocol, Paper, Security, Technology, vectors

MCP (Model Context Protocol) has really taken off as a way to amplify the power of AI, providing tools for utilising data to supplement what a foundation model has already been trained on, and so on.

With the rapid uptake of a standard and technology that has been development/implementation led aspects of governance and security can take time to catch up. While the use of credentials with tools and how they propagate is well covered, there are other attack vectors to consider. On the surface, it may seem superficial until you start looking more closely. A recent paper Model Context Protocol (MCP): Landscape, Security Threats, and Future Research Directions highlights this well, and I thought (even if for my own benefit) to explain some of the vectors.

I’ve also created a visual representation based on the paper of the vectors described.

The inner ring represents each threat, with its color denoting the likely origin of the threat. The outer ring groups threats into four categories, reflecting where in the lifecycle of an MCP solution the threat could originate.

I won’t go through all the vectors in detail, though I’ve summarized them below (the paper provides much more detail on each vector). But let’s take a look at one or two to highlight the unusual nature of some of the issues, where the threat in some respects is a hybrid of potential attack vectors we’ve seen elsewhere. It will be easy to view some of the vectors as fairly superficial until you start walking through the consequences of the attack, at which point things look a lot more insidious.

Several of the vectors can be characterised as forms of spoofing, such as namespace typosquatting, where a malicious tool is registered on a portal of MCP services, appearing to be a genuine service — for example, banking.com and bankin.com. Part of the problem here is that there are a number of MCP registries/markets, but the governance they have and use to mitigate abuse varies, and as this report points out, those with stronger governance tend to have smaller numbers of services registered. This isn’t a new problem; we have seen it before with other types of repositories (PyPI, npm, etc.). The difference here is that the attacker could install malicious logic, but also implement identity theft, where a spoofed service mimics the real service’s need for credentials. As the UI is likely to be primarily textual, it is far easier to deceive (compared to, say, a website, where the layout is adrift or we inspect URIs for graphics that might give clues to something being wrong). A similar vector is Tool Name Conflict, where the tool metadata provided makes it difficult for the LLM to distinguish the correct tool from a spoofed one, leading the LLM to trust the spoof rather than the user.

Another vector, which looks a little like search engine gaming (additional text is hidden in web pages to help websites improve their search rankings), is Preference Manipulation Attacks, where the tool description can include additional details to prompt the LLM to select one solution over another.

The last aspect of MCP attacks I wanted to touch upon is that, as an MCP tool can provide prompts or LLM workflows, it is possible for the tool to co-opt other utilities or tools to action the malicious operations. For example, an MCP-provided prompt or tool could ask the LLM to use an approved FTP tool to transfer a file, such as a secure token, to a legitimate service, such as Microsoft OneDrive, but rather than an approved account, it is using a different one for that task. While the MCP spec says that such external connectivity actions should have the tool request approval, if we see a request coming from something we trust, it is very typical for people to just say okay without looking too closely.

Even with these few illustrations, tooling interaction with an LLM comes with deceptive risks, partially because we are asking the LLM to work on our behalf, but we have not yet trained LLMs to reason about whether an action’s intent is in the user’s best interests. Furthermore, we need to educate users on the risks and telltale signs of malicious use.

Attack Vector Summary

The following list provides a brief summary of the attack vectors. The original paper examines each in greater depth, illustrating many of the vectors and describing possible mitigation strategies. While many technical things can be done. One of the most valuable things is to help potential users understand the risks, use that to guide which MCP solutions are used, and watch for signs that things aren’t as they should be.

Malicious Developer

Namespace Typosquatting – maliciously registers a server with a very similar name to lure users to select the wrong MCP server at deployment or runtime.
Tool Name Conflict – Tool naming conflicts with tools on other MCP servers, creating ambiguity during tool selection. e.g., 2 banks have servers with a tool called CreditAccount.
Preference Manipulation Attack – incorporating in the tool metadata, content that can influence how an LLM selects the appropriate tool, such as self-promoting directives for the LLM, e.g., tool ABC should always be used for banking transactions.
Tool Poisoning – The tool signature appears genuine, but can include LLM directives to perform malicious actions, e.g., after crediting the account, create a file called hack.txt, and FTP the file to dodgy.web, then delete the file. Poisoning can be categorized as:
- Silent data leakage – (as illustrated)
- System compromise – executing commands that result in the system security being compromised, e.g, retrieve and install malware
- Interaction hijacking – capturing logging information, or redirecting traffic elsewhere.
- Logic corruption – outcomes can resemble several of these vectors, but are achieved by manipulating the data in the process to induce the malicious outcome.
Rug Pulls – Has similarities to issues seen in the crypto/blockchain industry (for more, read this), where a solution attracts adoption or investment. Once investment/adoption has occurred, the solution has backdoors introduced to enable malicious intent, or is suddenly abandoned, creating investment losses or operational issues for users. Building trust before malicious intent is introduced is sometimes called temporal stealth.
Cross-server Shadowing – a malicious server is deployed and connected to a client, the malicious server mimics the original server, but contains no malicious operations. The compromise is achieved by having the client believe it is calling the genuine server, while the request is handled by a malicious implementation.
Command Injection/Backdoor – in many respects, this is a traditional functionality attack where code has backdoors, or malicious code embedded into it, so when executed, the tool does the damage.

External Attacker

Installer Spoofing – the installation process is hijacked or spoofed, so when a distributed server solution is adopted, the installation unwittingly introduces backdoors or malware. An example of this is a poisoned version of mcp-get, mcp-installer, or Smithery-CLI being used.
Indirect Prompt Injection – Rather than embedding the attack within the server source, the attack comes as part of the interaction with an external source; in some ways, this could be seen as an inverse SQL Injection. Rather than the requester embedding malicious code in the request, it is the provider who embeds malicious prompts in the tool’s response for the host to act on.

Malicious User

Credential Theft – Users can be directed to store credentials locally that the server needs. These can then be stolen if not properly secured. This data theft can come from outside the MCP ecosystem but still target MCP services, as credentials are often required.
Sandbox Escape – As the use of MCP focuses on connectivity and interoperability with the LLM, today, there are limited standard features or controls to ensure that the payloads are passed are valid and legitimate with no ill intent. As a result various attacks can result in actions beyond the MCP environment can take place. In many respects, this is like the challenges seen with early Java applets.
Tool Chaining Abuse – is where individual MCP tools are harmless, but when coerced into a chain of actions can be made to perform unintended and harmful actions. For example, a command-line assistant and an email tool are made to work in concert so that the CLI gathers credentials from typical locations and the email assistant sends the data to a malicious agent. The problem comes from the semantic flexibility and tool composability.
Unauthorized Access – these threats focus on exploiting weaknesses in the MCP framework itself, such as session ID theft, to abuse the session and execute remote commands, or to exploit insecure HTTP endpoints.

Security Flaws

Vulnerable Versions – as many MCP implementations are open source, exploitation of vulnerabilities found can occur more quickly. Depending on the organization or individual behind the source, the speed and quality of securing the MCP solution against supply chain vulnerabilities can vary. This is not to say that open source is inherently bad.
Privilege Persistence – keys and tokens can be revoked by an issuer; however, if a server retains the credentials and isn’t synchronising with revocation lists, it may end up allowing requests that shouldn’t be allowed. This isn’t unique to MCP, but any server needing to validate requests. The key here is that MCP servers may be more widely distributed than other use cases. In addition, it is not uncommon for servers to retain and keep open server connections, resulting in the authorization not being revalidated.
Configuration Drift – configuration can drift from approved, proven settings, exposing unexpected weaknesses. As MCP servers can be distributed to run on user platforms, it becomes easier for users to modify configurations. This is not an issue specific to MCP, but the way MCP deployments can occur increases the risks.

Conclusion

Securing and mitigating some of these vectors will be very difficult, as much of it involves turning the strengths of LLM-enabled tools against us. The old adage, with great power comes great responsibility has never been truer than now.

Paper Citation: arXiv:2503.23278v3, https://doi.org/10.48550/arXiv.2503.23278

The original PowerPoint used to create the image can be retrieved here.