What is an MCP server? A plain-English guide

If you have spent any time around AI tools lately, you have probably seen "MCP" come up and wondered whether it matters. Short answer: it does, and it is simpler than the acronym suggests. An MCP server is the standard way to let an AI assistant — Claude, ChatGPT, Microsoft Copilot — actually use your tools and data, instead of just talking about them.

This is a plain-English explainer: what an MCP server is, how it works, and when you would actually want one. No hype.

The problem MCP solves

A large language model on its own is a closed box. It is good at language, but it cannot see your calendar, query your database, file a ticket, or read last quarter's numbers. To make it useful against real work, you have to connect it to real systems.

Before MCP, every one of those connections was bespoke. If you wanted Claude to read from your CRM and ChatGPT to do the same, you built two different integrations. Add a third assistant and you built a third. Every tool, every assistant, every team — a new custom bridge. It did not scale.

MCP, the Model Context Protocol, is an open standard that replaces those one-off bridges with a single common one. Build the connection once, as an MCP server, and any MCP-compatible assistant can use it. It is often described as "USB-C for AI": one standard plug instead of a drawer full of adapters. The comparison is glib, but accurate.

What an MCP server actually is

An MCP server is a small program that sits between an AI assistant and a system you want the AI to reach. It exposes that system as a set of tools (actions the AI can take — "create a ticket", "search users") and resources (data the AI can read — a document, a record, a report).

The AI assistant is the client. When you ask it to do something, it looks at the tools the server advertises, decides which one fits, and calls it. The server does the real work — talks to your API, runs the query, returns a clean result — and hands the answer back to the model. The model never touches your system directly; it goes through the server, which is exactly where you put the rules.

That client/server split is the whole idea. The assistant brings the language and the reasoning. The MCP server brings the access, the permissions, and the guardrails.

A concrete example

Say you want Microsoft Copilot to answer: "who in the finance team reports to Maria, and are any of their accounts about to expire?" That question spans your directory and your identity data — Microsoft Graph, in practice.

An MCP server for Microsoft Graph would expose a couple of tools: one to look up people, one to check account status. Copilot calls them, the server queries Graph with scoped, audited permissions, trims the response to what matters, and Copilot turns it into a sentence. The user gets an answer; your IT team gets an access path they can govern. I wrote a separate, hands-on guide on building exactly that if you want the code.

MCP server vs an API vs a plugin

A fair question: isn't this just an API? Not quite, and the difference is the point.

An API is built for developers, who read documentation and write exact code against it.
An MCP server is built for an AI model, which decides at runtime which tool to call based on a plain-language description. The server's job is to describe its tools well enough that a model can choose correctly, validate what the model sends, and return results shaped for a model to read.

It usually sits in front of your existing APIs rather than replacing them. And unlike the proprietary "plugins" each AI vendor shipped early on, MCP is an open standard — one server works across assistants, instead of locking you to a single one.

Why it matters

The shift is from "AI that talks" to "AI that does". A model that can describe how to reset a password is a demo. A model that can actually reset it — safely, with the right approvals and an audit trail — is a tool. MCP servers are how you cross that line.

For a business, the practical effect is leverage: connect your systems once, and every assistant your team uses can act on them. You are not rebuilding the same integration for each new AI product that comes along; you are building to a standard that those products adopt.

When you would actually want one

You want an MCP server when you have an AI assistant in play and a real system you want it to use — your Microsoft 365 tenant, ServiceNow, an internal database, a line-of-business API. The higher the value of the actions, and the higher the cost of getting them wrong, the more the server earns its place, because that is where access control and guardrails live.

You do not need one to chat with a model, summarize a document you paste in, or draft text. MCP is about reach, not language.

The honest caveats

MCP is young. The specification and the SDKs are moving quickly, so anything you build is worth revisiting. The transport and auth story has already changed once — the early Server-Sent-Events transport is being retired in favour of streamable HTTP. And an MCP server that can act on a tenant is a security surface in its own right: least-privilege permissions, authentication on both sides, and logging are not optional extras.

None of that is a reason to wait. It is a reason to build carefully, ideally with someone who has done it before.

The short version

An MCP server is the standard adapter that lets an AI assistant use your real tools and data — built once, usable by any compatible assistant, with the permissions and guardrails living in the server where they belong. It is the unglamorous plumbing that turns an impressive demo into something that does real work.

If you are weighing whether an MCP integration makes sense for your Microsoft stack, that is the work I do — and the build guide is a good next read if you want to see what one looks like in code.