MCP Complete Tutorial - Connect Local AI Agent (Ollama) to Tools with MCP Server and Client
Based on Venelin Valkov's video on YouTube. If you like this content, support the original creators by watching, liking and subscribing to their content.
MCP standardizes AI-to-tool integration by separating tool hosting (server) from tool usage (client) through a common protocol.
Briefing
Model Context Protocol (MCP) is positioned as a standardized way to connect AI models to external tools and data without hand-building bespoke integrations for every service. Instead of wiring an agent directly to systems like GitHub, Slack, or a database, MCP introduces a host/client/server pattern where the client discovers what a server can do—its available resources, tools, and prompts—then calls those capabilities through a common interface. The practical payoff is portability: a tool exposed via MCP can be plugged into different AI applications using the same protocol mechanics.
The tutorial frames MCP as an “adapter” layer between AI applications and the services they call. Under the hood, MCP uses JSON-RPC for request/response messaging and supports two main transports: for local setups, communication can run over standard input/output (stdio); for more production-friendly deployments, it can run over HTTP using Server-Sent Events (SSE), enabling streaming responses. Before any tool calls happen, MCP performs a handshake in which both sides advertise their capabilities and protocol version, so the client knows which resources, tools, and prompts the server offers.
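To make the handshake concrete, here is an abbreviated, illustrative sketch of the initialize exchange written as Python dicts. The field names follow the MCP specification, but the exact capability payloads vary by SDK and protocol version, so treat the values as assumptions:

```python
# Illustrative initialize request; in practice the MCP SDK sends this for you.
initialize_request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "initialize",
    "params": {
        "protocolVersion": "2024-11-05",  # version the client speaks
        "capabilities": {},               # features the client supports
        "clientInfo": {"name": "todo-client", "version": "0.1.0"},
    },
}

# The server replies with its own version and advertised capabilities,
# e.g. which of resources / tools / prompts it exposes.
initialize_response = {
    "jsonrpc": "2.0",
    "id": 1,
    "result": {
        "protocolVersion": "2024-11-05",
        "capabilities": {"resources": {}, "tools": {}, "prompts": {}},
        "serverInfo": {"name": "todo-server", "version": "0.1.0"},
    },
}
```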
MCP servers provide three core building blocks. “Resources” act like context endpoints (similar in spirit to REST GET requests), returning data based on parameters such as a resource path. “Tools” are the actionable functions an agent can invoke; they include parameter schemas and documentation so the client can present them correctly to the model. “Prompts” let servers supply reusable, parameterized prompt templates, shifting prompt construction away from the client.
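As a sketch of how the three primitives map onto the Python SDK’s FastMCP decorators (the resource URI, function names, and return values below are illustrative assumptions, not code from the video):

```python
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("Demo")

# Resource: read-only context, addressed like a parameterized GET endpoint.
@mcp.resource("tasks://{task_id}")
def get_task(task_id: str) -> str:
    return f"Task {task_id}: buy groceries"

# Tool: an action the model may invoke; the type hints become its input schema.
@mcp.tool()
def complete_task(task_id: str) -> str:
    """Mark a task as completed."""
    return f"Task {task_id} marked as completed"

# Prompt: a reusable, parameterized prompt template served to clients.
@mcp.prompt()
def summarize_tasks(user_name: str) -> str:
    return f"Summarize the open tasks for {user_name}."
```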
A key implementation walkthrough builds a minimal FastMCP server that exposes a single tool, list tasks, which returns a to-do list filtered by a max-results parameter. The server is created as an MCP instance configured with a name, host, port, and an SSE transport path. Running the server locally also makes it reachable from an MCP inspector, which lists the discovered resources, prompts, and tools and performs health checks. Through the inspector, the tool’s input schema becomes visible, and direct tool calls return the expected task list.
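A minimal version of that server might look like the following sketch, assuming the tool is named list_tasks and the server listens on localhost port 8000 (the sample data and all names here are assumptions based on the description above):

```python
from mcp.server.fastmcp import FastMCP

# Name, host, and port configure the MCP instance; SSE is the transport.
mcp = FastMCP("TodoServer", host="127.0.0.1", port=8000)

TASKS = [
    {"id": 1, "name": "Buy groceries", "completed": False},
    {"id": 2, "name": "Write report", "completed": True},
    {"id": 3, "name": "Call dentist", "completed": False},
]

@mcp.tool()
def list_tasks(max_results: int = 10) -> list[dict]:
    """Return up to max_results to-do items."""
    return TASKS[:max_results]

if __name__ == "__main__":
    # Serves JSON-RPC over Server-Sent Events (at /sse by default).
    mcp.run(transport="sse")
```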
The tutorial then connects a custom MCP client to that server using the official MCP SDK. The client constructs the server URL, opens bidirectional streams, initializes the session, and queries available tools. Tool metadata—names, descriptions, and JSON-schema-like input types—arrives from the server, and the client can invoke a tool by name with structured arguments. The response comes back as content suitable for downstream agent logic.
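A client along those lines, built with the official MCP Python SDK, might look like this sketch (the URL and tool arguments are assumptions that match the server sketch above):

```python
import asyncio

from mcp import ClientSession
from mcp.client.sse import sse_client

async def main() -> None:
    # Open bidirectional streams to the locally running SSE server.
    async with sse_client("http://127.0.0.1:8000/sse") as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()  # handshake: versions + capabilities

            # Tool metadata: names, descriptions, JSON-schema-like input types.
            listed = await session.list_tools()
            for tool in listed.tools:
                print(tool.name, tool.description, tool.inputSchema)

            # Invoke a tool by name with structured arguments.
            result = await session.call_tool("list_tasks", {"max_results": 2})
            print(result.content)

asyncio.run(main())
```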
From there, the tutorial scales the concept into a complete to-do application. It defines an in-memory task manager (task objects with id, name, and completion status) and wraps its operations—get tasks, add task, remove task, complete task—into MCP tools. On the client side, the MCP tool list is converted into LangChain-compatible structured tools so an LLM can decide when to call which MCP tool. The agent runs an iterative loop with a max-iteration guard, using a system prompt that instructs the model how to represent task completion (e.g., emoji markers) and when to invoke tools.
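A sketch of both pieces follows: converting MCP tool metadata into LangChain StructuredTool objects, then running a guarded tool-calling loop. The helper names, system prompt, and iteration limit are assumptions, and passing a JSON-schema dict as args_schema requires a reasonably recent langchain-core (older releases expect a Pydantic model):

```python
from langchain_core.messages import ToolMessage
from langchain_core.tools import StructuredTool
from mcp import ClientSession

SYSTEM_PROMPT = (
    "You manage the user's to-do list. Use the available tools to read or "
    "modify tasks, and mark completed tasks with a checkmark emoji."
)

def mcp_tools_to_langchain(session: ClientSession, mcp_tools) -> list[StructuredTool]:
    """Wrap each MCP tool so a LangChain-driven LLM can invoke it by name."""
    tools = []
    for t in mcp_tools:
        async def call(tool_name=t.name, **kwargs):
            result = await session.call_tool(tool_name, kwargs)
            return result.content

        tools.append(
            StructuredTool.from_function(
                coroutine=call,
                name=t.name,
                description=t.description or "",
                args_schema=t.inputSchema,  # JSON-schema dict from the server
            )
        )
    return tools

async def run_agent(llm, tools, question: str, max_iterations: int = 5) -> str:
    """Iterative tool-calling loop with a max-iteration guard."""
    llm_with_tools = llm.bind_tools(tools)
    messages = [("system", SYSTEM_PROMPT), ("human", question)]
    for _ in range(max_iterations):
        response = await llm_with_tools.ainvoke(messages)
        messages.append(response)
        if not response.tool_calls:  # model answered without requesting a tool
            return response.content
        for call in response.tool_calls:  # execute each requested tool call
            tool = next(t for t in tools if t.name == call["name"])
            output = await tool.ainvoke(call["args"])
            messages.append(ToolMessage(content=str(output), tool_call_id=call["id"]))
    return "Stopped after reaching the iteration limit."
```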
Finally, everything is wired into a Streamlit UI. With a local Ollama model (configured as Qwen 2.5) running alongside the MCP server, users can ask questions like “What are my tasks for today?” or request new items, and the app translates those requests into MCP tool calls (e.g., list tasks or add task) and renders the agent’s updated results. The result is a working example of how standardized tool access can turn an AI agent into a functional app backend without custom glue code per tool.
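The UI glue can stay small; here is a sketch of the Streamlit wiring, where the qwen2.5 model tag, the server URL, and the reuse of mcp_tools_to_langchain and run_agent from the previous sketch are all assumptions:

```python
import asyncio

import streamlit as st
from langchain_ollama import ChatOllama
from mcp import ClientSession
from mcp.client.sse import sse_client

SERVER_URL = "http://127.0.0.1:8000/sse"  # assumed local MCP server

async def answer_question(llm, question: str) -> str:
    # mcp_tools_to_langchain and run_agent come from the earlier sketch.
    async with sse_client(SERVER_URL) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()
            listed = await session.list_tools()
            tools = mcp_tools_to_langchain(session, listed.tools)
            return await run_agent(llm, tools, question)

st.title("MCP To-Do Agent")
llm = ChatOllama(model="qwen2.5")  # assumes `ollama pull qwen2.5` has been run

if question := st.chat_input("Ask about your tasks..."):
    with st.chat_message("user"):
        st.write(question)
    with st.chat_message("assistant"):
        st.write(asyncio.run(answer_question(llm, question)))
```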
Cornell Notes
MCP (Model Context Protocol) standardizes how AI models connect to external tools and data. It uses a client/server model where the client handshakes with the server to learn available Resources, Tools, and Prompts, then calls tools through JSON-RPC over transports like stdio or HTTP SSE. The tutorial demonstrates a minimal FastMCP server exposing a list tasks tool, verifies it via an MCP inspector, and then builds a full to-do agent that converts MCP tools into LangChain structured tools. An Ollama-hosted Qwen 2.5 model drives an iterative tool-calling loop, while a Streamlit app provides the UI. The approach matters because it makes tool integration reusable and portable across AI applications.
What problem is MCP trying to solve for AI app developers, and why does it matter?
How do Resources, Tools, and Prompts differ in MCP, and what does each enable?
What happens during the MCP handshake, and why is it necessary?
Why does the tutorial emphasize transports like stdio vs HTTP SSE?
How does the to-do agent connect MCP tools to LangChain so the model can call them?
What does the agentic loop do, and how does it prevent runaway behavior?
Review Questions
- In MCP, what information must be exchanged during the handshake, and how does that affect later tool calls?
- Describe how a LangChain structured tool is created from an MCP tool, including what metadata is used (name, description, input schema).
- What are two different MCP transports mentioned, and what practical scenario does each fit best?
Key Points
1. MCP standardizes AI-to-tool integration by separating tool hosting (server) from tool usage (client) through a common protocol.
2. MCP servers expose three capability types (Resources, Tools, and Prompts) that the client discovers after an initial handshake.
3. MCP uses JSON-RPC messaging and supports transports like stdio for local setups and HTTP SSE for streaming in more production-friendly deployments.
4. An MCP inspector can validate what a server offers (tools, schemas, prompts) and helps debug tool invocation end-to-end.
5. To build an agent app, MCP tool definitions can be converted into LangChain structured tools so the model can trigger tool calls with structured arguments.
6. A complete application can be assembled by combining an MCP server (task operations), an MCP client (tool invocation), an LLM loop (iterative tool calling), and a Streamlit UI for user interaction.