Are Prompt Engineering, Context Engineering, and MCP Enough, or Do Modern AI Agents Need Skills?

Most people believe building AI agents is just about writing better prompts. That’s out of date.

Modern AI agents or systems are built on four layers i.e Prompt Engineering,Context Engineering,Skills, and MCP (Model Context Protocol) connectors. Understanding these layers is necessary if you want your agent to avoid hallucination, use tools effectively and perform well in real workflows.

Let’s break this properly.

Prompt Engineering: 

Prompt engineering refers  to the process of  designing and structuring inputs to get the best possible output from an AI model like ChatGPT. Prompt engineering is the systematic design of input instructions that guide a large language model to perform specific tasks accurately without modifying its internal parameters.. In simple terms, it is:

How do you ask the model to do something?

How it works:

When you give a prompt:

  1. You provide input (a question, instruction, or task)
  2. The AI analyzes patterns and context
  3. It predicts the most likely useful response
  4. The structure of your prompt strongly influences that prediction

A good prompt will have elements of clarity, context, proper format, constraints, and examples.

For example:

  • A weak prompt would be: “Tell me about marketing.”
  • A stronger prompt would be: “Explain digital marketing strategies for a small clothing business in 5 bullet points, including practical examples.”

Context Engineering: 

Context engineering is the practice of designing, organizing, and managing contextual information so that machines can act in ways that align with human intentions and improve task performance.

As described in a Medium blog by Vijay Kumar:  “Context engineering is the discipline of designing, structuring, and optimizing the contextual information provided to AI systems to achieve desired outcomes.”

In simple terms, it answers the question: What information do you give to the model before it thinks?.

This includes: chat history, user data, retrieved documents (RAG), tool descriptions, and system instructions.

Modern AI agents cannot rely on simple chat history. As highlighted in advanced agent systems, most agents fail because they behave like: “stateless chatbots that read, guess, and forget.”

Memory is a key component in modern AI agents that allows them to move beyond stateless behavior and handle long conversations, multi-step tasks, and personalization. Instead of relying on full chat history, AI systems extract important information from interactions, store it efficiently, and retrieve only the most relevant context when needed.In context engineering, memory is often organized into  4 Major Memory Layer Systems Hindsight, Mem0, Acontext, and Zep.

Example

Without context engineering:
When a user asks, “What did the client say yesterday?”
The AI responds: “I don’t know.”

With context engineering:
The system injects relevant data, such as yesterday’s Slack messages or meeting notes from Notion.

The AI responds:
“Yesterday, the client requested a timeline update and asked for a dem 

MCP (Model Context Protocol)

Model Context Protocol (MCP) is a standardized interface that enables large language models to interact with external tools, APIs, and data sources in a structured and interpretable way.

By default, AI cannot access real-time data, cannot call APIs directly, and cannot take actions. MCP solves this by standardizing how tools are exposed to the AI.

Examples of MCP Servers:

Database, GitHub, Slack, WhatsApp API, and Internal APIs

How MCP works:

  1. Tools are registered in MCP server
  2.  AI agent receives user query
  3.  AI decides which tool to use
  4.  MCP server executes the tool
  5.  Result comes back to AI

Skills in AI agents

Skills are modular, reusable workflows that encode instructions, decision logic, and tool usage patterns, enabling AI agents to perform complex multi-step tasks consistently.

Skills are predefined, reusable capabilities that an AI agent can use to perform specific tasks.A skill is NOT just a prompt. It includes instructions, steps / workflow, constraints, output format, and tool usage logic.

Skill Structure

A skill contains:

  • SKILL.md (instruction main logic)
  • scripts/ (optional code)
  • references/ (docs)
  • assets/ (templates)

How Skills work

  1. A skill is defined
  2. User gives a request
  3. AI selects the skill
  4. Skill executes internally
  5. Final result returned      

Every skill in an AI agent system starts with a SKILL.md file, which acts as the main instruction file for that skill.

The SKILL.md file is a simple Markdown document that contains:

  1. YAML frontmatter
  2. Instructions

It basically tells the AI: What this skill is, when to use it, and how to perform the task

People treat MCP and Skills like they’re the same thing. They are not. Let’s make them clear.

MCP (Connectivity) Skills (Knowledge)
Connects AI to external services (e.g., Notion, Asana, Linear) Teaches AI how to use those services
Provides access to tools, APIs, and real-time data Encodes workflows, logic, and best practices
Enables tool invocation (e.g., create task, fetch data) Guides when and how to call those tools
Focuses on capability (what AI can access) Focuses on intelligence (how AI should act)
Aspect Prompt Engineering Context Engineering Skills MCP (Model Context Protocol)
What it is It is the way you write instructions to guide the model on what kind of output you want. It is about controlling everything the model sees before it responds, not just the instruction. It is a structured way of teaching the agent how to complete a task step by step with logic and rules. It is a standard system that allows the agent to communicate with external tools and APIs.
Main purpose It helps the model understand what you are asking for in a single interaction. It ensures the model has the right information and environment to make better decisions. It ensures the agent performs tasks consistently without guessing every time. It allows the agent to actually perform actions like saving data or sending messages.
Focus area It focuses only on the instruction or question you give. It focuses on the full input environment including history, data, and tools. It focuses on execution logic and workflow for specific tasks. It focuses on integration and communication with external systems.
Scope It is limited to one request or response cycle. It works across multiple steps and maintains continuity. It is reusable across many tasks and sessions. It works across different systems and tools universally.
Dependency It depends heavily on how well you write the instruction. It depends on how well you structure and inject relevant data. It depends on having clear workflows and defined actions. It depends on properly defined APIs or tool servers.
Reusability It is usually rewritten again and again for similar tasks. It can be reused but often changes based on the situation. It is designed to be reused across different agents and workflows. It is reusable across multiple agents and platforms once built.
Example usage You write a prompt to generate an email or summarize text. You provide chat history, user data, and documents before asking the model. You define a process like lead qualification or result generation with clear steps. You connect the agent to systems like databases, WhatsApp, or CRM tools.
Limitation It fails when tasks become complex or multi-step. It becomes hard to manage if the context is too large or noisy. It cannot work without tools if external actions are needed. It does not tell the agent when or how to use tools.
Role in system It acts as the entry point for user intent. It acts as the environment builder for the agent. It acts as the intelligence layer for task execution. It acts as the action layer that connects to the real world.

What problem do Skills solve?

Problem 1: Prompt repetition

Without skills you write long prompts again and again. Skills fix this by storing logic once.

Problem 2: Inconsistent results

Without skills the same task may lead to  different outputs. Skills enforce structure

Problem 3: Complex workflows

Without skills AI struggles with multi-step tasks. Skills define step-by-step execution

Problem 4: Tool misuse

Without skills AI may call wrong tools.Skills guide correct tool usage via MCP

How Skills and MCP Work Together

When a user provides input, the AI agent does not immediately execute an action. Instead, it first performs reasoning to understand the user’s intent, the task requirements, and the desired outcome. This reasoning step is important because it determines whether the request can be answered directly or requires a structured workflow.

If the task is complex or action-oriented, the agent then selects an appropriate skill. A skill acts as a predefined workflow that guides the agent on how to handle the task. It defines the sequence of steps, the logic to follow, and most importantly, which tools should be used, when they should be used, and how they should be used. In this way, the skill ensures that the agent follows a consistent and correct approach rather than making random decisions.

Once the skill determines the required actions, the execution is handled by MCP (Model Context Protocol). MCP provides a standardized interface for interacting with external tools and services, such as databases, APIs, or messaging platforms. It takes the instructions generated by the skill and performs the actual operations.

After the tools are executed, the results are returned to the agent. The agent may further process this information, combine it with context, and then generate a final response for the user.

The key insight here is that MCP enables action, while skills ensure that the action taken is correct, structured, and aligned with the intended outcome.

How Skills Improve Other Layers

Skills significantly improve prompt engineering by eliminating the need to write long prompts repeatedly, as they have optimized prompts internally.

They also enhance context engineering by clearly defining what information is required, which helps reduce irrelevant data and improves overall efficiency.

In addition, skills improve tool usage by preventing incorrect API calls and ensuring that tools are used in the correct sequence.

Finally, skills bring consistency to AI systems, ensuring that the same task produces reliable and predictable results every time

Aspect Prompt Engineering Context Engineering Skills MCP (Model Context Protocol)
What it is It is the way you write instructions to guide the model on what kind of output you want. It is about controlling everything the model sees before it responds, not just the instruction. It is a structured way of teaching the agent how to complete a task step by step with logic and rules. It is a standard system that allows the agent to communicate with external tools and APIs.
Main purpose It helps the model understand what you are asking for in a single interaction. It ensures the model has the right information and environment to make better decisions. It ensures the agent performs tasks consistently without guessing every time. It allows the agent to actually perform actions like saving data or sending messages.
Focus area It focuses only on the instruction or question you give. It focuses on the full input environment including history, data, and tools. It focuses on execution logic and workflow for specific tasks. It focuses on integration and communication with external systems.
Scope It is limited to one request or response cycle. It works across multiple steps and maintains continuity. It is reusable across many tasks and sessions. It works across different systems and tools universally.
Dependency It depends heavily on how well you write the instruction. It depends on how well you structure and inject relevant data. It depends on having clear workflows and defined actions. It depends on properly defined APIs or tool servers.
Reusability It is usually rewritten again and again for similar tasks. It can be reused but often changes based on the situation. It is designed to be reused across different agents and workflows. It is reusable across multiple agents and platforms once built.
Example usage You write a prompt to generate an email or summarize text. You provide chat history, user data, and documents before asking the model. You define a process like lead qualification or result generation with clear steps. You connect the agent to systems like databases, WhatsApp, or CRM tools.
Limitation It fails when tasks become complex or multi-step. It becomes hard to manage if the context is too large or noisy. It cannot work without tools if external actions are needed. It does not tell the agent when or how to use tools.
Role in system It acts as the entry point for user intent. It acts as the environment builder for the agent. It acts as the intelligence layer for task execution. It acts as the action layer that connects to the real world.

About the author

Malik Sami

Doctoral Researcher

Scroll to Top