AI Agents Explained

What AI agents are, how the agent loop works, and why they're different from chatbots.

Trevor I. Lasn
· 7 min read

You tell ChatGPT to “fix the bug” and it gives you a code snippet. You copy it, paste it, realize it’s wrong, go back, paste the error, get another snippet. Repeat.

You tell Claude Code to “fix the bug” and it does everything itself. It opens your files, finds the problem, writes a fix, runs the tests. If a test still fails, it goes back, tries a different approach, and runs the tests again. When everything passes, it commits.

Same model underneath. The difference is agency.

An agent is just a program that can use tools, look at what happened, and decide what to do next — in a loop. A chatbot gives you one response and stops. An agent keeps going until the job is done.

Here’s the difference visually. A chatbot is one turn — you ask, it answers, you’re on your own.

Chatbot
You: "Fix the bug in auth.ts"
┌───────┐
│ Model │
└───┬───┘
    │
    ▼
"Here's the fix:
 Change line 42..."
Done. You go copy-paste it.

An agent loops. It thinks, does something, checks if it worked, and keeps going.

Agent
You: "Fix the bug in auth.ts"
┌─────────┐
│  Think  │ ◄──────────────┐
└────┬────┘                │
     │                     │
     ▼                     │
┌─────────┐                │
│   Act   │  call a tool   │
└────┬────┘                │
     │                     │
     ▼                     │
┌─────────┐    not done    │
│ Observe │ ───────────────┘
└────┬────┘
     │ done
     ▼
"Fixed. Tests pass.
 Committed to main."

One gives you instructions. The other does the work.

Every agent you’ve heard of — Claude Code, Codex, Cursor, Devin — runs this same loop. Think, act, observe, repeat. That’s it. People call it the ReAct pattern (Reason + Act), but it’s really just a while loop.
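
Stripped down, that loop really is only a few lines. Here's a conceptual sketch (callModel, runTool, and the Step shape are placeholders I made up, not any real SDK; a real, runnable version comes further down):

// Conceptual sketch of the think/act/observe loop.
// callModel and runTool are placeholders, not a real API.
type Step = { done: boolean; answer?: string; toolCall?: string };

declare function callModel(history: string[]): Promise<Step>; // "Think"
declare function runTool(toolCall: string): Promise<string>;  // "Act"

async function agentLoop(goal: string): Promise<string> {
  const history: string[] = [goal];
  while (true) {
    const step = await callModel(history);                   // Think: decide the next action
    if (step.done) return step.answer ?? "";                 // nothing left to do
    const observation = await runTool(step.toolCall ?? "");  // Act: run the tool it asked for
    history.push(observation);                               // Observe: feed the result back in
  }
}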

Say you ask an agent: “Find all TypeScript files with unused imports and remove them.” Here’s what actually happens under the hood:

Think
"I need to find .ts files first."
Act
glob("src/**/*.ts")
Observe
Found 47 files.
Think
"Now I'll check each file for unused imports.
Let me start with src/utils/auth.ts."
Act
read("src/utils/auth.ts")
Observe
Line 2: import { hash } from 'bcrypt' ← never used
Act
edit("src/utils/auth.ts", remove line 2)
Think
"Removed. Let me verify it still compiles."
Act
run("npx tsc --noEmit")
Observe
No errors. Move to next file.
... repeat for 46 more files ...
Think
"All files checked. 12 unused imports removed.
Everything compiles. Done."

Notice it’s not running all these steps in advance. It decides each step based on what it saw in the previous one. That’s the key difference between an agent and a script. A script follows a fixed path. An agent adapts.
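
One way to see that adaptation is to look at what the model is actually given on each turn. Roughly (the tool names and message shapes here are illustrative, not any specific SDK's format), the conversation grows like this:

// Illustrative only: the history grows with every tool result,
// and each new decision is made against everything so far.
const history = [
  { role: "user", content: "Find all TypeScript files with unused imports and remove them." },
  { role: "assistant", content: "tool call: glob('src/**/*.ts')" },
  { role: "user", content: "tool result: found 47 files" },
  { role: "assistant", content: "tool call: read('src/utils/auth.ts')" },
  { role: "user", content: "tool result: line 2 imports 'hash' from 'bcrypt', never used" },
  // ...and so on; each step's choice depends on what the previous step returned
];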

Now here’s the thing — an agent is only as useful as its tools. Without tools, it’s just a chatbot with extra steps. The model itself doesn’t actually touch your filesystem. It outputs a structured tool call — basically saying “hey, I want to read this file” — and the system around it does the actual work and passes the result back.

┌──────────────────────────────────┐
│            AI MODEL              │
│                                  │
│   "I need to read a file"        │
│         │                        │
│         ▼                        │
│   tool_call: read_file           │
│   args: { path: "src/auth.ts" }  │
└──────────────┬───────────────────┘
               │
               ▼
┌──────────────────────────────────┐
│          TOOL EXECUTOR           │
│                                  │
│   read_file("src/auth.ts")       │
│         │                        │
│         ▼                        │
│   returns file contents          │
└──────────────┬───────────────────┘
               │
               ▼
┌──────────────────────────────────┐
│            AI MODEL              │
│                                  │
│   "Now I can see the bug on      │
│    line 42. Let me fix it."      │
└──────────────────────────────────┘
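
With Anthropic's API, that structured tool call is literally a small JSON block in the model's response, and the result goes back as another block on the next turn. Roughly (the id here is made up for illustration):

// What the model emits: a tool_use content block
const toolUse = {
  type: "tool_use",
  id: "toolu_abc123",               // made-up id; the API generates these
  name: "read_file",
  input: { path: "src/auth.ts" },
};

// What the executor sends back: a tool_result block referencing that id
const toolResult = {
  type: "tool_result",
  tool_use_id: "toolu_abc123",
  content: "(contents of src/auth.ts)",
};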

Common tools agents use:

Tool           What it does
read_file      Read a file from disk
write_file     Write or edit a file
run_command    Execute a shell command
web_search     Search the internet
list_files     List directory contents
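
Each of these is handed to the model the same way: a name, a plain-English description, and a JSON schema for its inputs. A read_file declaration might look roughly like this (a sketch; the working example below declares run_command in exactly this shape):

// Sketch of a read_file tool definition, in the same format
// the run_command tool uses in the full example further down
const readFileTool = {
  name: "read_file",
  description: "Read a file from disk and return its contents",
  input_schema: {
    type: "object" as const,
    properties: {
      path: { type: "string", description: "Path to the file to read" },
    },
    required: ["path"],
  },
};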

Different agents give the model different amounts of freedom. Some ask permission before every action. Some run fully autonomously and just show you the result. The loop is the same — the leash is what changes.
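
That leash is easy to sketch in code. Assuming a Node environment, a confirmation gate like this could sit in front of every tool call (the policy and wording are made up; node:readline/promises is real):

import * as readline from "node:readline/promises";

// Hypothetical permission gate: ask before any tool actually runs.
// Call this right before executing a tool call; if it returns false,
// report "user denied this action" back to the model instead of running it.
async function confirmAction(description: string): Promise<boolean> {
  const rl = readline.createInterface({ input: process.stdin, output: process.stdout });
  const answer = await rl.question(`Agent wants to: ${description}. Allow? [y/N] `);
  rl.close();
  return answer.trim().toLowerCase() === "y";
}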

You can build one yourself. It’s simpler than you’d think. Here’s a working agent in TypeScript — it takes a goal, calls Claude, and loops until the model says it’s done.

agent.ts
import Anthropic from "@anthropic-ai/sdk";
import { execSync } from "child_process";

const client = new Anthropic();

const tools: Anthropic.Tool[] = [
  {
    name: "run_command",
    description: "Run a shell command and return the output",
    input_schema: {
      type: "object" as const,
      properties: {
        command: { type: "string", description: "The shell command to run" },
      },
      required: ["command"],
    },
  },
];

async function agent(goal: string) {
  const messages: Anthropic.MessageParam[] = [
    { role: "user", content: goal },
  ];

  while (true) {
    const response = await client.messages.create({
      model: "claude-sonnet-4-5-20250929",
      max_tokens: 1024,
      tools,
      messages,
    });

    // If the model responds with text and no tool calls, it's done
    if (response.stop_reason === "end_turn") {
      const text = response.content.find((b) => b.type === "text");
      return text?.text;
    }

    // Otherwise, execute each tool call
    const toolResults: Anthropic.ToolResultBlockParam[] = [];
    for (const block of response.content) {
      if (block.type === "tool_use") {
        const input = block.input as { command: string };
        const result = execSync(input.command, { encoding: "utf-8" });
        toolResults.push({
          type: "tool_result",
          tool_use_id: block.id,
          content: result,
        });
      }
    }

    // Feed results back into the conversation
    messages.push({ role: "assistant", content: response.content });
    messages.push({ role: "user", content: toolResults });
  }
}

agent("What's the latest version of @anthropic-ai/sdk on npm? Check using curl.");

The whole thing is a while(true) loop. Call the model, check if it wants to use a tool, execute the tool, feed the result back. That’s an agent.

I ran this on my machine with npx tsx agent.ts and watched it work:

npx tsx agent.ts
🎯 Goal: What's the latest version of @anthropic-ai/sdk on npm? Check using curl.
Step 1: Think
"I'll check the latest version of @anthropic-ai/sdk on npm using curl."
Step 2: Act
$ curl -s https://registry.npmjs.org/@anthropic-ai/sdk/latest | grep -o '"version":"[^"]*"' | head -1
Step 3: Observe
"version":"0.72.0"
✅ Done: The latest version of @anthropic-ai/sdk on npm is 0.72.0.
agent("Count .ts files")
┌─────────────┐
│ Call model │ ◄──────────────────┐
└──────┬──────┘ │
│ │
▼ │
┌──────────────┐ tool_use? │
│Check response│───── yes ──► Execute tool
└──────┬───────┘ │
│ no │
│ (end_turn) Feed result back
Return final answer

In production you’d add error handling, timeouts, and permission checks. But the core is always this loop.
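
A sketch of what some of that hardening could look like, keeping the same structure (the 30-second timeout is an arbitrary number, not a recommendation):

import { execSync } from "child_process";

// Sketch: run a tool command with a timeout, and turn failures into
// observations the model can react to instead of crashing the loop.
function runCommandSafely(command: string): string {
  try {
    return execSync(command, { encoding: "utf-8", timeout: 30_000 }); // kill after 30s
  } catch (err) {
    return `Command failed: ${(err as Error).message}`; // feed the error back as text
  }
}

Capping the loop itself, for example swapping while (true) for a for loop with a fixed step limit, covers the runaway-retry case described below.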

Agents aren’t magic though. They fail in ways you’ll recognize pretty quickly.

They loop forever. The agent edits a file, runs the test, it fails, edits it back, runs the test again. Over and over. Good agents have a retry limit.

They pick the wrong tool confidently. The model decides to delete a file when it should have edited it. This is why every serious agent has a permission system.

They forget what they were doing. Long tasks generate tons of tool results. Eventually the conversation gets so long the model loses track of the original goal. The best agents summarize as they go to keep context manageable.

They compound small mistakes. Step 3 is slightly off. Step 7 builds on it. By step 15, the agent has built an elaborate wrong solution and feels great about it. Shorter feedback loops help — verify after each step, not after 50.

Agents work best when the task is clear and there’s a way to check the result. “Fix the failing test” is perfect — the agent can run the test to verify. “Make the UI look better” is terrible — it can’t see what the UI looks like.


Found this article helpful? You might enjoy my free newsletter. I share dev tips and insights to help you grow your coding skills and advance your tech career.



This article was originally published on https://www.trevorlasn.com/blog/ai-agents-explained. It was written by a human and polished using grammar tools for clarity.