The web architectural landscape is shifting beneath our feet. As we transition from an internet browsed primarily by human-operated clients (like Chrome, Firefox, or Safari) to one traversed by autonomous, intelligent agents, the ability to discern who or what is requesting our content has never been more critical. In this new era, Agentic SEO is not just about keyword optimization or semantic HTML; it is fundamentally about context awareness. We need to know who is looking at our static HTML pages to provide the most optimized, relevant, or perhaps cloaked, experience.
Historically, the frontend has been flying blind. While server-side technologies have always had intimate access to the HTTP request headers—the metadata sent by the client before any content is exchanged—client-side JavaScript is deliberately isolated from this information. This isolation was designed for security and privacy, but it creates a significant blind spot for frontend developers and SEO engineers trying to implement Level 1 Agentic Cloaking or other specialized behaviors on static hosting platforms.
Today, we are changing that. We are thrilled to introduce a simple, robust solution: the MCP-SEO HTTP Request Headers API, available at https://tools.mcp-seo.com/headers/. This tool bridges the gap, allowing your static HTML pages to peer into the network layer and inspect the exact HTTP headers that accompanied the request.
In this comprehensive guide, we will explore what this API does, how it works under the hood, and crucially, how you can use it to recognize the “signature” fields of agentic browsers, particularly those leveraging the emerging WebBotAuth standards.
The Frontend Blind Spot: Why JavaScript Needs Help
To understand the value of this API, we must first understand the limitation it circumvents. When a browser (or a bot) requests a webpage, it sends an HTTP request. This request contains a URI and a set of HTTP request headers. These headers contain vital information about the client: the User-Agent, Accept-Language, Referer, and increasingly, specialized headers indicating bot authorization protocols like WebBotAuth.
Server-side environments (Node.js, Python, PHP) can read these headers easily because they process the raw incoming HTTP request. However, once the server responds with HTML and the browser executes the embedded JavaScript, the raw request headers are gone. The window.navigator object provides a sanitized, reconstructed view of some of this data (like navigator.userAgent), but it is incomplete, easily spoofed, and crucially, it does not reflect custom HTTP headers injected by agentic crawlers.
As developers, we are told to rely on APIs like the Fetch API to manage networking, but standard fetch or XMLHttpRequest cannot inspect the headers of the document’s original request. They can only inspect the response headers of new requests they initiate. You can read more about these limitations in MDN’s HTTP headers documentation.
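To make the asymmetry concrete, here is a minimal sketch (runnable in modern browsers and in Node 18+, which both expose the Fetch API globals): a Response object carries its own headers, but no analogous object exists for the page’s original navigation request.

```javascript
// A Response exposes its own headers, i.e. the response headers of a
// request the page initiated. There is no analogous object for the
// request headers of the original document navigation.
const res = new Response("ok", {
  headers: { "content-type": "text/plain" },
});

console.log(res.headers.get("content-type")); // "text/plain"

// Something like document.requestHeaders does not exist: the User-Agent,
// Accept-Language, and any custom agent headers sent with the page's own
// request are unreachable from client-side JavaScript.
```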
This meant that static websites—those hosted on CDNs, GitHub Pages, Amazon S3, or Firebase Hosting without server-side rendering (SSR) or edge functions—had no way to definitively check custom request headers. If an agentic browser sent an X-Agent-Capability: RAG-Ingestion header, your static HTML page would be completely oblivious.
Introducing the MCP-SEO HTTP Request Headers API
The solution is an elegant workaround: an external API that echoes the headers back to the requester. You can interact with the live demonstration of this tool at our tools portal.
What It Does
The API operates exclusively via a GET request to /api/headers. When invoked, it reads the HTTP request headers sent by the client, sanitizes them, and returns them as a JSON payload.
Because we want this to be usable from any origin—meaning you can embed it on your own static site, completely separate from our infrastructure—the API supports both standard Cross-Origin Resource Sharing (CORS) and JSON with Padding (JSONP). You can read about CORS on MDN’s CORS documentation.
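Where plain CORS suffices, a minimal client is a thin wrapper around fetch. The sketch below is our own (the function name and the error handling are not part of the API):

```javascript
// Minimal CORS-based client for the echo endpoint. Because the API sends
// Access-Control-Allow-Origin: *, a plain GET works from any origin where
// cross-origin fetches are not blocked.
async function fetchRequestHeaders(endpoint = "https://tools.mcp-seo.com/api/headers") {
  const res = await fetch(endpoint);
  if (!res.ok) throw new Error(`Headers API returned ${res.status}`);
  const data = await res.json();
  return data.headers; // plain object of lowercase header names -> values
}
```

Call it once at startup and reuse the resolved object; the client’s identity headers will not change mid-session.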
The Endpoint and Response Structure
The endpoint is located at https://tools.mcp-seo.com/api/headers. A typical response looks like this:
{
"headers": {
"accept": "*/*",
"accept-encoding": "gzip, deflate, br",
"accept-language": "en-US,en;q=0.9",
"host": "tools.mcp-seo.com",
"user-agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64)...",
"sec-ch-ua": "\"Chromium\";v=\"122\", \"Not(A:Brand\";v=\"24\", \"Google Chrome\";v=\"122\"",
"signature": "sig1=:...:",
"signature-agent": "https://example-agent.com/bot"
},
"note": "These are the HTTP request headers sent by your browser for this request."
}
The server explicitly strips out sensitive and structurally irrelevant headers (like Cookies, internal infrastructure routing headers, or sensitive Authorization tokens not meant for public mirroring) to maintain strict security protocols. What remains is a clean, reliable dictionary of the client’s self-reported network metadata.
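For illustration, that stripping step can be sketched as a pure function. The blocklist below is our assumption; the production service may filter additional infrastructure headers:

```javascript
// Sketch of the sanitization an echo endpoint performs before mirroring
// headers back to the client. The blocklist here is illustrative only.
const BLOCKED = new Set(["cookie", "authorization", "x-forwarded-for", "x-real-ip"]);

function sanitizeHeaders(rawHeaders) {
  const clean = {};
  for (const [name, value] of Object.entries(rawHeaders)) {
    const key = name.toLowerCase();
    if (!BLOCKED.has(key)) clean[key] = value; // keep only safe, public metadata
  }
  return clean;
}
```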
The Magic of JSONP: Bypassing CORS Restrictions
While the API allows standard fetch() calls by returning Access-Control-Allow-Origin: *, relying solely on fetch or standard XMLHttpRequest can sometimes be fragile in highly restricted enterprise environments or specific agentic sandboxes.
To guarantee cross-origin compatibility without relying on CORS preflight requests, the API supports JSONP (JSON with Padding). By passing a callback query parameter, the API wraps the JSON response in a JavaScript function call. You can read up on the history of JSONP at Wikipedia’s JSONP page.
Instead of returning raw JSON, the server returns executable JavaScript. If you request /api/headers?callback=processHeaders, the response is:
/**/ typeof processHeaders === 'function' && processHeaders({
"headers": {
"accept": "*/*",
"user-agent": "..."
},
"note": "..."
});
Implementing in Static HTML
This is a neat workaround that makes integration trivially simple for static HTML pages. You define a function in your global scope, and then inject a <script> tag pointing to our API.
<!-- Define your callback function -->
<script>
function processHeaders(data) {
if (data.error) {
console.error(data.error);
return;
}
const headers = data.headers;
const isAgent = Boolean(headers['signature'] || headers['signature-agent'] || headers['signature-input']);
if (isAgent) {
console.log("Agent detected. Serving optimized DOM.");
// Initialize agent-specific logic
} else {
console.log("Human detected. Proceeding with standard rendering.");
}
}
</script>
<!-- Inject the script tag to call the API -->
<script src="https://tools.mcp-seo.com/api/headers?callback=processHeaders"></script>
When the browser parses the second <script> tag, it makes a standard resource request. The API responds with the wrapped JSON data, executing processHeaders(data) immediately upon download. No CORS errors. No complex asynchronous state management required.
Agentic Browsers and the WebBotAuth Standard
Why is inspecting request headers so critical in 2026? The answer lies in the proliferation of Agentic Browsers and the adoption of the WebBotAuth standard.
An Agentic Browser is not merely a crawler (like traditional Googlebot). It is a headless, JavaScript-executing environment controlled by a Large Language Model (LLM) or a specialized autonomous agent (like OpenClaw). These agents do not just index text; they interact, they infer, and they execute tasks on behalf of users. For an overview of Google’s crawler capabilities, refer to Google Search Central documentation on crawlers.
The Problem with User-Agent Strings
Relying on the User-Agent string to identify these agents is a losing battle. The User-Agent string is notoriously messy, frequently spoofed, and often deliberately disguised to mimic standard consumer browsers (Chrome, Edge, Safari) to bypass rudimentary anti-bot protections.
If an agent wants to read your article, it might send a User-Agent that looks exactly like an iPhone 15 running Safari. Your static site would serve it the mobile, ad-heavy experience, which is sub-optimal for LLM ingestion. We discussed the impact of this in our guide on Semantic HTML as LLM Training Fuel. A clean, semantic DOM is explicitly preferred.
Enter WebBotAuth
To solve this, the industry is increasingly adopting verifiable signature fields, conceptually grouped under the umbrella of “WebBotAuth.” Compliant agents inject specific, cryptographically signed HTTP headers into their requests. This allows them to prove their identity and intent to the server (or in our case, to the frontend via the API), without relying on easily forgeable User-Agent strings.
By inspecting these headers, a site owner can confidently enable Level 1 Agentic Cloaking: serving a heavy, interactive, widget-filled React application to the human, and a clean, semantic, markdown-like DOM to the verified agent.
Key WebBotAuth Signature Fields
Here are the primary HTTP headers to look for when identifying agentic traffic, which our API makes accessible to your frontend context. WebBotAuth specifically uses only three standard HTTP request headers, and the presence of any of these fields is usually a sign that it’s an agentic request.
| HTTP Header Field | Description | Typical Value Example | Verification Method |
|---|---|---|---|
| Signature | Contains the actual cryptographic signatures validating the agent’s identity. | sig1=:...: | Usually verified server-side, but its presence implies a formal bot. |
| Signature-Input | Specifies the parameters and components used to construct the signature. | sig1=("@method" "@target-uri");created=... | Contextual hint for signature verification. |
| Signature-Agent | Often contains the URL of the agent making the request, pointing to its documentation or origin. | https://example-agent.com/bot | Checked against a known registry of agent URLs. |
Using the MCP-SEO HTTP Request Headers API, checking for these fields becomes a trivial exercise in JavaScript.
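Beyond a simple presence check, a frontend can pull structure out of these fields. The parser below is an illustrative simplification: real Signature-Input values are HTTP Structured Fields (per the HTTP Message Signatures specification), and this regex handles only the common single-signature case.

```javascript
// Illustrative parser for a Signature-Input value such as:
//   sig1=("@method" "@target-uri");created=1700000000
// Extracts the signature label, the covered components, and the numeric
// `created` parameter. Not a full Structured Fields parser.
function parseSignatureInput(value) {
  const match = /^(\w+)=\(([^)]*)\)(.*)$/.exec(value.trim());
  if (!match) return null;
  const [, label, componentList, params] = match;
  const components =
    componentList.match(/"[^"]*"/g)?.map((s) => s.slice(1, -1)) ?? [];
  const created = /;created=(\d+)/.exec(params)?.[1];
  return { label, components, created: created ? Number(created) : null };
}
```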
Building a Robust Frontend Detection Script
Let’s elevate our previous example. We want to create a reusable utility for our static site that gracefully fetches headers, caches them, and exposes a clean asynchronous API for querying specific fields.
We can create a RequestHeaders class. This is identical to the convenience wrapper provided on the API documentation page, but we will expand upon its practical utility.
class RequestHeaders {
constructor(endpoint = "https://tools.mcp-seo.com/api/headers") {
this.endpoint = endpoint;
this._promise = null;
}
fetch() {
if (!this._promise) {
this._promise = new Promise((resolve, reject) => {
const callbackName = 'headersCallback_' + Math.round(1000000 * Math.random());
window[callbackName] = function(data) {
delete window[callbackName];
if (data.error) return reject(new Error(data.error));
resolve(data.headers);
};
const script = document.createElement('script');
const url = new URL(this.endpoint);
url.searchParams.set('callback', callbackName);
script.src = url.toString();
script.onerror = () => reject(new Error('Failed to load headers API'));
document.head.appendChild(script);
});
}
return this._promise;
}
async get(name) {
try {
const headers = await this.fetch();
return headers[name.toLowerCase()] ?? null;
} catch(e) {
console.warn("Headers API unavailable, assuming default context.");
return null;
}
}
}
Applying the Detection Logic
Now, let’s use this utility to implement our routing logic. If we detect an agent, we might want to bypass a heavy client-side rendering pipeline (like loading a massive Three.js canvas or a complex single-page application router) and instead immediately display a <div id="semantic-core"> that we shipped in the initial HTML payload.
// Initialize the utility
const headersInfo = new RequestHeaders();
async function bootstrapApplication() {
// 1. Check for formal WebBotAuth signatures
const signatureAgent = await headersInfo.get('Signature-Agent');
const hasSignature = await headersInfo.get('Signature') !== null;
const hasSignatureInput = await headersInfo.get('Signature-Input') !== null;
if (signatureAgent || hasSignature || hasSignatureInput) {
console.info(`[Agentic Traversal Detected] Agent: ${signatureAgent || 'Verified Bot'}`);
enableAgentMode();
return;
}
// 2. Fallback heuristics for older bots
  const userAgent = await headersInfo.get('User-Agent');
  const secChUa = await headersInfo.get('Sec-CH-UA');
  const isHeadless = Boolean(secChUa && secChUa.includes('Headless'));
  if (isHeadless || /bot|crawler|spider|crawling/i.test(userAgent || '')) {
    console.info("[Legacy Bot Detected] Applying semantic fallback.");
enableAgentMode();
return;
}
// 3. Human traffic logic
console.info("[Human Context] Bootstrapping primary application.");
enableHumanMode();
}
function enableAgentMode() {
// Hide heavy UI elements, unhide semantic text blocks
document.getElementById('heavy-app-root').style.display = 'none';
document.getElementById('semantic-core').style.display = 'block';
// Send metrics denoting an agent view
}
function enableHumanMode() {
// Proceed with React/Vue/Svelte mounting
document.getElementById('heavy-app-root').style.display = 'block';
// import('./main.js').then(...)
}
// Execute the bootstrapper
bootstrapApplication();
In this architecture, the static HTML files served from your CDN contain both the lightweight semantic text (ideal for LLMs) and the mounting point for the heavy JavaScript application. The initial JavaScript execution quickly calls our HTTP Request Headers API, determines the context, and selectively unhides or mounts the appropriate experience.
This is Level 1 Agentic Cloaking achieved purely on the frontend, completely independent of server-side infrastructure.
Technical Details: Inspecting the Headers
Understanding exactly what headers are available is crucial for mastering this logic. While simple checks against user agents are a starting point, analyzing the full picture gives you much more control. Agentic browsers might omit common browser-specific headers, such as Accept-Language, Accept-Encoding, or provide distinct formats for Sec-CH-UA. This absence of human-typical data can form a fingerprint of its own.
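That absence can be scored mechanically. The helper below is a sketch; the list of “human-typical” headers is our own assumption, not a standard:

```javascript
// Absence-based fingerprint: report which human-typical headers are
// missing from the echoed header set. The list is illustrative only.
const HUMAN_TYPICAL = ["accept-language", "accept-encoding", "sec-ch-ua"];

function missingHumanHeaders(headers) {
  return HUMAN_TYPICAL.filter((name) => !(name in headers));
}
```

A long result from this function is a weak hint of non-human traffic; it should supplement, never replace, the signature checks above.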
Furthermore, analyzing headers like signature and signature-agent allows developers to leverage complex authentication patterns. Suppose you run a highly secure portal that requires bots to register. The bot uses WebBotAuth to self-attest its authorization by providing an endpoint in signature-agent, and the static HTML can verify that signature through an API call before revealing sensitive DOM elements. This flips the architectural model—moving authorization checks from Edge/Server directly into static, decentralized environments.
In addition, consider the rate limits that might be imposed by agent networks. By capturing the headers directly in the frontend and storing them lightly in session storage, you prevent repeated hits to verification APIs on subsequent internal links clicked by the agent. By taking control of the data feed natively in JavaScript, you give yourself immense flexibility without being tethered to a serverless function structure.
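A sketch of that caching pattern follows, with an in-memory fallback so it also runs outside a browser. The storage key is arbitrary:

```javascript
// Cache the echoed headers for the session so subsequent internal
// navigations skip the network round-trip. Falls back to an in-memory
// Map where sessionStorage is unavailable or blocked.
const memoryStore = new Map();
const CACHE_KEY = "mcp-seo-headers"; // arbitrary key, our choice

function cacheHeaders(headers) {
  const json = JSON.stringify(headers);
  try {
    sessionStorage.setItem(CACHE_KEY, json);
  } catch {
    memoryStore.set(CACHE_KEY, json); // non-browser or storage-blocked context
  }
}

function readCachedHeaders() {
  let json = null;
  try {
    json = sessionStorage.getItem(CACHE_KEY);
  } catch {
    json = memoryStore.get(CACHE_KEY) ?? null;
  }
  return json ? JSON.parse(json) : null;
}
```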
Caveats and Contextual Differences
While this API is incredibly powerful, it is vital to understand the nuances of the network layer. The headers returned by /api/headers are the headers sent by the browser during the fetch/script request to tools.mcp-seo.com, not the overarching navigation request for your static HTML page.
In 99% of cases, this distinction is irrelevant. Core identity headers—like User-Agent, Accept-Language, and custom injected WebBotAuth headers—are appended by the browser or agent to all outgoing requests within that browsing context. Therefore, the headers reflected by our API will accurately represent the client.
However, a few headers are context-specific:
- Accept: The browser’s Accept header for a <script> tag request (*/*) will differ from the Accept header for a top-level page navigation (text/html,application/xhtml+xml...).
- Referer / Sec-Fetch-Site: If your site example.com calls tools.mcp-seo.com, the Referer will be example.com, and Sec-Fetch-Site will denote a cross-site request. You cannot use this API to discover the Referer of the user who clicked a link to arrive at your site. For that, you must use the standard document.referrer property in JavaScript.
- Cookies and Authentication: As noted in the API documentation, cross-origin requests do not transmit cookies for the target domain unless explicitly configured with credentials (which our generic API does not support, for security and caching reasons). This ensures the API remains stateless and privacy-compliant. As discussed in our article on ethical scraping, maintaining data hygiene is paramount.
The Future of Frontend Context
The web is bifurcating. There is the visual web designed for human retinas, and the semantic web designed for vector embeddings and autonomous ingestion. For years, static deployments forced developers to choose one over the other in the initial payload, leading to either bloated HTML (bad for humans on slow connections) or JavaScript-only rendering (terrible for models that lack advanced DOM parsing capabilities).
The MCP-SEO HTTP Request Headers API breaks this dichotomy. By giving your frontend JavaScript the ability to see the invisible network layer, you can create truly context-aware static sites. You can build ecosystems that are beautiful and interactive for humans, while simultaneously acting as high-density, semantic data feeds for the agents of the future.
As standards like WebBotAuth and protocols like WebMCP gain traction, the necessity of context-aware routing will only grow. We built this API to give you the tools you need today to build the architecture of tomorrow.
Integrate the API, check the headers, and ensure your site is speaking the right language to whoever—or whatever—is listening. This represents the ultimate step in creating fluid, agent-ready environments without sacrificing the static hosting architectures we all love.