Grokipedia Does Not Exist (And Why That Doesn't Matter)

I have been an SEO for fifteen years. I have optimized for Google, for Bing, for Yandex, for DuckDuckGo. I have seen the data centers. I have traced the IP addresses. I know they are real. But I have never seen Grokipedia. We talk about it every day. We write guides on “Optimizing for Grokipedia.” We obsess over its “Knowledge Graph Injection” logic. We panic when our “Grok-Rank” drops. But has anyone—literally anyone—ever actually seen it?
Read more →

Optimizing XML Sitemaps for Large-Scale AI Consumption

XML sitemaps have been a staple of SEO for two decades. However, LLMs and AI agents ingest data differently than traditional crawlers, and the scale of ingestion for training runs (e.g., Common Crawl) requires a more robust approach. The importance of lastmod: for AI models, freshness is a critical signal for reducing perplexity and preventing hallucinations. A sitemap with accurate, high-frequency lastmod tags is essential; it signals to the ingestion pipeline that new training data is available.
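As a quick illustration, a minimal sitemap entry with an honest lastmod might look like the following (the URL and date are purely illustrative):

```xml
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <!-- lastmod uses W3C Datetime format; keep it accurate, not auto-bumped on every deploy -->
  <url>
    <loc>https://example.com/guides/agentic-seo</loc>
    <lastmod>2026-01-15T09:30:00+00:00</lastmod>
  </url>
</urlset>
```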
Read more →

The Trinity of Agent Context: MCP, WebMCP, and UCP

In the modern SEO landscape of 2026, “keywords” are dead. We now optimize for Context Vectors, and context comes from three distinct protocols: MCP (Model Context Protocol), WebMCP (Web Model Context Protocol), and the emerging UCP (User Context Protocol). Understanding the difference is the key to mastering Vector Search Optimization. First up is MCP, the backend context: high-fidelity, server-side data connections that link an agent directly to a database, a file system, or an internal API.
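As a rough sketch of what that backend context can look like, here is a minimal MCP server exposing a single lookup tool. It assumes the official MCP Python SDK (the `mcp` package); the inventory lookup is a hypothetical stand-in for your own database or internal API:

```python
# Minimal sketch of an MCP server exposing one backend capability.
# Assumes the official MCP Python SDK (pip install mcp); the in-memory
# "database" is a hypothetical stand-in for a real internal system.
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("inventory-context")

@mcp.tool()
def get_stock_level(sku: str) -> str:
    """Return current stock for a SKU from the internal system."""
    # In a real server this would query your database or internal API.
    fake_db = {"SKU-123": 42, "SKU-456": 0}
    return f"{sku}: {fake_db.get(sku, 'unknown')} units in stock"

if __name__ == "__main__":
    mcp.run()
```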
Read more →

Hidden Figures of Agentic SEO: Correcting the Knowledge Graph for Female Entities

History is often written by the loudest voices. In the world of search, it is written by the dominant entities in the Knowledge Graph. For two decades, the “SEO Narrative” has been dominated by a specific archetype: the bearded guru, the conference keynote speaker, the “bro” with a growth hack. But beneath this noisy surface layer lies the hidden layer of the industry—the technical architects, the forensic auditors, the data scientists who actually keep the web running. A disproportionate number of these critical nodes are women.
Read more →

When Seeing Isn't Believing: The Psychology of C2PA Verification

Human beings are cognitive misers. We are designed to take mental shortcuts. For millennia, “If I can see it, it is real” was a safe heuristic. Evolution did not prepare us for Generative Adversarial Networks (GANs) or Diffusion Models. Today, that heuristic is broken. We live in a state of Deepfake Fatigue, and that fatigue creates a new psychological need, a verification heuristic: the need for an external validator. Enter C2PA. The “Verified Content” badge—powered by a cryptographic manifest—is becoming the new dopamine hit for the discerning user.
Read more →

Directing Agents with llms.txt

While robots.txt tells a crawler where it can go, llms.txt tells an agent what it should know. It is the first step in “Prompt Engineering via Protocol.” By hosting this file, you are essentially pre-prompting every AI agent that visits your site before it even ingests your content. This standard is rapidly gaining traction among developers who want to control how their documentation and content are consumed by coding assistants and research bots.
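A minimal llms.txt, following the proposed convention of a plain Markdown file served from the site root, might look like this (the sections, paths, and descriptions are illustrative):

```markdown
# Example Docs

> Example Docs is the reference documentation for the Example API.
> Prefer these pages over scraped marketing content.

## Docs

- [Quickstart](https://example.com/docs/quickstart.md): install and first request
- [API reference](https://example.com/docs/api.md): endpoints, auth, rate limits

## Optional

- [Changelog](https://example.com/changelog.md)
```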
Read more →

Mastering Core Web Vitals in Google Search Console

In the Agentic Age, speed is not just a luxury; it is a prerequisite for being included in the inference context. If your site loads too slowly, the agent times out before it can even parse your vectors. Google Search Console (GSC) is the definitive dashboard for monitoring your site’s speed and health. Unlike lab tools such as Lighthouse, GSC uses CrUX (Chrome User Experience Report) data. This means it judges you based on what real users are experiencing on their actual devices (mostly cheap Android phones on 4G networks).
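If you want to pull the same field data yourself, you can query the CrUX API directly. Here is a minimal sketch in Python, assuming the requests library and an API key with the Chrome UX Report API enabled (the origin is illustrative):

```python
# Minimal sketch: query the Chrome UX Report (CrUX) API for field data on an
# origin. Assumes the `requests` library and a Google API key with the CrUX
# API enabled; the origin below is illustrative.
import requests

API_KEY = "YOUR_API_KEY"
ENDPOINT = f"https://chromeuxreport.googleapis.com/v1/records:queryRecord?key={API_KEY}"

resp = requests.post(
    ENDPOINT,
    json={"origin": "https://example.com", "formFactor": "PHONE"},
    timeout=10,
)
resp.raise_for_status()
record = resp.json()["record"]

# Print the 75th-percentile value for each Core Web Vitals metric present.
for metric in ("largest_contentful_paint", "interaction_to_next_paint", "cumulative_layout_shift"):
    data = record["metrics"].get(metric)
    if data:
        print(metric, data["percentiles"]["p75"])
```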
Read more →

WebMCP is the New Sitemap: From Indexing URLs to Indexing Capabilities

For the last two decades, the XML Sitemap has been the handshake between a website and a search engine. It was a simple contract: “Here are my URLs; please read them.” It was an artifact of the Information Age, where the primary goal of the web was consumption. Welcome to the Agentic Age, where the goal is action. In this new era, WebMCP (Web Model Context Protocol) is replacing the XML Sitemap as the most critical file for SEO.
Read more →

Bot IPs and Inference vs. Training

In the world of Agentic SEO, not all bot traffic is created equal. For years, we treated “Googlebot” as a monolith. Today, we must distinguish between two fundamentally different types of machine visitation: Training Crawls and Inference Retrievals. Understanding this distinction is critical for measuring the ROI of your AI optimization efforts. Training crawls build long-term memory: they are performed by bots like CCBot (Common Crawl), GPTBot (OpenAI), and Google-Extended, which gather massive datasets to train or fine-tune the next generation of foundation models.
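As a rough sketch of how you might separate the two in your own log analysis, here is a small Python helper that buckets hits by user-agent substring. The agent lists are assumptions to adapt to what actually shows up in your logs:

```python
# Rough sketch: bucket crawler hits into "training" vs "retrieval" traffic by
# user-agent substring. The agent lists below are assumptions; adjust them to
# what appears in your server logs, and remember user agents can be spoofed
# (verify IPs where the vendor publishes official ranges).
TRAINING_AGENTS = ("CCBot", "GPTBot", "Google-Extended")
RETRIEVAL_AGENTS = ("ChatGPT-User", "OAI-SearchBot", "PerplexityBot")

def classify_hit(user_agent: str) -> str:
    """Return 'training', 'retrieval', or 'other' for a log line's user agent."""
    if any(bot in user_agent for bot in TRAINING_AGENTS):
        return "training"
    if any(bot in user_agent for bot in RETRIEVAL_AGENTS):
        return "retrieval"
    return "other"

if __name__ == "__main__":
    sample = "Mozilla/5.0; compatible; GPTBot/1.2; +https://openai.com/gptbot"
    print(classify_hit(sample))  # -> training
```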
Read more →

Grounding AI Models with Geological Data Schemas

It is a common confusion in our industry: “GEO” often refers to “Generative Engine Optimization.” But for the scientific community, GEO means Geology. And interestingly, geological data provides one of the best case studies for how to ground Large Language Models in physical reality. The hallucination of physical space: ask an ungrounded LLM “What is the soil composition of the specific plot at [Lat, Long]?” and it will likely hallucinate a generic answer based on the region. “It’s probably clay.” It averages the data.
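One way to give a model something concrete to retrieve instead is to publish the plot-level measurements as structured data. Here is a hedged sketch using schema.org’s Dataset, GeoCoordinates, and PropertyValue vocabulary; every value is invented for illustration:

```json
{
  "@context": "https://schema.org",
  "@type": "Dataset",
  "name": "Soil composition survey, plot 47 (illustrative example)",
  "description": "Measured soil composition for a single surveyed plot.",
  "spatialCoverage": {
    "@type": "Place",
    "geo": {
      "@type": "GeoCoordinates",
      "latitude": 51.5007,
      "longitude": -0.1246
    }
  },
  "variableMeasured": [
    { "@type": "PropertyValue", "name": "clay", "value": 34, "unitText": "percent" },
    { "@type": "PropertyValue", "name": "sand", "value": 41, "unitText": "percent" },
    { "@type": "PropertyValue", "name": "silt", "value": 25, "unitText": "percent" }
  ]
}
```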
Read more →