{"id":11149,"date":"2026-06-29T07:34:24","date_gmt":"2026-06-29T13:34:24","guid":{"rendered":"https:\/\/attentionmedia.io\/?p=11149"},"modified":"2026-06-29T07:34:24","modified_gmt":"2026-06-29T13:34:24","slug":"agentic-ai-is-rewriting-martech-economics-and-infrastructure","status":"publish","type":"post","link":"https:\/\/attentionmedia.io\/?p=11149","title":{"rendered":"Agentic AI is rewriting martech economics and infrastructure"},"content":{"rendered":"<div><img fetchpriority=\"high\" decoding=\"async\" width=\"800\" height=\"450\" src=\"https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/part1-hero-hermes-agent-800x450.png\" class=\"attachment-large size-large wp-post-image\" alt=\"\" \/><\/div>\n<p class=\"wp-block-paragraph\">Marketers embraced AI when it was priced like an all-you-can-eat buffet. Providers\u2019 shift to token-based pricing comes just as agentic workflows are becoming part of everyday marketing \u2014 and agents use many, many tokens. Martech\u2019s infrastructure needs to change if it\u2019s going to keep costs down amid growing demand.<\/p>\n<p class=\"wp-block-paragraph\">The moment AI connects to your business systems, the chatbot becomes much more powerful. Instead of answering one question at a time, it can pull customer records from your CRM, analyze campaign performance, search the web, and generate a personalized report in a single workflow. That\u2019s made possible by tool calling, which lets AI access external systems via APIs and Model Context Protocol (MCP) connections.<\/p>\n<p class=\"wp-block-paragraph\">The result is a huge productivity boost for marketers. AI can chain together multiple tools without requiring users to jump between applications. The catch is that every tool call consumes tokens. AI agents, in particular, use an incredible number of them because they pass the entire task history, their internal reasoning, and any external tool data back through the model at every step of their problem-solving loop.<\/p>\n<h2 class=\"wp-block-heading\">The token cap reality<\/h2>\n<p class=\"wp-block-paragraph\">Let\u2019s look at a real-world example of how that works.<\/p>\n<p class=\"wp-block-paragraph\">A typical daily pipeline \u2014 search 200 results, summarize them, generate five headline variations \u2014 can easily run 4,000 to 5,000 tokens or more per run. Over a 30-day month, that can reach well over 100,000 tokens, well past the free-tier limits on OpenAI, Anthropic, and similar platforms, and even enough to blow through a $20 subscription well before the month is over.<\/p>\n<figure class=\"wp-block-image size-large is-resized\"><img decoding=\"async\" width=\"800\" height=\"345\" src=\"https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42-800x345.png\" alt=\"\" class=\"wp-image-410434\" \/><\/figure>\n<p class=\"wp-block-paragraph\"><em>(All token estimates in this article are based on standard tokenization metrics used across the industry \u2014 the same method providers use to calculate your bill. They are rough projections, not exact measurements from a live pipeline, and actual usage will vary based on model, prompt structure, and output length.)<\/em><\/p>\n<h2 class=\"wp-block-heading\">Why Claude Cowork and similar workflows hit the wall<\/h2>\n<p class=\"wp-block-paragraph\">Unfortunately, there\u2019s no correlation between the amount of tokens used and the quality of the result. As Scott Brinker and Frans Riemersma note in the <a href=\"https:\/\/chiefmartec.com\/2026\/05\/2026-marketing-technology-landscape-supergraphic-peak-martech-achieved-maybe\/\">State of Martech 2026 report<\/a>, \u201cmore input does not automatically mean better output\u201d \u2014 but you are still paying for every bit of it. <\/p>\n<p class=\"wp-block-paragraph\">Claude Cowork and other tool-heavy environments make the problem visible fast. Every file read, every search, every API call adds a billable token interaction. Users who start the month with a $20 subscription often find themselves throttled by week two.<\/p>\n<p class=\"wp-block-paragraph\">The consequence is choosing between throttling your workflow or paying astounding overage fees. Neither is sustainable for a marketing team that needs to run pipelines daily.<\/p>\n<h2 class=\"wp-block-heading\">The answer is owned context, not a single provider<\/h2>\n<p class=\"wp-block-paragraph\">Fortunately, there is a solution: Keep the raw data under your control. Store it in a shared team database like PostgreSQL or Qdrant, in a cloud data warehouse like Snowflake or BigQuery, or in a folder in shared cloud storage \u2014 and use lightweight, non-LLM filtering logic to pull out relevant pieces before anything touches the model.<\/p>\n<p class=\"wp-block-paragraph\">Setting that up might involve an LLM once, the same way you might use an AI assistant to write a formula or a script. But once it is in place, it runs automatically on every batch of new data \u2014 and it does not call an LLM at all. Simple keyword scoring or vector similarity search, both orders of magnitude cheaper than an LLM call, rank the data by relevance.<\/p>\n<p class=\"wp-block-paragraph\">When a social listening pipeline pulls 500 tweets about a brand, the filtering step quietly selects the 10 most relevant ones and sends only those to the model. The token bill typically drops by 60% or more. The insight quality stays the same.<\/p>\n<h2 class=\"wp-block-heading\">Beyond the one-off agent<\/h2>\n<p class=\"wp-block-paragraph\">There are a number of tools that can do this type of filtering. The <a href=\"https:\/\/github.com\/NousResearch\/hermes-agent\">open-source Hermes Agent<\/a>, Claude Cowork, Claude Code, and Perplexity Computer all connect an LLM to external tools, allowing it to call APIs, read files, and automate workflows that can require switching between half a dozen applications.  However, Hermes runs on your infrastructure and is provider-agnostic. The others are tied to the models and infrastructure of Anthropic and Perplexity.<\/p>\n<p class=\"wp-block-paragraph\">Other notable tools in the broader agent ecosystem include:<\/p>\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/github.com\/openclaw\/openclaw\">OpenClaw<\/a>\u00a0(380K+ GitHub stars), an open-source agent harness that pairs with filesystem-based memory stores;<\/li>\n<li><a href=\"https:\/\/github.com\/openai\/codex\">OpenAI Codex<\/a> CLI\u00a0(93K stars), which gives developers terminal-based agent access with local file persistence; and<\/li>\n<li>Orchestration\u00a0frameworks\u00a0like\u00a0<a href=\"https:\/\/github.com\/langchain-ai\/langchain\">LangChain<\/a>\u00a0(140K stars) and\u00a0<a href=\"https:\/\/github.com\/crewAIInc\/crewAI\">CrewAI<\/a>\u00a0(54K stars), which you build against rather than use directly.<\/li>\n<\/ul>\n<p class=\"wp-block-paragraph\">What they all share, in different ways, is that the model is a guest in your system, not the landlord.<\/p>\n<p class=\"wp-block-paragraph\">Hermes takes that principle to a good extreme. It maintains a persistent local context store \u2014 your conversation history, tool outputs, and embeddings are in your database and accessible across sessions. A memory layer on top of that learns from each interaction, capturing preferences, corrections, and recurring patterns so the agent improves over time rather than starting fresh each session.<\/p>\n<p><a href=\"https:\/\/www.semrush.com\/lp\/semrush-one\/en\/?utm_campaign=ic_semrush_one&amp;utm_source=searchengineland.com&amp;utm_medium=overlay&amp;onboarding=off\" target=\"_blank\"><\/a><\/p>\n<div>\n<div>\n<div class=\"headline-responsive\">\n        Your customers search everywhere. Make sure your brand <span>shows up<\/span>.\n      <\/div>\n<p>\n        The SEO toolkit you know, plus the AI visibility data you need.\n      <\/p>\n<\/div>\n<div>\n      <span>Start Free Trial<\/span>\n    <\/div>\n<div>\n<div>Get started with<\/div>\n<p>      <img decoding=\"async\" src=\"https:\/\/searchengineland.com\/wp-content\/seloads\/2025\/11\/semrush-one.webp\" alt=\"Semrush One Logo\" \/>\n    <\/p><\/div>\n<\/div>\n<p><\/p>\n<p class=\"wp-block-paragraph\">Its built-in tool ecosystem (web, terminal, APIs, vision, Python) means the same pipeline that pulls Salesforce or HubSpot records, checks a data warehouse, and drafts a report, also captures the intermediate results and saves them locally. And, because it is provider-agnostic, you only need to change a config line to go from OpenRouter to a self-hosted LLaMA.<\/p>\n<p class=\"wp-block-paragraph\">The product is the implementation. The pattern is what matters \u2014 and any team can adopt it. The message is not \u201cuse Hermes Agent.\u201d The message is \u201cstart building the systems that let you own your context, because the provider-centric approach cannot scale.\u201d<\/p>\n<p class=\"wp-block-paragraph\">The momentum behind agentic, context-owning tools is unmistakable. But the real question these tools force is strategic: do you want to pay for the work, or own the infrastructure and pay for the reasoning? Switch to a bigger subscription, and you\u2019re still likely to run out of capacity. A different architecture removes that issue entirely. The choice every marketing team faces is which side of that equation they want to be on.<\/p>\n<p class=\"wp-block-paragraph\"><em>This is the first in a three-part series on the shift toward agentic marketing workflows and the infrastructure required to support them. In Part 2, I walk through how the architecture works in practice. Part 3 covers getting started with Hermes Desktop \u2014 the actual installation, skills, and workflows.<\/em><\/p>\n<p class=\"wp-block-paragraph\">\n<\/p><p>The post <a href=\"https:\/\/martech.org\/agentic-ai-is-rewriting-martech-economics-and-infrastructure\/\">Agentic AI is rewriting martech economics and infrastructure<\/a> appeared first on <a href=\"https:\/\/martech.org\/\">MarTech<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>Marketers embraced AI when it was priced like an all-you-can-eat buffet. Providers\u2019 shift to token-based pricing comes just as agentic workflows are becoming part of everyday marketing \u2014 and agents use many, many tokens. Martech\u2019s infrastructure needs to change if it\u2019s going to keep costs down amid growing demand. The moment AI connects to your &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/attentionmedia.io\/?p=11149\" class=\"more-link\">Read more<span class=\"screen-reader-text\"> &#8220;Agentic AI is rewriting martech economics and infrastructure&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-11149","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"featured_media_urls":{"thumbnail":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"medium":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"medium_large":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"large":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"1536x1536":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"2048x2048":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"inspiro-featured-image":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"inspiro-loop":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"inspiro-loop@2x":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"portfolio_item-thumbnail":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"portfolio_item-thumbnail@2x":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"portfolio_item-masonry":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"portfolio_item-masonry@2x":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"portfolio_item-thumbnail_cinema":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"portfolio_item-thumbnail_portrait":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"portfolio_item-thumbnail_portrait@2x":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false],"portfolio_item-thumbnail_square":["https:\/\/martech.org\/wp-content\/uploads\/2026\/06\/image-42.png",0,0,false]},"_links":{"self":[{"href":"https:\/\/attentionmedia.io\/index.php?rest_route=\/wp\/v2\/posts\/11149","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/attentionmedia.io\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/attentionmedia.io\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/attentionmedia.io\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/attentionmedia.io\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=11149"}],"version-history":[{"count":0,"href":"https:\/\/attentionmedia.io\/index.php?rest_route=\/wp\/v2\/posts\/11149\/revisions"}],"wp:attachment":[{"href":"https:\/\/attentionmedia.io\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=11149"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/attentionmedia.io\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=11149"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/attentionmedia.io\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=11149"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}