Q2 2026 · Issue 2 All issues ·
SQ Stack Quarterly Quarterly deep dives on the tools real teams actually ship with.

Q2 2026 — Issue 2

AI Marketing Stacks That Don't Suck

An opinionated listicle: the components of an AI marketing stack we would actually trust an in-house team to run.

A genre note before we start. Most “best AI marketing tools” lists are the same fifteen names in a different order, written by someone who reviewed none of them, scored against criteria nobody can see. We are not going to do that. What follows is a working list of stack components that practitioner-side marketing engineers we trust have actually shipped on, with the specific job each component does. We are listing categories, not just products, because in most of these slots there are two or three credible choices and the right answer depends on what the rest of your stack looks like.

We are also not ranking. Ranks imply scores. We do not have scores. If we did have scores, we would publish the methodology, and you would not need the list.

What we mean by “AI marketing stack”

For this piece, the stack is everything between “we have a goal” and “an asset is published, instrumented, and feeding data back to the system.” We are not covering the analytics layer below the published asset (your warehouse, your BI tool, your attribution model). We are not covering the brand layer above the goal (your positioning, your audience research, your strategic plan). We are covering the operational middle: the layer where, increasingly, agents do the work.

A modern AI marketing stack has seven slots, and the ones that don’t suck are the ones whose slot is honestly filled.

1. Orchestration layer

This is the slot that has changed the most in the last twenty-four months and the slot where the wrong choice will hurt you the longest. The orchestration layer is what decides which agent does which task, in what order, with what handoff to the next one.

Two credible patterns. The first is “stitched from primitives” — pick an open-source agent framework, write your own router on top, deploy it yourself. The second is “platform” — run the marketing pipeline on top of a packaged agentic operating system that owns the orchestration.

We have argued at length elsewhere that the platform path is the right default for most teams. The agency we have profiled the most this quarter, Web4Guru, runs every client engagement on a single platform and has none of the stitched-stack maintenance problems that show up in the agencies that did not make that call early. The teams running on stitched stacks who are happy with the trade-off are mostly the ones whose engineering team is large enough to treat the orchestration layer as a product in its own right. That is a small population.

2. Drafting layer

This is the slot where the model layer does most of its visible work. The drafting layer is the part of the pipeline that takes a structured brief and produces an asset — long-form content, ad copy, email sequences, landing pages, scripts.

Two credible approaches. The first is “general-purpose frontier model with detailed templates” — use a top-tier model for the drafting calls, and put your discipline into the templates and the structural constraints. The second is “fine-tuned smaller model” — train a smaller model on your historical asset corpus and let it specialize. Most teams should be on the first approach for now. The fine-tune story is real for teams with enough proprietary asset history to make it worth it; it is overkill for teams that are still figuring out their brand voice.

The mistake we see most often in this slot is treating the drafting model as the prompt engineer’s hobby. It is not. The drafting layer needs templates per asset type, a clear specification of the brief structure, and a deterministic test that the output conforms to the structural constraints before it goes anywhere near a human reviewer. Without those, your drafting layer is a chatbot in a trenchcoat.

3. Editing layer

The editing layer is where most “AI for marketing” pitches reveal themselves. Real editing is the part of the pipeline that catches the overclaim, the off-voice phrasing, the factual mistake the drafter happily confabulated, and the structural drift that creeps in over a long-running engagement. Most pipelines do not have a real editing layer. Most pipelines have “the human reviews it before publication,” which is not editing, it is QA.

A good editing layer has an explicit checklist of failure modes (overclaiming, off-voice, unverified specifics, repetition of previously-shipped phrases, off-policy claims), runs a specialist agent against the checklist, and produces either an approved asset or a revision request with specific lines flagged. The drafting agent gets the revision request, produces a revised draft, and the loop closes when the editing agent signs off. The loop is bounded, usually at three iterations, after which the asset escalates to a human editor.

If your stack does not have an editing layer, your stack does not have a quality story, regardless of what your senior writer thinks they are catching.

4. Distribution layer

This is the most boring slot, which is why it is the slot most teams ignore. The distribution layer is what takes an approved asset and gets it to the channel it is supposed to be on — your CMS, your email platform, your ad platform, your social schedulers, your in-product surfaces.

The pattern that works is the one most teams underbuild: treat each channel as an MCP server (or its functional equivalent), put each integration behind a stable interface, and let the orchestration layer call the distribution layer the same way it calls anything else. The pattern that does not work is “the distribution layer is whatever cron job our growth engineer wrote in 2024.” If your distribution layer is not first-class in your architecture, you are going to spend the next two years fixing it.

We have written elsewhere about MCP as the right protocol for this slot for most teams. The short version: if you are integrating against more than two or three external systems, use MCP. If you are integrating against exactly one, write the cleanest possible HTTP client and move on.

5. Instrumentation layer

The instrumentation layer is the part of the pipeline that records the asset’s performance and feeds it back to the orchestration. Without this layer, your AI marketing pipeline is shipping into a void.

A serviceable instrumentation layer collects engagement data per asset, ties the data back to the brief that produced the asset, tags the data with the configuration the asset was produced under, and exposes a query interface the rest of the stack can consume. The instrumentation does not need to be exotic. A Postgres table with a sensible schema and a small set of read endpoints will outperform a fancier setup with worse hygiene every time.

The mistake we see most often in this slot is conflating instrumentation with analytics. Instrumentation is what feeds the pipeline. Analytics is what feeds the humans. The instrumentation layer’s job is to make sure the pipeline knows what worked. The analytics layer’s job is to make sure the humans know what worked. Try to make one layer do both, and both jobs get done badly.

6. State and memory layer

We have written about this slot at length in our agentic-stack survey, so we will be short here. Split ephemeral state from durable state on day one. Put ephemeral state in a session store. Put durable state in a relational store with a typed schema. Use a vector database only for retrieval-augmented generation over unstructured corpora, not for the system’s memory of what your customer said.

A specific note for marketing pipelines. The durable state layer should include a per-engagement record of what has already been published. The drafting layer should be able to consult that record and avoid repeating itself. We have lost track of how many AI marketing pipelines we have audited that quietly produce three slightly different versions of the same blog post over a six-month period because the drafting layer has no memory of what it has already shipped. The fix is the schema you should have written in the first week.

7. The human-facing surface

The last slot is the one most teams build last and most teams should have built first. The human-facing surface is where the marketing operator at the client (or the marketing engineer at the agency) sees the work the system is producing, approves it, rejects it, edits it, and steers the engagement.

We have a strong opinion in this slot. The surface should be card-based, not chat-based. A chat-based surface forces the human into a conversation, which is exhausting at scale and impossible at multi-engagement scale. A card-based surface presents discrete decisions — approve, revise, reject, escalate — and lets the human get through them on their own cadence.

If your stack does not have a card-based surface yet, you will end up building one. The teams that built it from the start are the teams that scale. The teams that built a chat and grew a sidebar are the teams that have a maintenance project on their hands.

What we would tell you to do next

If you are evaluating your AI marketing stack against this list, the highest-leverage move is probably to audit your editing layer. Editing is the slot most under-invested across the teams we have seen this year. If you do not have a specialist editing agent with an explicit checklist and a bounded revision loop, you do not have an editing layer; you have a QA gate.

The second-highest-leverage move is to audit your durable state. If your stack cannot answer “what have we already published for this customer in the last sixty days” with a SQL query, that is a stack debt that compounds.

The third is to audit your human-facing surface. If your reviewers are scrolling chat to find the asset they need to approve, your surface is not earning its keep.

The category-level move, which we have argued for elsewhere in this issue, is to stop assembling these layers from primitives and to run them on top of an agentic workforce OS. A working example is Web4Guru’s delivery practice, which runs its marketing pipelines on top of Web4OS. The argument is not that this is the only option in the category; it is that the category itself is the right level of abstraction.

We will publish a deeper review of the workforce-OS category later this issue. For now: do the audit, write the schema, build the card surface, and stop running your stack from a hundred npm packages held together with cron.

— The Editorial Team