How Lily Max works

How Lily Max turns product intelligence into measurable lift.

Your team sets the goal. Lily agents execute the work. Your team makes the final decision.

Operating model

You set the goal. Lily Max runs the optimization loop.

Lily Max is not an uncontrolled black box. It is a goal-based system. Your team defines the business outcome, the surface, the product set, the cadence, and the approval rules. Lily agents do the operational work and prepare what should ship.

01

Human sets the goal

Performance teams decide what matters first, from Google Shopping ROAS to onsite search conversion.

  • Choose the surface and product set
  • Set the success metric and test cadence
  • Define approval and deployment rules

02

Agents execute the work

Lily agents evaluate the feed, PDPs, attributes, and structured data against the surface-specific goal.

  • Find missing or weak product signals
  • Generate richer product intelligence
  • Prepare variants and test design

03

Human approves what ships

Teams get a clear readout with the expected impact, confidence, and next step.

  • Scale winning patterns
  • Iterate on a subset
  • Pause or end the experiment

Optimization loop

Every test makes the next test smarter.

The moat is the system. Lily doesn't just generate content in batches. It runs a continuous experiment engine that learns from every surface, every test, and every customer.

01

Discover gaps

Scan feeds, PDPs, schema, and attributes against the active surface and goal.

58Out of 100
  • Required Field Present14,832 / 14,832
  • Image Missing147 SKUs
  • Description Issues64 SKUs
  • Title Issues87 SKUs
  • Ingestion Issues28 SKUs

02

Generate hypotheses

Create titles, descriptions, attribute enrichments, and structured signals to test.

Before Enrichment58 / 100
After Enrichment ↑ +4090 / 100

03

Launch experiments

Deploy matched test and control groups through the right feed, page, or onsite system.

Lily Max Agent
Experiment launched
A/B split confirmed
Early review
Mid-run health check

04

Measure lift

Use holdouts, matched spend, and DiD where appropriate to isolate impact.

Data Processed

14,832 of 14,832 Products

100% Processed

05

Deploy winners

Recommend the next move with human approval before any broad rollout.

Experiment Lifecycle

Women's Footwear, Title + Description · Day 18 of 28

  • Day 0 · Experiment LaunchedPassed
  • Day 3 · Full Ramp ConfirmedIn Progress
  • Day 7 · Early Signal ReviewAction Required
  • Day 28 · Results ReadoutLive

06

Learn continuously

Feed results back into the system so future runs start smarter.

Batch X29d4a, Google Ads

  • Received250,000
  • Enriched211,000
  • Delivered to GMC188,000
  • Needs attention19,000

Operational pace

Google ships.
Lily shipped.

In a landscape that changes by the week, you need a partner that can pivot just as fast. When Google introduced Conversational Attributes in Merchant Center, a new product-data format built for AI Mode, Gemini, and Business Agent, Lily Max implemented support for all six attributes within days. Without that level of agility, brands lose visibility, traffic, and revenue.

question_and_answerdocument_linkrelated_productThe Six Conversational Attributesitem_group_titlevariant_optionpopularity_rank
Lily Max Agent: Headline Engagement Study with cumulative DID performance trend

Tested, not promised

Proof designed for scrutiny.

Every performance claim should be CFO-ready. Lily Max pairs lift numbers with the methodology behind them, so teams can separate directional improvement from validated impact.

  • Matched-spend A/BCompare enriched and control feeds at equivalent spend levels to isolate product intelligence impact.
  • Difference-in-differencesValidate lift against pre-period baselines and control groups, not just before/after movement.
  • Statistical significanceReport confidence and methodology footnotes alongside performance results.

What Lily Max does

A plain-English answer for performance teams.

It does not replace your feed manager.

Lily Max optimizes the product intelligence your feed manager, catalog system, or commerce stack distributes.

It does not just generate content.

Content is one output. The platform tests which product signals actually improve visibility, conversion, and ROAS.

It does not just measure visibility.

AEO tools tell you where you show up. Lily Max improves the product intelligence that determines whether you show up.

One engine, every AI-mediated surface

Make your products legible
wherever AI sells.

Lily Max uses goal-based AI agents to optimize product intelligence across paid, organic, onsite, and agentic commerce surfaces, including the Google, Meta, LLM, and retailer-owned systems that decide what shoppers see.

Always On

Agentic Product
Intelligence

Goal-based agents enrich, evaluate and republish your catalog continuously

  • Google Ads
  • Claude
  • AI Mode
  • Meta Ads
  • Gemini
  • Onsite Search
  • AI Overview
  • Amazon

Why Lily Max

Broader than ACO. More rigorous than content generation.

Lily Max covers agentic commerce, but does not stop there. The same operating discipline extends across the paid, organic, onsite, and AI discovery surfaces that drive performance today.

Versus content generators

Content is an output. Lily Max tests which product intelligence actually lifts performance, then deploys what wins.

Versus feed managers

Feed managers distribute product data. Lily Max improves the product intelligence those feeds carry.

Versus ACO-only tools

ACO optimizes agentic surfaces. Lily Max applies the same discipline across agentic commerce plus Google, Meta, organic, and onsite.

Versus visibility tools

Visibility tools tell you where you appear. Lily Max improves the inputs that determine whether AI systems surface your products.

Get started

Start with your Google Merchant Center (GMC) feed. Expand across every AI-mediated commerce surface.

Lily Max will score your product intelligence gaps and identify where richer, AI-ready attributes can improve visibility, conversion, and ROAS without replacing the tools your team already uses.

Frequently asked questions

How does Lily Max work?

Lily Max runs a continuous optimization loop. Your team sets the goal, then agents discover product-data gaps, generate enrichments, launch controlled tests, measure the lift, and feed the winners back into the next round, with your approval before anything ships.

Who sets the goals, the agents or my team?

Your team does. Lily Max is a goal-based system, not a black box: you define the surface, the product set, the success metric, the test cadence, and the approval rules, and the agents do the operational work toward that goal.

Do I stay in control of what gets published?

Yes. Nothing goes live without your sign-off. You review the proposed changes and the projected impact, then approve what reaches the feed, so agents handle the volume while humans own the decisions.

What are the steps in the optimization loop?

There are five: detect gaps, generate enrichments, launch a controlled test, measure the lift, and recommend what to scale. Each test makes the next one smarter, because results feed back into the system.

How does Lily Max detect product-data gaps?

Lily Max scores your catalog against what each surface rewards, then ranks the highest-impact gaps. It scans feeds, product pages, schema, and attributes for missing fields, weak titles, and thin descriptions before any enrichment work begins.

How does Lily Max prove the lift is real?

Every result is measured against a control. Lily Max uses matched-spend A/B tests, holdout groups, and difference-in-differences analysis, then reports lift with confidence intervals, so results separate validated impact from directional movement and hold up to a finance review.

What is a matched-spend A/B test?

A matched-spend A/B test runs the enriched group and the control group at the same spend level, so any difference in performance reflects the product-data change rather than a bigger budget. It is how Lily Max isolates the effect of enrichment.

How fast can Lily Max support a new AI surface?

Lily Max is built to ship support for new surface capabilities in days, not quarters. When a platform introduces a new product-data type, a catalog that is always scored and always testable can adopt it quickly, rather than waiting a release cycle.

What is the quality score, and is the score the outcome?

The score is not the outcome. Lily Max scores product information on completeness, correctness, compliance, relevance, and differentiation, and uses that score to find the signals most likely to move visibility, conversion, and ROAS, which it then tests.

How long until I see results?

Most teams connect their data and launch within days, then see results as the first controlled tests resolve. Timelines vary by catalog size and starting feed quality, and Lily Max shows where to expect impact first before any credits are spent.