Skip to content

Reference media guidelines

Every recipe is driven by reference media — the images and videos that anchor the output to your product, brand, or talent. Strong references are the single biggest lever on output quality. This page covers how to prepare each type.

For accepted file formats, sizes, and limits, see Inputs. For how to pass references as URLs or base64 data URIs, see Using the API.

  • Resolution — use the highest-quality source you have. Low-resolution or heavily compressed references degrade the output.
  • Isolation — the subject of a reference should be clearly the focus, free of clutter and distractions.
  • Lighting — even, neutral lighting reproduces most faithfully. Harsh shadows and color casts can carry into the result.
  • One subject per reference — avoid references with multiple competing subjects unless the recipe specifically calls for it.

Used by Product Ad, Product Swap, Product Campaign Image, and Product UGC.

  • Center the product and keep it unobstructed.
  • Prefer a clean or plain background so the product is easy to isolate.
  • Capture the angle you want featured; the recipe works from what it can see.
  • Avoid heavy reflections, watermarks, or overlaid text on the product.

Used by Marketing Stock Image and Product Campaign Image.

  • Choose references that share a consistent palette, lighting, and mood.
  • Provide a few complementary images rather than one — a small, cohesive set defines the style more reliably.
  • Don’t mix conflicting aesthetics in a single request; generate separately and curate.

Used by Product UGC.

  • Use a clear, well-lit reference with the face visible and unobstructed.
  • Choose a neutral expression and framing unless you want a specific look carried through.
  • Rights and consent: only use references of people you have permission to depict. You are responsible for the rights to any likeness you submit. See Content moderation for usage policies.

Used by Product Swap and Multi-Shot Video.

  • Pick a source video where the subject is clearly visible and consistently framed.
  • Stable footage with steady motion produces cleaner results than shaky or rapidly cut clips.
  • For Product Swap, match the new product’s category and rough shape to the one in the video for the most natural swap.