update skills

2026-07-04 21:13:36 -07:00 · 2026-03-17 16:53:22 -07:00
parent 0b0783ef8e
commit f9a530667e
389 changed files with 54512 additions and 1 deletions
@@ -0,0 +1,132 @@
+# CLI reference (`scripts/image_gen.py`)
+
+This file contains the “command catalog” for the bundled image generation CLI. Keep `SKILL.md` as overview-first; put verbose CLI details here.
+
+## What this CLI does
+- `generate`: generate new images from a prompt
+- `edit`: edit an existing image (optionally with a mask) — inpainting / background replacement / “change only X”
+- `generate-batch`: run many jobs from a JSONL file (one job per line)
+
+Real API calls require **network access** + `OPENAI_API_KEY`. `--dry-run` does not.
+
+## Quick start (works from any repo)
+Set a stable path to the skill CLI (default `CODEX_HOME` is `~/.codex`):
+
+```
+export CODEX_HOME="${CODEX_HOME:-$HOME/.codex}"
+export IMAGE_GEN="$CODEX_HOME/skills/imagegen/scripts/image_gen.py"
+```
+
+Dry-run (no API call; no network required; does not require the `openai` package):
+
+```
+python "$IMAGE_GEN" generate --prompt "Test" --dry-run
+```
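For callers that launch the CLI from Python rather than a shell, the path resolution above can be sketched as follows. This is a minimal illustration, not part of the skill: the helper names `image_gen_path` and `dry_run_command` are hypothetical, and only the directory layout and the `generate --prompt ... --dry-run` arguments come from the text above.

```python
import os
from pathlib import Path


def image_gen_path(env=None):
    """Return the CLI path, falling back to ~/.codex when CODEX_HOME is unset or empty."""
    env = os.environ if env is None else env
    codex_home = env.get("CODEX_HOME") or str(Path.home() / ".codex")
    return Path(codex_home) / "skills" / "imagegen" / "scripts" / "image_gen.py"


def dry_run_command(prompt, env=None):
    """Build the argument list for a dry run; nothing is executed here."""
    return ["python", str(image_gen_path(env)), "generate", "--prompt", prompt, "--dry-run"]
```

The returned list can be handed to `subprocess.run(...)` unchanged; like `${CODEX_HOME:-$HOME/.codex}`, the fallback applies when the variable is unset or empty.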
+
+Generate (requires `OPENAI_API_KEY` + network):
+
+```
+uv run --with openai python "$IMAGE_GEN" generate --prompt "A cozy alpine cabin at dawn" --size 1024x1024
+```
+
+No `uv` installed? Use your active Python env (the `openai` package must already be installed there):
+
+```
+python "$IMAGE_GEN" generate --prompt "A cozy alpine cabin at dawn" --size 1024x1024
+```
+
+## Guardrails (important)
+- Use `python "$IMAGE_GEN" ...` (or equivalent full path) for generations/edits/batch work.
+- Do **not** create one-off runners (e.g. `gen_images.py`) unless the user explicitly asks for a custom wrapper.
+- **Never modify** `scripts/image_gen.py`. If something is missing, ask the user before doing anything else.
+
+## Defaults (unless overridden by flags)
+- Model: `gpt-image-1.5`
+- Size: `1024x1024`
+- Quality: `auto`
+- Output format: `png`
+- Background: unspecified (API default). If you set `--background transparent`, also set `--output-format png` or `webp`.
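The background/format rule in the last bullet is easy to check before a command is assembled. The sketch below is a hypothetical pre-flight helper (`check_background` is not something the CLI exposes); it only encodes the rule stated above.

```python
TRANSPARENT_FORMATS = {"png", "webp"}


def check_background(background, output_format="png"):
    """Raise ValueError if a transparent background is paired with an unsupported format."""
    if background == "transparent" and output_format not in TRANSPARENT_FORMATS:
        raise ValueError(
            f"--background transparent needs --output-format png or webp, got {output_format!r}"
        )
    # background=None stands for "unspecified", i.e. the API default.
    return background, output_format
```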
+
+## Quality + input fidelity
+- `--quality` works for `generate`, `edit`, and `generate-batch`: `low|medium|high|auto`.
+- `--input-fidelity` is **edit-only**: `low|high` (use `high` for strict edits like identity or layout lock).
+
+Example:
+```
+python "$IMAGE_GEN" edit --image input.png --prompt "Change only the background" --quality high --input-fidelity high
+```
+
+## Masks (edits)
+- Use a **PNG** mask with an alpha channel (`references/image-api.md` lists the alpha channel as required).
+- The mask should match the input image dimensions.
+- In the edit prompt, repeat invariants (e.g., “change only the background; keep the subject unchanged”) to reduce drift.
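The first two mask rules can be verified without extra dependencies by reading the PNG header directly. This is a sketch under stated assumptions: the function names are hypothetical, only the standard PNG layout (8-byte signature followed by the `IHDR` chunk) is relied on, and palette images that carry transparency in a `tRNS` chunk are reported as having no alpha channel.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_header(data):
    """Return (width, height, has_alpha) from the first 26 bytes of a PNG file."""
    if data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("not a PNG file")
    width, height, _bit_depth, color_type = struct.unpack(">IIBB", data[16:26])
    # Color types 4 (gray + alpha) and 6 (RGBA) carry a full alpha channel.
    return width, height, color_type in (4, 6)


def mask_problems(image_header, mask_header):
    """List how a mask header departs from the guidance above (empty list = looks fine)."""
    problems = []
    if image_header[:2] != mask_header[:2]:
        problems.append("mask dimensions differ from the input image")
    if not mask_header[2]:
        problems.append("mask has no alpha channel")
    return problems
```

A typical call would be `read_png_header(open("mask.png", "rb").read(26))`; the input image header is passed as a plain tuple so that non-PNG inputs can be measured by other means.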
+
+## Optional deps
+Prefer `uv run --with ...` for an out-of-the-box run without changing the current project env; otherwise install into your active env:
+
+```
+uv pip install openai
+```
+
+## Common recipes
+
+Generate + also write a downscaled copy for fast web loading:
+
+```
+uv run --with openai --with pillow python "$IMAGE_GEN" generate \
+  --prompt "A cozy alpine cabin at dawn" \
+  --size 1536x1024 \
+  --downscale-max-dim 1024
+```
+
+Notes:
+- Downscaling writes an extra file next to the original (default suffix `-web`, e.g. `output-web.png`).
+- Downscaling requires Pillow (use `uv run --with pillow ...` or install it into your env).
+
+Generate with augmentation fields:
+
+```
+python "$IMAGE_GEN" generate \
+  --prompt "A minimal hero image of a ceramic coffee mug" \
+  --use-case "landing page hero" \
+  --style "clean product photography" \
+  --composition "centered product, generous negative space" \
+  --constraints "no logos, no text"
+```
+
+Generate multiple prompts concurrently (async batch):
+
+```
+mkdir -p tmp/imagegen
+cat > tmp/imagegen/prompts.jsonl << 'EOF'
+{"prompt":"Cavernous hangar interior with a compact shuttle parked center-left, open bay door","use_case":"game concept art environment","composition":"wide-angle, low-angle, cinematic framing","lighting":"volumetric light rays through drifting fog","constraints":"no logos or trademarks; no watermark","size":"1536x1024"}
+{"prompt":"Gray wolf in profile in a snowy forest, crisp fur texture","use_case":"wildlife photography print","composition":"100mm, eye-level, shallow depth of field","constraints":"no logos or trademarks; no watermark","size":"1024x1024"}
+EOF
+
+python "$IMAGE_GEN" generate-batch --input tmp/imagegen/prompts.jsonl --out-dir out --concurrency 5
+
+# Cleanup (recommended)
+rm -f tmp/imagegen/prompts.jsonl
+```
+
+Notes:
+- Use `--concurrency` to control parallelism (default `5`). Higher concurrency can hit rate limits; the CLI retries on transient errors.
+- Per-job overrides are supported in JSONL (e.g., `size`, `quality`, `background`, `output_format`, `n`, and prompt-augmentation fields).
+- `--n` generates multiple variants for a single prompt; `generate-batch` is for many different prompts.
+- Treat the JSONL file as temporary: write it under `tmp/` and delete it after the run (don’t commit it).
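When the job list is produced by code rather than typed by hand, the JSONL file can be written and cleaned up in one place. The sketch below is illustrative only: `write_jobs` is a hypothetical helper, the non-empty `prompt` check is an assumption about what each job needs, and the per-job keys shown (`size`, `quality`) are taken from the notes above.

```python
import json
import tempfile
from pathlib import Path


def write_jobs(jobs, directory):
    """Write one JSON object per line and return the path of the JSONL file."""
    path = Path(directory) / "prompts.jsonl"
    with path.open("w", encoding="utf-8") as handle:
        for job in jobs:
            if not job.get("prompt"):
                raise ValueError("every job needs a non-empty 'prompt'")
            handle.write(json.dumps(job, ensure_ascii=False) + "\n")
    return path


jobs = [
    {"prompt": "Gray wolf in profile in a snowy forest", "size": "1024x1024"},
    {"prompt": "Cavernous hangar interior", "size": "1536x1024", "quality": "low"},
]

# The temporary directory (and the JSONL file inside it) is deleted when the block exits.
with tempfile.TemporaryDirectory() as tmp:
    jsonl_path = write_jobs(jobs, tmp)
    lines = jsonl_path.read_text(encoding="utf-8").splitlines()
    # Launch `generate-batch --input <jsonl_path>` here, before the directory goes away.
```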
+
+Edit:
+
+```
+python "$IMAGE_GEN" edit --image input.png --mask mask.png --prompt "Replace the background with a warm sunset"
+```
+
+## CLI notes
+- Supported sizes: `1024x1024`, `1536x1024`, `1024x1536`, or `auto`.
+- Transparent backgrounds require `--output-format png` or `--output-format webp`.
+- Default output is `output.png`; multiple images become `output-1.png`, `output-2.png`, etc.
+- Use `--no-augment` to skip prompt augmentation.
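A script that post-processes results needs to know which files to look for. The helper below mirrors the naming rule described in the notes above; it is a hypothetical sketch written from that description, and the CLI's own logic may differ in edge cases.

```python
from pathlib import Path


def expected_outputs(base="output.png", n=1):
    """Return the file names described above: one file as-is, several as -1, -2, ..."""
    if n < 1:
        raise ValueError("n must be at least 1")
    base = Path(base)
    if n == 1:
        return [str(base)]
    return [str(base.with_name(f"{base.stem}-{i}{base.suffix}")) for i in range(1, n + 1)]
```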
+
+## See also
+- API parameter quick reference: `references/image-api.md`
+- Prompt examples: `references/sample-prompts.md`
@@ -0,0 +1,28 @@
+# Codex network approvals / sandbox notes
+
+This guidance is intentionally isolated from `SKILL.md` because it can vary by environment and may become stale. Prefer the defaults in your environment when in doubt.
+
+## Why am I asked to approve every image generation call?
+Image generation uses the OpenAI Image API, so the CLI needs outbound network access. In many Codex setups, network access is disabled by default (especially under stricter sandbox modes), and/or the approval policy may require confirmation before networked commands run.
+
+## How do I reduce repeated approval prompts (network)?
+If you trust the repo and want fewer prompts, enable network access for the relevant sandbox mode and relax the approval policy.
+
+Example `~/.codex/config.toml` pattern:
+
+```
+approval_policy = "never"
+sandbox_mode = "workspace-write"
+
+[sandbox_workspace_write]
+network_access = true
+```
+
+Or for a single session:
+
+```
+codex --sandbox workspace-write --ask-for-approval never
+```
+
+## Safety note
+Use caution: enabling network and disabling approvals reduces friction but increases risk if you run untrusted code or work in an untrusted repository.
@@ -0,0 +1,36 @@
+# Image API quick reference
+
+## Endpoints
+- Generate: `POST /v1/images/generations` (`client.images.generate(...)`)
+- Edit: `POST /v1/images/edits` (`client.images.edit(...)`)
+
+## Models
+- Default: `gpt-image-1.5`
+- Alternatives: `gpt-image-1-mini` (for faster, lower-cost generation)
+
+## Core parameters (generate + edit)
+- `prompt`: text prompt
+- `model`: image model
+- `n`: number of images (1-10)
+- `size`: `1024x1024`, `1536x1024`, `1024x1536`, or `auto`
+- `quality`: `low`, `medium`, `high`, or `auto`
+- `background`: `transparent`, `opaque`, or `auto` (transparent requires `png`/`webp`)
+- `output_format`: `png` (default), `jpeg`, `webp`
+- `output_compression`: 0-100 (jpeg/webp only)
+- `moderation`: `auto` (default) or `low`
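The ranges listed above can be checked locally before any request is sent. The following is a minimal sketch, not the client library's own validation: `build_params` is a hypothetical helper that returns a plain dict, and it makes no network call.

```python
SIZES = {"1024x1024", "1536x1024", "1024x1536", "auto"}
QUALITIES = {"low", "medium", "high", "auto"}
FORMATS = {"png", "jpeg", "webp"}


def build_params(prompt, model="gpt-image-1.5", n=1, size="1024x1024",
                 quality="auto", output_format="png", output_compression=None):
    """Validate the documented values and return them as keyword arguments."""
    if not 1 <= n <= 10:
        raise ValueError("n must be between 1 and 10")
    if size not in SIZES:
        raise ValueError(f"unsupported size: {size}")
    if quality not in QUALITIES:
        raise ValueError(f"unsupported quality: {quality}")
    if output_format not in FORMATS:
        raise ValueError(f"unsupported output_format: {output_format}")
    params = {"prompt": prompt, "model": model, "n": n, "size": size,
              "quality": quality, "output_format": output_format}
    if output_compression is not None:
        # Compression only applies to the lossy-capable formats.
        if output_format not in {"jpeg", "webp"}:
            raise ValueError("output_compression applies to jpeg/webp only")
        if not 0 <= output_compression <= 100:
            raise ValueError("output_compression must be between 0 and 100")
        params["output_compression"] = output_compression
    return params
```

The resulting dict is shaped so it could be unpacked into `client.images.generate(**params)`, though that call is deliberately left out of the sketch.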
+
+## Edit-specific parameters
+- `image`: one or more input images (first image is primary)
+- `mask`: optional mask image (same size, alpha channel required)
+- `input_fidelity`: `low` (default) or `high` (support varies by model) - set it to `high` if the user needs a very specific edit and you can't achieve it with the default `low` fidelity.
+
+## Output
+- The response carries a `data[]` list; each entry holds one image, base64-encoded, in `b64_json`.
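Turning that payload into files is a base64 decode per entry. The sketch below assumes the entries are plain dicts with a `b64_json` key (client objects may expose the same field as an attribute instead); `save_images` and the fake payload are hypothetical and exist only to make the example runnable offline.

```python
import base64
import tempfile
from pathlib import Path


def save_images(data, out_dir, stem="output", ext="png"):
    """Decode each entry's b64_json field, write it under out_dir, and return the paths."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for index, item in enumerate(data, start=1):
        path = out_dir / f"{stem}-{index}.{ext}"
        path.write_bytes(base64.b64decode(item["b64_json"]))
        paths.append(path)
    return paths


# Stand-in for a real response: the bytes are placeholders, not an actual image.
fake_data = [{"b64_json": base64.b64encode(b"not a real image").decode("ascii")}]
with tempfile.TemporaryDirectory() as tmp:
    written = save_images(fake_data, tmp)
    roundtrip = written[0].read_bytes()
```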
+
+## Limits & notes
+- Input images and masks must be under 50MB.
+- Use the edits endpoint when the user requests changes to an existing image.
+- Masking is prompt-guided; exact shapes are not guaranteed.
+- Large sizes and high quality increase latency and cost.
+- For fast iteration or latency-sensitive runs, start with `quality=low`; raise to `high` for text-heavy or detail-critical outputs.
+- Use `input_fidelity=high` for strict edits (identity preservation, layout lock, or precise compositing).
@@ -0,0 +1,81 @@
+# Prompting best practices (gpt-image-1.5)
+
+## Contents
+- [Structure](#structure)
+- [Specificity](#specificity)
+- [Avoiding “tacky” outputs](#avoiding-tacky-outputs)
+- [Composition & layout](#composition--layout)
+- [Constraints & invariants](#constraints--invariants)
+- [Text in images](#text-in-images)
+- [Multi-image inputs](#multi-image-inputs)
+- [Iterate deliberately](#iterate-deliberately)
+- [Quality vs latency](#quality-vs-latency)
+- [Use-case tips](#use-case-tips)
+- [Where to find copy/paste recipes](#where-to-find-copypaste-recipes)
+
+## Structure
+- Use a consistent order: scene/background -> subject -> key details -> constraints -> output intent.
+- Include intended use (ad, UI mock, infographic) to set the mode and polish level.
+- For complex requests, use short labeled lines instead of a long paragraph.
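One way to keep the order consistent is to assemble the labeled lines programmatically. This is a sketch, not the CLI's augmentation logic: `build_prompt` and its keyword names are hypothetical, while the labels follow the ones used in `references/sample-prompts.md`.

```python
FIELD_ORDER = [
    ("use_case", "Use case"),
    ("primary_request", "Primary request"),
    ("scene", "Scene/background"),
    ("subject", "Subject"),
    ("style", "Style/medium"),
    ("composition", "Composition/framing"),
    ("lighting", "Lighting/mood"),
    ("constraints", "Constraints"),
]


def build_prompt(**fields):
    """Emit one 'Label: value' line per provided field, always in the same order."""
    unknown = set(fields) - {key for key, _ in FIELD_ORDER}
    if unknown:
        raise ValueError(f"unknown fields: {sorted(unknown)}")
    return "\n".join(
        f"{label}: {fields[key]}" for key, label in FIELD_ORDER if fields.get(key)
    )
```

Because the order lives in one list, callers can pass fields in any order and still get the same layout on every request.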
+
+## Specificity
+- Name materials, textures, and visual medium (photo, watercolor, 3D render).
+- For photorealism, include camera/composition language (lens, framing, lighting).
+- Add targeted quality cues only when needed (film grain, textured brushstrokes, macro detail); avoid generic quality tags such as "8K".
+
+## Avoiding “tacky” outputs
+- Don’t use vibe-only buzzwords (“epic”, “cinematic”, “trending”, “8k”, “award-winning”, “unreal engine”, “artstation”) unless the user explicitly wants that look.
+- Specify restraint: “minimal”, “editorial”, “premium”, “subtle”, “natural color grading”, “soft contrast”, “no harsh bloom”, “no oversharpening”.
+- For 3D/illustration, name the finish you want: “matte”, “paper grain”, “ink texture”, “flat color with soft shadow”; avoid “glossy plastic” unless requested.
+- Add a short negative line when needed (especially for marketing art): “Avoid: stock-photo vibe; cheesy lens flare; oversaturated neon; excessive bokeh; fake-looking smiles; clutter”.
+
+## Composition & layout
+- Specify framing and viewpoint (close-up, wide, top-down) and placement ("logo top-right").
+- Call out negative space if you need room for UI or overlays.
+
+## Constraints & invariants
+- State what must not change ("keep background unchanged").
+- For edits, say "change only X; keep Y unchanged" and repeat invariants on every iteration to reduce drift.
+
+## Text in images
+- Put literal text in quotes or ALL CAPS and specify typography (font style, size, color, placement).
+- Spell uncommon words letter-by-letter if accuracy matters.
+- For in-image copy, require verbatim rendering and no extra characters.
+
+## Multi-image inputs
+- Reference inputs by index and role ("Image 1: product, Image 2: style").
+- Describe how to combine them ("apply Image 2's style to Image 1").
+- For compositing, specify what moves where and what must remain unchanged.
+
+## Iterate deliberately
+- Start with a clean base prompt, then make small single-change edits.
+- Re-specify critical constraints when you iterate.
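Re-specifying constraints by hand is easy to forget, so an iteration loop can attach them automatically. The helper below is a hypothetical sketch: it pairs one change per pass with the full list of invariants, using the "change only X; keep Y unchanged" wording recommended earlier in this file.

```python
def iteration_prompt(change, invariants):
    """Combine a single change with every invariant for one edit pass."""
    if not invariants:
        raise ValueError("list at least one invariant to restate")
    keep = "; ".join(f"keep {item} unchanged" for item in invariants)
    return f"Change only {change}; {keep}."


invariants = ["the subject", "the framing"]
# One small change per pass; the invariants are repeated verbatim each time.
passes = [iteration_prompt(change, invariants) for change in ("the background", "the lighting")]
```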
+
+## Quality vs latency
+- For latency-sensitive runs, start at `quality=low` and only raise it if needed.
+- Use `quality=high` for text-heavy or detail-critical images.
+- For strict edits (identity preservation, layout lock), consider `input_fidelity=high`.
+
+## Use-case tips
+Generate:
+- photorealistic-natural: Prompt as if a real photo were being captured in the moment; use photography language (lens, lighting, framing); call for real texture (pores, wrinkles, fabric wear, imperfections); avoid studio polish or staging; use `quality=high` when detail matters.
+- product-mockup: Describe the product/packaging and materials; ensure clean silhouette and label clarity; if in-image text is needed, require verbatim rendering and specify typography.
+- ui-mockup: Describe a real product; focus on layout, hierarchy, and common UI elements; avoid concept-art language so it looks shippable.
+- infographic-diagram: Define the audience and layout flow; label parts explicitly; require verbatim text; use `quality=high`.
+- logo-brand: Keep it simple and scalable; ask for a strong silhouette and balanced negative space; avoid gradients and fine detail.
+- illustration-story: Define panels or scene beats; keep each action concrete; for continuity, restate character traits and outfit each time.
+- stylized-concept: Specify style cues, material finish, and rendering approach (3D, painterly, clay); add a short "Avoid" line to prevent tacky effects.
+- historical-scene: State the location/date and required period accuracy; constrain clothing, props, and environment to match the era.
+
+Edit:
+- text-localization: Change only the text; preserve layout, typography, spacing, and hierarchy; no extra words or reflow unless needed.
+- identity-preserve: Lock identity (face, body, pose, hair, expression); change only the specified elements; match lighting and shadows; use `input_fidelity=high` if likeness drifts.
+- precise-object-edit: Specify exactly what to remove/replace; preserve surrounding texture and lighting; keep everything else unchanged.
+- lighting-weather: Change only environmental conditions (light, shadows, atmosphere, precipitation); keep geometry, framing, and subject identity.
+- background-extraction: Request transparent background; crisp silhouette; no halos; preserve label text exactly; optionally add a subtle contact shadow.
+- style-transfer: Specify style cues to preserve (palette, texture, brushwork) and what must change; add "no extra elements" to prevent drift.
+- compositing: Reference inputs by index; specify what moves where; match lighting, perspective, and scale; keep background and framing unchanged.
+- sketch-to-render: Preserve layout, proportions, and perspective; add plausible materials, lighting, and environment; "do not add new elements or text."
+
+## Where to find copy/paste recipes
+For copy/paste prompt specs (examples only), see `references/sample-prompts.md`. This file focuses on principles, structure, and iteration patterns.
@@ -0,0 +1,384 @@
+# Sample prompts (copy/paste)
+
+Use these as starting points (recipes only). Keep user-provided requirements; do not invent new creative elements.
+
+For prompting principles (structure, invariants, iteration), see `references/prompting.md`.
+
+## Generate
+
+### photorealistic-natural
+```
+Use case: photorealistic-natural
+Primary request: candid photo of an elderly sailor on a small fishing boat adjusting a net
+Scene/background: coastal water with soft haze
+Subject: weathered skin with wrinkles and sun texture; a calm dog on deck nearby
+Style/medium: photorealistic candid photo
+Composition/framing: medium close-up, eye-level, 50mm lens
+Lighting/mood: soft coastal daylight, shallow depth of field, subtle film grain
+Materials/textures: real skin texture, worn fabric, salt-worn wood
+Constraints: natural color balance; no heavy retouching; no glamorization; no watermark
+Avoid: studio polish; staged look
+Quality: high
+```
+
+### product-mockup
+```
+Use case: product-mockup
+Primary request: premium product photo of a matte black shampoo bottle with a minimal label
+Scene/background: clean studio gradient from light gray to white
+Subject: single bottle centered with subtle reflection
+Style/medium: premium product photography
+Composition/framing: centered, slight three-quarter angle, generous padding
+Lighting/mood: softbox lighting, clean highlights, controlled shadows
+Materials/textures: matte plastic, crisp label printing
+Constraints: no logos or trademarks; no watermark
+Quality: high
+```
+
+### ui-mockup
+```
+Use case: ui-mockup
+Primary request: mobile app UI for a local farmers market with vendors and specials
+Scene/background: clean white background with subtle natural accents
+Subject: header, vendor list with small photos, "Today's specials" section, location and hours
+Style/medium: realistic product UI, not concept art
+Composition/framing: iPhone frame, balanced spacing and hierarchy
+Constraints: practical layout, clear typography, no logos or trademarks, no watermark
+```
+
+### infographic-diagram
+```
+Use case: infographic-diagram
+Primary request: detailed infographic of an automatic coffee machine flow
+Scene/background: clean, light neutral background
+Subject: bean hopper -> grinder -> brew group -> boiler -> water tank -> drip tray
+Style/medium: clean vector-like infographic with clear callouts and arrows
+Composition/framing: vertical poster layout, top-to-bottom flow
+Text (verbatim): "Bean Hopper", "Grinder", "Brew Group", "Boiler", "Water Tank", "Drip Tray"
+Constraints: clear labels, strong contrast, no logos or trademarks, no watermark
+Quality: high
+```
+
+### logo-brand
+```
+Use case: logo-brand
+Primary request: original logo for "Field & Flour", a local bakery
+Style/medium: vector logo mark; flat colors; minimal
+Composition/framing: single centered logo on plain background with padding
+Constraints: strong silhouette, balanced negative space; original design only; no gradients unless essential; no trademarks; no watermark
+```
+
+### illustration-story
+```
+Use case: illustration-story
+Primary request: 4-panel comic about a pet left alone at home
+Scene/background: cozy living room across panels
+Subject: pet reacting to the owner leaving, then relaxing, then returning to a composed pose
+Style/medium: comic illustration with clear panels
+Composition/framing: 4 equal-sized vertical panels, readable actions per panel
+Constraints: no text; no logos or trademarks; no watermark
+```
+
+### stylized-concept
+```
+Use case: stylized-concept
+Primary request: cavernous hangar interior with tall support beams and drifting fog
+Scene/background: industrial hangar interior, deep scale, light haze
+Subject: compact shuttle, parked center-left, bay door open
+Style/medium: cinematic concept art, industrial realism
+Composition/framing: wide-angle, low-angle, cinematic framing
+Lighting/mood: volumetric light rays cutting through fog
+Constraints: no logos or trademarks; no watermark
+```
+
+### historical-scene
+```
+Use case: historical-scene
+Primary request: outdoor crowd scene in Bethel, New York on August 16, 1969
+Scene/background: open field, temporary stages, period-accurate tents and signage
+Subject: crowd in period-accurate clothing, authentic staging and environment
+Style/medium: photorealistic photo
+Composition/framing: wide shot, eye-level
+Constraints: period-accurate details; no modern objects; no logos or trademarks; no watermark
+```
+
+## Asset type templates (taxonomy-aligned)
+
+### Website assets template
+```
+Use case: <photorealistic-natural|stylized-concept|product-mockup|infographic-diagram|ui-mockup>
+Asset type: <hero image / section illustration / blog header>
+Primary request: <short description>
+Scene/background: <environment or abstract background>
+Subject: <main subject>
+Style/medium: <photo/illustration/3D>
+Composition/framing: <wide/centered; specify negative space side>
+Lighting/mood: <soft/bright/neutral>
+Color palette: <brand colors or neutral>
+Constraints: <no text; no logos; no watermark; leave space for UI>
+```
+
+### Website assets example: minimal hero background
+```
+Use case: stylized-concept
+Asset type: landing page hero background
+Primary request: minimal abstract background with a soft gradient and subtle texture (calm, modern)
+Style/medium: matte illustration / soft-rendered abstract background (not glossy 3D)
+Composition/framing: wide composition; large negative space on the right for headline
+Lighting/mood: gentle studio glow
+Color palette: cool neutrals with a restrained blue accent
+Constraints: no text; no logos; no watermark
+```
+
+### Website assets example: feature section illustration
+```
+Use case: stylized-concept
+Asset type: feature section illustration
+Primary request: simple abstract shapes suggesting connection and flow (tasteful, minimal)
+Scene/background: subtle light-gray backdrop with faint texture
+Style/medium: flat illustration; soft shadows; restrained contrast
+Composition/framing: centered cluster; open margins for UI
+Color palette: muted teal and slate, low contrast accents
+Constraints: no text; no logos; no watermark
+```
+
+### Website assets example: blog header image
+```
+Use case: photorealistic-natural
+Asset type: blog header image
+Primary request: overhead desk scene with notebook, pen, and coffee cup
+Scene/background: warm wooden tabletop
+Style/medium: photorealistic photo
+Composition/framing: wide crop; subject placed left; right side left empty
+Lighting/mood: soft morning light
+Constraints: no text; no logos; no watermark
+```
+
+### Game assets template
+```
+Use case: stylized-concept
+Asset type: <game environment concept art / game character concept / game UI icon / tileable game texture>
+Primary request: <biome/scene/character/icon/material>
+Scene/background: <location + set dressing> (if applicable)
+Subject: <main focal element(s)>
+Style/medium: <realistic/stylized>; <concept art / character render / UI icon / texture>
+Composition/framing: <wide/establishing/top-down>; <camera angle>; <focal point placement>
+Lighting/mood: <time of day>; <mood>; <volumetric/fog/etc>
+Constraints: no logos or trademarks; no watermark
+```
+
+### Game assets example: environment concept art
+```
+Use case: stylized-concept
+Asset type: game environment concept art
+Primary request: cavernous hangar interior with tall support beams and drifting fog
+Scene/background: industrial hangar interior, deep scale, light haze
+Subject: compact shuttle, parked center-left, bay door open
+Foreground: painted floor markings; cables; tool carts along edges
+Style/medium: cinematic concept art, industrial realism
+Composition/framing: wide-angle, low-angle, cinematic framing
+Lighting/mood: volumetric light rays cutting through fog
+Constraints: no logos or trademarks; no watermark
+```
+
+### Game assets example: character concept
+```
+Use case: stylized-concept
+Asset type: game character concept
+Primary request: desert scout character with layered travel gear
+Silhouette: long coat with hood, wide boots, satchel
+Outfit/gear: dusty canvas, leather straps, brass buckles
+Face/hair: windworn face, short cropped hair
+Style/medium: character render; stylized realism
+Pose: neutral hero pose
+Background: simple neutral backdrop
+Constraints: no logos or trademarks; no watermark
+```
+
+### Game assets example: UI icon
+```
+Use case: stylized-concept
+Asset type: game UI icon
+Primary request: round shield icon with a subtle rune pattern
+Style/medium: painted game UI icon
+Composition/framing: centered icon; generous padding; clear silhouette
+Background: transparent
+Lighting/mood: subtle highlights; crisp edges
+Constraints: no text; no logos or trademarks; no watermark
+```
+
+### Game assets example: tileable texture
+```
+Use case: stylized-concept
+Asset type: tileable game texture
+Primary request: worn sandstone blocks
+Style/medium: seamless tileable texture; PBR-ish look
+Scale: medium tiling
+Lighting: neutral / flat lighting
+Constraints: seamless edges; no obvious focal elements; no text; no logos or trademarks; no watermark
+```
+
+### Wireframe template
+```
+Use case: ui-mockup
+Asset type: website wireframe
+Primary request: <page or flow to sketch>
+Fidelity: low-fi grayscale wireframe; hand-drawn feel; simple boxes
+Layout: <sections in order; grid/columns>
+Annotations: <labels for key blocks>
+Resolution/orientation: <landscape or portrait to match expected device>
+Constraints: no color; no logos; no real photos; no watermark
+```
+
+### Wireframe example: homepage (desktop)
+```
+Use case: ui-mockup
+Asset type: website wireframe
+Primary request: SaaS homepage layout with clear hierarchy
+Fidelity: low-fi grayscale wireframe; hand-drawn feel; simple boxes
+Layout: top nav; hero with headline and CTA; three feature cards; testimonial strip; pricing preview; footer
+Annotations: label each block ("Nav", "Hero", "CTA", "Feature", "Testimonial", "Pricing", "Footer")
+Resolution/orientation: landscape (wide) for desktop
+Constraints: no color; no logos; no real photos; no watermark
+```
+
+### Wireframe example: pricing page
+```
+Use case: ui-mockup
+Asset type: website wireframe
+Primary request: pricing page layout with comparison table
+Fidelity: low-fi grayscale wireframe; sketchy lines; simple boxes
+Layout: header; plan toggle; 3 pricing cards; comparison table; FAQ accordion; footer
+Annotations: label key areas ("Toggle", "Plan Card", "Table", "FAQ")
+Resolution/orientation: landscape for desktop or portrait for tablet
+Constraints: no color; no logos; no real photos; no watermark
+```
+
+### Wireframe example: mobile onboarding flow
+```
+Use case: ui-mockup
+Asset type: website wireframe
+Primary request: three-screen mobile onboarding flow
+Fidelity: low-fi grayscale wireframe; hand-drawn feel; simple boxes
+Layout: screen 1 (logo placeholder, headline, illustration placeholder, CTA); screen 2 (feature bullets); screen 3 (form fields + CTA)
+Annotations: label each block and screen number
+Resolution/orientation: portrait (tall) for mobile
+Constraints: no color; no logos; no real photos; no watermark
+```
+
+### Logo template
+```
+Use case: logo-brand
+Asset type: logo concept
+Primary request: <brand idea or symbol concept>
+Style/medium: vector logo mark; flat colors; minimal
+Composition/framing: centered mark; clear silhouette; generous margin
+Color palette: <1-2 colors; high contrast>
+Text (verbatim): "<exact name>" (only if needed)
+Constraints: no gradients; no mockups; no 3D; no watermark
+```
+
+### Logo example: abstract symbol mark
+```
+Use case: logo-brand
+Asset type: logo concept
+Primary request: geometric leaf symbol suggesting sustainability and growth
+Style/medium: vector logo mark; flat colors; minimal
+Composition/framing: centered mark; clear silhouette
+Color palette: deep green and off-white
+Constraints: no text; no gradients; no mockups; no 3D; no watermark
+```
+
+### Logo example: monogram mark
+```
+Use case: logo-brand
+Asset type: logo concept
+Primary request: interlocking monogram of the letters "AV"
+Style/medium: vector logo mark; flat colors; minimal
+Composition/framing: centered mark; balanced spacing
+Color palette: black on white
+Constraints: no gradients; no mockups; no 3D; no watermark
+```
+
+### Logo example: wordmark
+```
+Use case: logo-brand
+Asset type: logo concept
+Primary request: clean wordmark for a modern studio
+Style/medium: vector wordmark; flat colors; minimal
+Text (verbatim): "Studio North"
+Composition/framing: centered text; even letter spacing
+Color palette: charcoal on white
+Constraints: no gradients; no mockups; no 3D; no watermark
+```
+
+## Edit
+
+### text-localization
+```
+Use case: text-localization
+Input images: Image 1: original infographic
+Primary request: translate all in-image text to Spanish
+Constraints: change only the text; preserve layout, typography, spacing, and hierarchy; no extra words; do not alter logos or imagery
+```
+
+### identity-preserve
+```
+Use case: identity-preserve
+Input images: Image 1: person photo; Image 2..N: clothing items
+Primary request: replace only the clothing with the provided garments
+Constraints: preserve face, body shape, pose, hair, expression, and identity; match lighting and shadows; keep background unchanged; no accessories or text
+Input fidelity (edits): high
+```
+
+### precise-object-edit
+```
+Use case: precise-object-edit
+Input images: Image 1: room photo
+Primary request: replace ONLY the white chairs with wooden chairs
+Constraints: preserve camera angle, room lighting, floor shadows, and surrounding objects; keep all other aspects unchanged
+```
+
+### lighting-weather
+```
+Use case: lighting-weather
+Input images: Image 1: original photo
+Primary request: make it look like a winter evening with gentle snowfall
+Constraints: preserve subject identity, geometry, camera angle, and composition; change only lighting, atmosphere, and weather
+Quality: high
+```
+
+### background-extraction
+```
+Use case: background-extraction
+Input images: Image 1: product photo
+Primary request: extract the product on a transparent background
+Output: transparent background (RGBA PNG)
+Constraints: crisp silhouette, no halos/fringing; preserve label text exactly; no restyling
+```
+
+### style-transfer
+```
+Use case: style-transfer
+Input images: Image 1: style reference
+Primary request: apply Image 1's visual style to a man riding a motorcycle on a white background
+Constraints: preserve palette, texture, and brushwork; no extra elements; plain white background
+```
+
+### compositing
+```
+Use case: compositing
+Input images: Image 1: base scene; Image 2: subject to insert
+Primary request: place the subject from Image 2 next to the person in Image 1
+Constraints: match lighting, perspective, and scale; keep background and framing unchanged; no extra elements
+Input fidelity (edits): high
+```
+
+### sketch-to-render
+```
+Use case: sketch-to-render
+Input images: Image 1: drawing
+Primary request: turn the drawing into a photorealistic image
+Constraints: preserve layout, proportions, and perspective; choose realistic materials and lighting; do not add new elements or text
+Quality: high
+```