mirror of
https://github.com/ksyasuda/dotfiles.git
synced 2026-03-21 18:11:27 -07:00
update skills
This commit is contained in:
132
.agents/skills/imagegen/references/cli.md
Normal file
132
.agents/skills/imagegen/references/cli.md
Normal file
@@ -0,0 +1,132 @@
|
||||
# CLI reference (`scripts/image_gen.py`)
|
||||
|
||||
This file contains the “command catalog” for the bundled image generation CLI. Keep `SKILL.md` as overview-first; put verbose CLI details here.
|
||||
|
||||
## What this CLI does
|
||||
- `generate`: generate new images from a prompt
|
||||
- `edit`: edit an existing image (optionally with a mask) — inpainting / background replacement / “change only X”
|
||||
- `generate-batch`: run many jobs from a JSONL file (one job per line)
|
||||
|
||||
Real API calls require **network access** + `OPENAI_API_KEY`. `--dry-run` does not.
|
||||
|
||||
## Quick start (works from any repo)
|
||||
Set a stable path to the skill CLI (default `CODEX_HOME` is `~/.codex`):
|
||||
|
||||
```
|
||||
export CODEX_HOME="${CODEX_HOME:-$HOME/.codex}"
|
||||
export IMAGE_GEN="$CODEX_HOME/skills/imagegen/scripts/image_gen.py"
|
||||
```
|
||||
|
||||
Dry-run (no API call; no network required; does not require the `openai` package):
|
||||
|
||||
```
|
||||
python "$IMAGE_GEN" generate --prompt "Test" --dry-run
|
||||
```
|
||||
|
||||
Generate (requires `OPENAI_API_KEY` + network):
|
||||
|
||||
```
|
||||
uv run --with openai python "$IMAGE_GEN" generate --prompt "A cozy alpine cabin at dawn" --size 1024x1024
|
||||
```
|
||||
|
||||
No `uv` installed? Use your active Python env:
|
||||
|
||||
```
|
||||
python "$IMAGE_GEN" generate --prompt "A cozy alpine cabin at dawn" --size 1024x1024
|
||||
```
|
||||
|
||||
## Guardrails (important)
|
||||
- Use `python "$IMAGE_GEN" ...` (or equivalent full path) for generations/edits/batch work.
|
||||
- Do **not** create one-off runners (e.g. `gen_images.py`) unless the user explicitly asks for a custom wrapper.
|
||||
- **Never modify** `scripts/image_gen.py`. If something is missing, ask the user before doing anything else.
|
||||
|
||||
## Defaults (unless overridden by flags)
|
||||
- Model: `gpt-image-1.5`
|
||||
- Size: `1024x1024`
|
||||
- Quality: `auto`
|
||||
- Output format: `png`
|
||||
- Background: unspecified (API default). If you set `--background transparent`, also set `--output-format png` or `webp`.
|
||||
|
||||
## Quality + input fidelity
|
||||
- `--quality` works for `generate`, `edit`, and `generate-batch`: `low|medium|high|auto`.
|
||||
- `--input-fidelity` is **edit-only**: `low|high` (use `high` for strict edits like identity or layout lock).
|
||||
|
||||
Example:
|
||||
```
|
||||
python "$IMAGE_GEN" edit --image input.png --prompt "Change only the background" --quality high --input-fidelity high
|
||||
```
|
||||
|
||||
## Masks (edits)
|
||||
- Use a **PNG** mask; an alpha channel is strongly recommended.
|
||||
- The mask should match the input image dimensions.
|
||||
- In the edit prompt, repeat invariants (e.g., “change only the background; keep the subject unchanged”) to reduce drift.
|
||||
|
||||
## Optional deps
|
||||
Prefer `uv run --with ...` for an out-of-the-box run without changing the current project env; otherwise install into your active env:
|
||||
|
||||
```
|
||||
uv pip install openai
|
||||
```
|
||||
|
||||
## Common recipes
|
||||
|
||||
Generate + also write a downscaled copy for fast web loading:
|
||||
|
||||
```
|
||||
uv run --with openai --with pillow python "$IMAGE_GEN" generate \
|
||||
--prompt "A cozy alpine cabin at dawn" \
|
||||
--size 1024x1024 \
|
||||
--downscale-max-dim 1024
|
||||
```
|
||||
|
||||
Notes:
|
||||
- Downscaling writes an extra file next to the original (default suffix `-web`, e.g. `output-web.png`).
|
||||
- Downscaling requires Pillow (use `uv run --with pillow ...` or install it into your env).
|
||||
|
||||
Generate with augmentation fields:
|
||||
|
||||
```
|
||||
python "$IMAGE_GEN" generate \
|
||||
--prompt "A minimal hero image of a ceramic coffee mug" \
|
||||
--use-case "landing page hero" \
|
||||
--style "clean product photography" \
|
||||
--composition "centered product, generous negative space" \
|
||||
--constraints "no logos, no text"
|
||||
```
|
||||
|
||||
Generate multiple prompts concurrently (async batch):
|
||||
|
||||
```
|
||||
mkdir -p tmp/imagegen
|
||||
cat > tmp/imagegen/prompts.jsonl << 'EOF'
|
||||
{"prompt":"Cavernous hangar interior with a compact shuttle parked center-left, open bay door","use_case":"game concept art environment","composition":"wide-angle, low-angle, cinematic framing","lighting":"volumetric light rays through drifting fog","constraints":"no logos or trademarks; no watermark","size":"1536x1024"}
|
||||
{"prompt":"Gray wolf in profile in a snowy forest, crisp fur texture","use_case":"wildlife photography print","composition":"100mm, eye-level, shallow depth of field","constraints":"no logos or trademarks; no watermark","size":"1024x1024"}
|
||||
EOF
|
||||
|
||||
python "$IMAGE_GEN" generate-batch --input tmp/imagegen/prompts.jsonl --out-dir out --concurrency 5
|
||||
|
||||
# Cleanup (recommended)
|
||||
rm -f tmp/imagegen/prompts.jsonl
|
||||
```
|
||||
|
||||
Notes:
|
||||
- Use `--concurrency` to control parallelism (default `5`). Higher concurrency can hit rate limits; the CLI retries on transient errors.
|
||||
- Per-job overrides are supported in JSONL (e.g., `size`, `quality`, `background`, `output_format`, `n`, and prompt-augmentation fields).
|
||||
- `--n` generates multiple variants for a single prompt; `generate-batch` is for many different prompts.
|
||||
- Treat the JSONL file as temporary: write it under `tmp/` and delete it after the run (don’t commit it).
|
||||
|
||||
Edit:
|
||||
|
||||
```
|
||||
python "$IMAGE_GEN" edit --image input.png --mask mask.png --prompt "Replace the background with a warm sunset"
|
||||
```
|
||||
|
||||
## CLI notes
|
||||
- Supported sizes: `1024x1024`, `1536x1024`, `1024x1536`, or `auto`.
|
||||
- Transparent backgrounds require `output_format` to be `png` or `webp`.
|
||||
- Default output is `output.png`; multiple images become `output-1.png`, `output-2.png`, etc.
|
||||
- Use `--no-augment` to skip prompt augmentation.
|
||||
|
||||
## See also
|
||||
- API parameter quick reference: `references/image-api.md`
|
||||
- Prompt examples: `references/sample-prompts.md`
|
||||
28
.agents/skills/imagegen/references/codex-network.md
Normal file
28
.agents/skills/imagegen/references/codex-network.md
Normal file
@@ -0,0 +1,28 @@
|
||||
# Codex network approvals / sandbox notes
|
||||
|
||||
This guidance is intentionally isolated from `SKILL.md` because it can vary by environment and may become stale. Prefer the defaults in your environment when in doubt.
|
||||
|
||||
## Why am I asked to approve every image generation call?
|
||||
Image generation uses the OpenAI Image API, so the CLI needs outbound network access. In many Codex setups, network access is disabled by default (especially under stricter sandbox modes), and/or the approval policy may require confirmation before networked commands run.
|
||||
|
||||
## How do I reduce repeated approval prompts (network)?
|
||||
If you trust the repo and want fewer prompts, enable network access for the relevant sandbox mode and relax the approval policy.
|
||||
|
||||
Example `~/.codex/config.toml` pattern:
|
||||
|
||||
```
|
||||
approval_policy = "never"
|
||||
sandbox_mode = "workspace-write"
|
||||
|
||||
[sandbox_workspace_write]
|
||||
network_access = true
|
||||
```
|
||||
|
||||
Or for a single session:
|
||||
|
||||
```
|
||||
codex --sandbox workspace-write --ask-for-approval never
|
||||
```
|
||||
|
||||
## Safety note
|
||||
Use caution: enabling network and disabling approvals reduces friction but increases risk if you run untrusted code or work in an untrusted repository.
|
||||
36
.agents/skills/imagegen/references/image-api.md
Normal file
36
.agents/skills/imagegen/references/image-api.md
Normal file
@@ -0,0 +1,36 @@
|
||||
# Image API quick reference
|
||||
|
||||
## Endpoints
|
||||
- Generate: `POST /v1/images/generations` (`client.images.generate(...)`)
|
||||
- Edit: `POST /v1/images/edits` (`client.images.edit(...)`)
|
||||
|
||||
## Models
|
||||
- Default: `gpt-image-1.5`
|
||||
- Alternatives: `gpt-image-1-mini` (for faster, lower-cost generation)
|
||||
|
||||
## Core parameters (generate + edit)
|
||||
- `prompt`: text prompt
|
||||
- `model`: image model
|
||||
- `n`: number of images (1-10)
|
||||
- `size`: `1024x1024`, `1536x1024`, `1024x1536`, or `auto`
|
||||
- `quality`: `low`, `medium`, `high`, or `auto`
|
||||
- `background`: `transparent`, `opaque`, or `auto` (transparent requires `png`/`webp`)
|
||||
- `output_format`: `png` (default), `jpeg`, `webp`
|
||||
- `output_compression`: 0-100 (jpeg/webp only)
|
||||
- `moderation`: `auto` (default) or `low`
|
||||
|
||||
## Edit-specific parameters
|
||||
- `image`: one or more input images (first image is primary)
|
||||
- `mask`: optional mask image (same size, alpha channel required)
|
||||
- `input_fidelity`: `low` (default) or `high` (support varies by model) - set it to `high` if the user needs a very specific edit and you can't achieve it with the default `low` fidelity.
|
||||
|
||||
## Output
|
||||
- `data[]` list with `b64_json` per image
|
||||
|
||||
## Limits & notes
|
||||
- Input images and masks must be under 50MB.
|
||||
- Use edits endpoint when the user requests changes to an existing image.
|
||||
- Masking is prompt-guided; exact shapes are not guaranteed.
|
||||
- Large sizes and high quality increase latency and cost.
|
||||
- For fast iteration or latency-sensitive runs, start with `quality=low`; raise to `high` for text-heavy or detail-critical outputs.
|
||||
- Use `input_fidelity=high` for strict edits (identity preservation, layout lock, or precise compositing).
|
||||
81
.agents/skills/imagegen/references/prompting.md
Normal file
81
.agents/skills/imagegen/references/prompting.md
Normal file
@@ -0,0 +1,81 @@
|
||||
# Prompting best practices (gpt-image-1.5)
|
||||
|
||||
## Contents
|
||||
- [Structure](#structure)
|
||||
- [Specificity](#specificity)
|
||||
- [Avoiding “tacky” outputs](#avoiding-tacky-outputs)
|
||||
- [Composition & layout](#composition--layout)
|
||||
- [Constraints & invariants](#constraints--invariants)
|
||||
- [Text in images](#text-in-images)
|
||||
- [Multi-image inputs](#multi-image-inputs)
|
||||
- [Iterate deliberately](#iterate-deliberately)
|
||||
- [Quality vs latency](#quality-vs-latency)
|
||||
- [Use-case tips](#use-case-tips)
|
||||
- [Where to find copy/paste recipes](#where-to-find-copypaste-recipes)
|
||||
|
||||
## Structure
|
||||
- Use a consistent order: scene/background -> subject -> key details -> constraints -> output intent.
|
||||
- Include intended use (ad, UI mock, infographic) to set the mode and polish level.
|
||||
- For complex requests, use short labeled lines instead of a long paragraph.
|
||||
|
||||
## Specificity
|
||||
- Name materials, textures, and visual medium (photo, watercolor, 3D render).
|
||||
- For photorealism, include camera/composition language (lens, framing, lighting).
|
||||
- Add targeted quality cues only when needed (film grain, textured brushstrokes, macro detail); avoid generic "8K" style prompts.
|
||||
|
||||
## Avoiding “tacky” outputs
|
||||
- Don’t use vibe-only buzzwords (“epic”, “cinematic”, “trending”, “8k”, “award-winning”, “unreal engine”, “artstation”) unless the user explicitly wants that look.
|
||||
- Specify restraint: “minimal”, “editorial”, “premium”, “subtle”, “natural color grading”, “soft contrast”, “no harsh bloom”, “no oversharpening”.
|
||||
- For 3D/illustration, name the finish you want: “matte”, “paper grain”, “ink texture”, “flat color with soft shadow”; avoid “glossy plastic” unless requested.
|
||||
- Add a short negative line when needed (especially for marketing art): “Avoid: stock-photo vibe; cheesy lens flare; oversaturated neon; excessive bokeh; fake-looking smiles; clutter”.
|
||||
|
||||
## Composition & layout
|
||||
- Specify framing and viewpoint (close-up, wide, top-down) and placement ("logo top-right").
|
||||
- Call out negative space if you need room for UI or overlays.
|
||||
|
||||
## Constraints & invariants
|
||||
- State what must not change ("keep background unchanged").
|
||||
- For edits, say "change only X; keep Y unchanged" and repeat invariants on every iteration to reduce drift.
|
||||
|
||||
## Text in images
|
||||
- Put literal text in quotes or ALL CAPS and specify typography (font style, size, color, placement).
|
||||
- Spell uncommon words letter-by-letter if accuracy matters.
|
||||
- For in-image copy, require verbatim rendering and no extra characters.
|
||||
|
||||
## Multi-image inputs
|
||||
- Reference inputs by index and role ("Image 1: product, Image 2: style").
|
||||
- Describe how to combine them ("apply Image 2's style to Image 1").
|
||||
- For compositing, specify what moves where and what must remain unchanged.
|
||||
|
||||
## Iterate deliberately
|
||||
- Start with a clean base prompt, then make small single-change edits.
|
||||
- Re-specify critical constraints when you iterate.
|
||||
|
||||
## Quality vs latency
|
||||
- For latency-sensitive runs, start at `quality=low` and only raise it if needed.
|
||||
- Use `quality=high` for text-heavy or detail-critical images.
|
||||
- For strict edits (identity preservation, layout lock), consider `input_fidelity=high`.
|
||||
|
||||
## Use-case tips
|
||||
Generate:
|
||||
- photorealistic-natural: Prompt as if a real photo is captured in the moment; use photography language (lens, lighting, framing); call for real texture (pores, wrinkles, fabric wear, imperfections); avoid studio polish or staging; use `quality=high` when detail matters.
|
||||
- product-mockup: Describe the product/packaging and materials; ensure clean silhouette and label clarity; if in-image text is needed, require verbatim rendering and specify typography.
|
||||
- ui-mockup: Describe a real product; focus on layout, hierarchy, and common UI elements; avoid concept-art language so it looks shippable.
|
||||
- infographic-diagram: Define the audience and layout flow; label parts explicitly; require verbatim text; use `quality=high`.
|
||||
- logo-brand: Keep it simple and scalable; ask for a strong silhouette and balanced negative space; avoid gradients and fine detail.
|
||||
- illustration-story: Define panels or scene beats; keep each action concrete; for continuity, restate character traits and outfit each time.
|
||||
- stylized-concept: Specify style cues, material finish, and rendering approach (3D, painterly, clay); add a short "Avoid" line to prevent tacky effects.
|
||||
- historical-scene: State the location/date and required period accuracy; constrain clothing, props, and environment to match the era.
|
||||
|
||||
Edit:
|
||||
- text-localization: Change only the text; preserve layout, typography, spacing, and hierarchy; no extra words or reflow unless needed.
|
||||
- identity-preserve: Lock identity (face, body, pose, hair, expression); change only the specified elements; match lighting and shadows; use `input_fidelity=high` if likeness drifts.
|
||||
- precise-object-edit: Specify exactly what to remove/replace; preserve surrounding texture and lighting; keep everything else unchanged.
|
||||
- lighting-weather: Change only environmental conditions (light, shadows, atmosphere, precipitation); keep geometry, framing, and subject identity.
|
||||
- background-extraction: Request transparent background; crisp silhouette; no halos; preserve label text exactly; optionally add a subtle contact shadow.
|
||||
- style-transfer: Specify style cues to preserve (palette, texture, brushwork) and what must change; add "no extra elements" to prevent drift.
|
||||
- compositing: Reference inputs by index; specify what moves where; match lighting, perspective, and scale; keep background and framing unchanged.
|
||||
- sketch-to-render: Preserve layout, proportions, and perspective; add plausible materials, lighting, and environment; "do not add new elements or text."
|
||||
|
||||
## Where to find copy/paste recipes
|
||||
For copy/paste prompt specs (examples only), see `references/sample-prompts.md`. This file focuses on principles, structure, and iteration patterns.
|
||||
384
.agents/skills/imagegen/references/sample-prompts.md
Normal file
384
.agents/skills/imagegen/references/sample-prompts.md
Normal file
@@ -0,0 +1,384 @@
|
||||
# Sample prompts (copy/paste)
|
||||
|
||||
Use these as starting points (recipes only). Keep user-provided requirements; do not invent new creative elements.
|
||||
|
||||
For prompting principles (structure, invariants, iteration), see `references/prompting.md`.
|
||||
|
||||
## Generate
|
||||
|
||||
### photorealistic-natural
|
||||
```
|
||||
Use case: photorealistic-natural
|
||||
Primary request: candid photo of an elderly sailor on a small fishing boat adjusting a net
|
||||
Scene/background: coastal water with soft haze
|
||||
Subject: weathered skin with wrinkles and sun texture; a calm dog on deck nearby
|
||||
Style/medium: photorealistic candid photo
|
||||
Composition/framing: medium close-up, eye-level, 50mm lens
|
||||
Lighting/mood: soft coastal daylight, shallow depth of field, subtle film grain
|
||||
Materials/textures: real skin texture, worn fabric, salt-worn wood
|
||||
Constraints: natural color balance; no heavy retouching; no glamorization; no watermark
|
||||
Avoid: studio polish; staged look
|
||||
Quality: high
|
||||
```
|
||||
|
||||
### product-mockup
|
||||
```
|
||||
Use case: product-mockup
|
||||
Primary request: premium product photo of a matte black shampoo bottle with a minimal label
|
||||
Scene/background: clean studio gradient from light gray to white
|
||||
Subject: single bottle centered with subtle reflection
|
||||
Style/medium: premium product photography
|
||||
Composition/framing: centered, slight three-quarter angle, generous padding
|
||||
Lighting/mood: softbox lighting, clean highlights, controlled shadows
|
||||
Materials/textures: matte plastic, crisp label printing
|
||||
Constraints: no logos or trademarks; no watermark
|
||||
Quality: high
|
||||
```
|
||||
|
||||
### ui-mockup
|
||||
```
|
||||
Use case: ui-mockup
|
||||
Primary request: mobile app UI for a local farmers market with vendors and specials
|
||||
Scene/background: clean white background with subtle natural accents
|
||||
Subject: header, vendor list with small photos, "Today's specials" section, location and hours
|
||||
Style/medium: realistic product UI, not concept art
|
||||
Composition/framing: iPhone frame, balanced spacing and hierarchy
|
||||
Constraints: practical layout, clear typography, no logos or trademarks, no watermark
|
||||
```
|
||||
|
||||
### infographic-diagram
|
||||
```
|
||||
Use case: infographic-diagram
|
||||
Primary request: detailed infographic of an automatic coffee machine flow
|
||||
Scene/background: clean, light neutral background
|
||||
Subject: bean hopper -> grinder -> brew group -> boiler -> water tank -> drip tray
|
||||
Style/medium: clean vector-like infographic with clear callouts and arrows
|
||||
Composition/framing: vertical poster layout, top-to-bottom flow
|
||||
Text (verbatim): "Bean Hopper", "Grinder", "Brew Group", "Boiler", "Water Tank", "Drip Tray"
|
||||
Constraints: clear labels, strong contrast, no logos or trademarks, no watermark
|
||||
Quality: high
|
||||
```
|
||||
|
||||
### logo-brand
|
||||
```
|
||||
Use case: logo-brand
|
||||
Primary request: original logo for "Field & Flour", a local bakery
|
||||
Style/medium: vector logo mark; flat colors; minimal
|
||||
Composition/framing: single centered logo on plain background with padding
|
||||
Constraints: strong silhouette, balanced negative space; original design only; no gradients unless essential; no trademarks; no watermark
|
||||
```
|
||||
|
||||
### illustration-story
|
||||
```
|
||||
Use case: illustration-story
|
||||
Primary request: 4-panel comic about a pet left alone at home
|
||||
Scene/background: cozy living room across panels
|
||||
Subject: pet reacting to the owner leaving, then relaxing, then returning to a composed pose
|
||||
Style/medium: comic illustration with clear panels
|
||||
Composition/framing: 4 equal-sized vertical panels, readable actions per panel
|
||||
Constraints: no text; no logos or trademarks; no watermark
|
||||
```
|
||||
|
||||
### stylized-concept
|
||||
```
|
||||
Use case: stylized-concept
|
||||
Primary request: cavernous hangar interior with tall support beams and drifting fog
|
||||
Scene/background: industrial hangar interior, deep scale, light haze
|
||||
Subject: compact shuttle, parked center-left, bay door open
|
||||
Style/medium: cinematic concept art, industrial realism
|
||||
Composition/framing: wide-angle, low-angle, cinematic framing
|
||||
Lighting/mood: volumetric light rays cutting through fog
|
||||
Constraints: no logos or trademarks; no watermark
|
||||
```
|
||||
|
||||
### historical-scene
|
||||
```
|
||||
Use case: historical-scene
|
||||
Primary request: outdoor crowd scene in Bethel, New York on August 16, 1969
|
||||
Scene/background: open field, temporary stages, period-accurate tents and signage
|
||||
Subject: crowd in period-accurate clothing, authentic staging and environment
|
||||
Style/medium: photorealistic photo
|
||||
Composition/framing: wide shot, eye-level
|
||||
Constraints: period-accurate details; no modern objects; no logos or trademarks; no watermark
|
||||
```
|
||||
|
||||
## Asset type templates (taxonomy-aligned)
|
||||
|
||||
### Website assets template
|
||||
```
|
||||
Use case: <photorealistic-natural|stylized-concept|product-mockup|infographic-diagram|ui-mockup>
|
||||
Asset type: <hero image / section illustration / blog header>
|
||||
Primary request: <short description>
|
||||
Scene/background: <environment or abstract background>
|
||||
Subject: <main subject>
|
||||
Style/medium: <photo/illustration/3D>
|
||||
Composition/framing: <wide/centered; specify negative space side>
|
||||
Lighting/mood: <soft/bright/neutral>
|
||||
Color palette: <brand colors or neutral>
|
||||
Constraints: <no text; no logos; no watermark; leave space for UI>
|
||||
```
|
||||
|
||||
### Website assets example: minimal hero background
|
||||
```
|
||||
Use case: stylized-concept
|
||||
Asset type: landing page hero background
|
||||
Primary request: minimal abstract background with a soft gradient and subtle texture (calm, modern)
|
||||
Style/medium: matte illustration / soft-rendered abstract background (not glossy 3D)
|
||||
Composition/framing: wide composition; large negative space on the right for headline
|
||||
Lighting/mood: gentle studio glow
|
||||
Color palette: cool neutrals with a restrained blue accent
|
||||
Constraints: no text; no logos; no watermark
|
||||
```
|
||||
|
||||
### Website assets example: feature section illustration
|
||||
```
|
||||
Use case: stylized-concept
|
||||
Asset type: feature section illustration
|
||||
Primary request: simple abstract shapes suggesting connection and flow (tasteful, minimal)
|
||||
Scene/background: subtle light-gray backdrop with faint texture
|
||||
Style/medium: flat illustration; soft shadows; restrained contrast
|
||||
Composition/framing: centered cluster; open margins for UI
|
||||
Color palette: muted teal and slate, low contrast accents
|
||||
Constraints: no text; no logos; no watermark
|
||||
```
|
||||
|
||||
### Website assets example: blog header image
|
||||
```
|
||||
Use case: photorealistic-natural
|
||||
Asset type: blog header image
|
||||
Primary request: overhead desk scene with notebook, pen, and coffee cup
|
||||
Scene/background: warm wooden tabletop
|
||||
Style/medium: photorealistic photo
|
||||
Composition/framing: wide crop; subject placed left; right side left empty
|
||||
Lighting/mood: soft morning light
|
||||
Constraints: no text; no logos; no watermark
|
||||
```
|
||||
|
||||
### Game assets template
|
||||
```
|
||||
Use case: stylized-concept
|
||||
Asset type: <game environment concept art / game character concept / game UI icon / tileable game texture>
|
||||
Primary request: <biome/scene/character/icon/material>
|
||||
Scene/background: <location + set dressing> (if applicable)
|
||||
Subject: <main focal element(s)>
|
||||
Style/medium: <realistic/stylized>; <concept art / character render / UI icon / texture>
|
||||
Composition/framing: <wide/establishing/top-down>; <camera angle>; <focal point placement>
|
||||
Lighting/mood: <time of day>; <mood>; <volumetric/fog/etc>
|
||||
Constraints: no logos or trademarks; no watermark
|
||||
```
|
||||
|
||||
### Game assets example: environment concept art
|
||||
```
|
||||
Use case: stylized-concept
|
||||
Asset type: game environment concept art
|
||||
Primary request: cavernous hangar interior with tall support beams and drifting fog
|
||||
Scene/background: industrial hangar interior, deep scale, light haze
|
||||
Subject: compact shuttle, parked center-left, bay door open
|
||||
Foreground: painted floor markings; cables; tool carts along edges
|
||||
Style/medium: cinematic concept art, industrial realism
|
||||
Composition/framing: wide-angle, low-angle, cinematic framing
|
||||
Lighting/mood: volumetric light rays cutting through fog
|
||||
Constraints: no logos or trademarks; no watermark
|
||||
```
|
||||
|
||||
### Game assets example: character concept
|
||||
```
|
||||
Use case: stylized-concept
|
||||
Asset type: game character concept
|
||||
Primary request: desert scout character with layered travel gear
|
||||
Silhouette: long coat with hood, wide boots, satchel
|
||||
Outfit/gear: dusty canvas, leather straps, brass buckles
|
||||
Face/hair: windworn face, short cropped hair
|
||||
Style/medium: character render; stylized realism
|
||||
Pose: neutral hero pose
|
||||
Background: simple neutral backdrop
|
||||
Constraints: no logos or trademarks; no watermark
|
||||
```
|
||||
|
||||
### Game assets example: UI icon
|
||||
```
|
||||
Use case: stylized-concept
|
||||
Asset type: game UI icon
|
||||
Primary request: round shield icon with a subtle rune pattern
|
||||
Style/medium: painted game UI icon
|
||||
Composition/framing: centered icon; generous padding; clear silhouette
|
||||
Background: transparent
|
||||
Lighting/mood: subtle highlights; crisp edges
|
||||
Constraints: no text; no logos or trademarks; no watermark
|
||||
```
|
||||
|
||||
### Game assets example: tileable texture
|
||||
```
|
||||
Use case: stylized-concept
|
||||
Asset type: tileable game texture
|
||||
Primary request: worn sandstone blocks
|
||||
Style/medium: seamless tileable texture; PBR-ish look
|
||||
Scale: medium tiling
|
||||
Lighting: neutral / flat lighting
|
||||
Constraints: seamless edges; no obvious focal elements; no text; no logos or trademarks; no watermark
|
||||
```
|
||||
|
||||
### Wireframe template
|
||||
```
|
||||
Use case: ui-mockup
|
||||
Asset type: website wireframe
|
||||
Primary request: <page or flow to sketch>
|
||||
Fidelity: low-fi grayscale wireframe; hand-drawn feel; simple boxes
|
||||
Layout: <sections in order; grid/columns>
|
||||
Annotations: <labels for key blocks>
|
||||
Resolution/orientation: <landscape or portrait to match expected device>
|
||||
Constraints: no color; no logos; no real photos; no watermark
|
||||
```
|
||||
|
||||
### Wireframe example: homepage (desktop)
|
||||
```
|
||||
Use case: ui-mockup
|
||||
Asset type: website wireframe
|
||||
Primary request: SaaS homepage layout with clear hierarchy
|
||||
Fidelity: low-fi grayscale wireframe; hand-drawn feel; simple boxes
|
||||
Layout: top nav; hero with headline and CTA; three feature cards; testimonial strip; pricing preview; footer
|
||||
Annotations: label each block ("Nav", "Hero", "CTA", "Feature", "Testimonial", "Pricing", "Footer")
|
||||
Resolution/orientation: landscape (wide) for desktop
|
||||
Constraints: no color; no logos; no real photos; no watermark
|
||||
```
|
||||
|
||||
### Wireframe example: pricing page
|
||||
```
|
||||
Use case: ui-mockup
|
||||
Asset type: website wireframe
|
||||
Primary request: pricing page layout with comparison table
|
||||
Fidelity: low-fi grayscale wireframe; sketchy lines; simple boxes
|
||||
Layout: header; plan toggle; 3 pricing cards; comparison table; FAQ accordion; footer
|
||||
Annotations: label key areas ("Toggle", "Plan Card", "Table", "FAQ")
|
||||
Resolution/orientation: landscape for desktop or portrait for tablet
|
||||
Constraints: no color; no logos; no real photos; no watermark
|
||||
```
|
||||
|
||||
### Wireframe example: mobile onboarding flow
|
||||
```
|
||||
Use case: ui-mockup
|
||||
Asset type: website wireframe
|
||||
Primary request: three-screen mobile onboarding flow
|
||||
Fidelity: low-fi grayscale wireframe; hand-drawn feel; simple boxes
|
||||
Layout: screen 1 (logo placeholder, headline, illustration placeholder, CTA); screen 2 (feature bullets); screen 3 (form fields + CTA)
|
||||
Annotations: label each block and screen number
|
||||
Resolution/orientation: portrait (tall) for mobile
|
||||
Constraints: no color; no logos; no real photos; no watermark
|
||||
```
|
||||
|
||||
### Logo template
|
||||
```
|
||||
Use case: logo-brand
|
||||
Asset type: logo concept
|
||||
Primary request: <brand idea or symbol concept>
|
||||
Style/medium: vector logo mark; flat colors; minimal
|
||||
Composition/framing: centered mark; clear silhouette; generous margin
|
||||
Color palette: <1-2 colors; high contrast>
|
||||
Text (verbatim): "<exact name>" (only if needed)
|
||||
Constraints: no gradients; no mockups; no 3D; no watermark
|
||||
```
|
||||
|
||||
### Logo example: abstract symbol mark
|
||||
```
|
||||
Use case: logo-brand
|
||||
Asset type: logo concept
|
||||
Primary request: geometric leaf symbol suggesting sustainability and growth
|
||||
Style/medium: vector logo mark; flat colors; minimal
|
||||
Composition/framing: centered mark; clear silhouette
|
||||
Color palette: deep green and off-white
|
||||
Constraints: no text; no gradients; no mockups; no 3D; no watermark
|
||||
```
|
||||
|
||||
### Logo example: monogram mark
|
||||
```
|
||||
Use case: logo-brand
|
||||
Asset type: logo concept
|
||||
Primary request: interlocking monogram of the letters "AV"
|
||||
Style/medium: vector logo mark; flat colors; minimal
|
||||
Composition/framing: centered mark; balanced spacing
|
||||
Color palette: black on white
|
||||
Constraints: no gradients; no mockups; no 3D; no watermark
|
||||
```
|
||||
|
||||
### Logo example: wordmark
|
||||
```
|
||||
Use case: logo-brand
|
||||
Asset type: logo concept
|
||||
Primary request: clean wordmark for a modern studio
|
||||
Style/medium: vector wordmark; flat colors; minimal
|
||||
Text (verbatim): "Studio North"
|
||||
Composition/framing: centered text; even letter spacing
|
||||
Color palette: charcoal on white
|
||||
Constraints: no gradients; no mockups; no 3D; no watermark
|
||||
```
|
||||
|
||||
## Edit
|
||||
|
||||
### text-localization
|
||||
```
|
||||
Use case: text-localization
|
||||
Input images: Image 1: original infographic
|
||||
Primary request: translate all in-image text to Spanish
|
||||
Constraints: change only the text; preserve layout, typography, spacing, and hierarchy; no extra words; do not alter logos or imagery
|
||||
```
|
||||
|
||||
### identity-preserve
|
||||
```
|
||||
Use case: identity-preserve
|
||||
Input images: Image 1: person photo; Image 2..N: clothing items
|
||||
Primary request: replace only the clothing with the provided garments
|
||||
Constraints: preserve face, body shape, pose, hair, expression, and identity; match lighting and shadows; keep background unchanged; no accessories or text
|
||||
Input fidelity (edits): high
|
||||
```
|
||||
|
||||
### precise-object-edit
|
||||
```
|
||||
Use case: precise-object-edit
|
||||
Input images: Image 1: room photo
|
||||
Primary request: replace ONLY the white chairs with wooden chairs
|
||||
Constraints: preserve camera angle, room lighting, floor shadows, and surrounding objects; keep all other aspects unchanged
|
||||
```
|
||||
|
||||
### lighting-weather
|
||||
```
|
||||
Use case: lighting-weather
|
||||
Input images: Image 1: original photo
|
||||
Primary request: make it look like a winter evening with gentle snowfall
|
||||
Constraints: preserve subject identity, geometry, camera angle, and composition; change only lighting, atmosphere, and weather
|
||||
Quality: high
|
||||
```
|
||||
|
||||
### background-extraction
|
||||
```
|
||||
Use case: background-extraction
|
||||
Input images: Image 1: product photo
|
||||
Primary request: extract the product on a transparent background
|
||||
Output: transparent background (RGBA PNG)
|
||||
Constraints: crisp silhouette, no halos/fringing; preserve label text exactly; no restyling
|
||||
```
|
||||
|
||||
### style-transfer
|
||||
```
|
||||
Use case: style-transfer
|
||||
Input images: Image 1: style reference
|
||||
Primary request: apply Image 1's visual style to a man riding a motorcycle on a white background
|
||||
Constraints: preserve palette, texture, and brushwork; no extra elements; plain white background
|
||||
```
|
||||
|
||||
### compositing
|
||||
```
|
||||
Use case: compositing
|
||||
Input images: Image 1: base scene; Image 2: subject to insert
|
||||
Primary request: place the subject from Image 2 next to the person in Image 1
|
||||
Constraints: match lighting, perspective, and scale; keep background and framing unchanged; no extra elements
|
||||
Input fidelity (edits): high
|
||||
```
|
||||
|
||||
### sketch-to-render
|
||||
```
|
||||
Use case: sketch-to-render
|
||||
Input images: Image 1: drawing
|
||||
Primary request: turn the drawing into a photorealistic image
|
||||
Constraints: preserve layout, proportions, and perspective; choose realistic materials and lighting; do not add new elements or text
|
||||
Quality: high
|
||||
```
|
||||
Reference in New Issue
Block a user