7-Pose DSLR Character Sheet

Pure T2I · 7-panel production-grade character bible (3 face profiles on top · 4 body profiles below) · DSLR aesthetic · drops straight into IP refs/ · suited to R2V multi-ref + character lock

First validated in-project on 2026-05-01 · image stage of the insmind_com 2-stage pipeline. Layout matched exactly · 7 panels = 3 face (front / 45° / side) on the top row + 4 body (front / 45° / side / back) on the bottom row. Outfit lock held perfectly across all 7 panels (olive bomber + white tee + dark jeans + white shoes). Identity 7/7 same person (defaulted to an Asian male because the prompt did not specify gender). DSLR aesthetic delivered (soft light · gradient white backdrop · photoreal). Fault-tolerant: the 'white tea' typo still rendered a white tee.

KEY INSIGHT: pure T2I + text anchors can already produce a production-grade charsheet · no ref input needed.

LIMITATION: gender unspecified → defaults to male · a multi-stage pipeline must lock the character anchor in stage 1 (e.g., add 'female named Sarah' when the stage 2 video character is a woman).

Sister · multiref-5frame-charsheet (5 panel · validated_2plus on kitsunebi anime aesthetic).
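The stage-1 anchor lock described under LIMITATION can be sketched as a small prompt builder. This is only an illustration: `build_charsheet_prompt` and its `anchor` parameter are hypothetical names, and the template is a cleaned-up paraphrase, not the verbatim prompt used in the recorded run.

```python
from typing import Optional


def build_charsheet_prompt(outfit: str, anchor: Optional[str] = None) -> str:
    """Compose a 7-panel DSLR character-sheet prompt (hypothetical helper).

    anchor is an optional stage-1 character lock, e.g. "a female named Sarah".
    Without it the model chooses the subject itself; the recorded run
    defaulted to a male character.
    """
    subject = f" of {anchor}" if anchor else ""
    return (
        f"DSLR character sheet{subject} on a white backdrop, wearing {outfit}. "
        "Three face profiles (front, 45° and side) and four full-body "
        "profiles (front, 45°, side and back)."
    )


outfit = "a white tee and a bomber jacket"
unlocked = build_charsheet_prompt(outfit)
locked = build_charsheet_prompt(outfit, anchor="a female named Sarah")
```

Passing the same anchor text to the stage 2 video prompt keeps the character description consistent across both stages. Whether that is enough to hold identity across stages has not been tested here.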

observed_once · ¥0.5 · aspect 4:3 · providers · gpt_image_2 · id · multiref-7pose-dslr-charsheet
sample · experiments/dslr_charsheet_test/dslr_7pose_charsheet_v1.png
Run Record
2026-05-01T03:30:22+08:00
✓ pass · ¥0.5 · 44.6s wall · 894KB
provider · gpt_image_2 (apimart) · aspect_ratio=4:3 · size=4:3 · n=1
prompt · inline (no library entry · 195 chars)
refs · none (pure T2I)
task_id · task_01KQFXWGHGSBBHXJZ48P2F9MME
script · experiments/dslr_charsheet_test/test_v1.py
Full sidecar YAML
date: '2026-05-01T03:30:22+08:00'
result: pass
prompt:
  text: >-
    DSLR Character Sheet on a white back drop wearing a white tea and a bomber jacket. With three
    face profiles (front, 45° and side) and four full body portrait profiles (front, 45°, side and
    back).
refs: []
provider:
  id: gpt_image_2
  relay: apimart
  config:
    aspect_ratio: '4:3'
    size: '4:3'
    'n': 1
output:
  path: ./dslr_7pose_charsheet_v1.png
  bytes: 915527
  wall_seconds: 44.6
  task_id: task_01KQFXWGHGSBBHXJZ48P2F9MME
script: experiments/dslr_charsheet_test/test_v1.py
cost_yuan: 0.5
notes: >-
  Stage 1 of insmind_com 2-stage pipeline (image → video). DSLR 7-pose character sheet · pure T2I ·
  no refs. Tests: (a) does gpt-image-2 honor 'three face profiles + four body profiles' = 7 panel
  layout · (b) does 'DSLR' aesthetic anchor pull toward photoreal lighting · (c) outfit lock (white
  tee + bomber jacket) across all 7 panels · (d) implicit identity consistency (no ref · model
  invents · should auto-lock). Verbatim contains typo 'white tea' (intended: 'white tee') · kept
  verbatim per discipline. Compare against existing multiref-5frame-charsheet (5 panel ·
  validated_2plus on kitsunebi).
prompt not yet extracted to library
Verbatim prompt lives in the experiment script. Pending migration to recipes/image_gen/gpt_image_2/prompts/.
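As a cross-check between the sidecar and the Run Record summary line, the sketch below derives the summary from the sidecar fields. The `summary_line` helper is hypothetical, and the size rule (bytes floored to units of 1024) is an assumption; it does match the recorded values, since 915527 bytes comes out as 894KB.

```python
def summary_line(sidecar: dict) -> str:
    """Render a one-line run summary from sidecar fields (hypothetical helper)."""
    out = sidecar["output"]
    mark = "✓" if sidecar["result"] == "pass" else "✗"
    size_kb = out["bytes"] // 1024  # assumption: floor to 1024-byte units
    return (
        f"{mark} {sidecar['result']} · ¥{sidecar['cost_yuan']} · "
        f"{out['wall_seconds']}s wall · {size_kb}KB"
    )


# Values copied from the sidecar above; only the fields the summary needs.
sidecar = {
    "result": "pass",
    "cost_yuan": 0.5,
    "output": {"bytes": 915527, "wall_seconds": 44.6},
}
line = summary_line(sidecar)
```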

Cross-Reference

methodology · — no dedicated recipe doc · referenced in memory only —
axis · §Multi-Ref Modes · image_urls edit mode · ref-driven generation

Sister capabilities · same axis

Basic T2I · production_repeatable · Pure text-to-image · no ref · the simplest entry point
2-Ref Scene + Character Composition · observed_once · 1 scene ref + 1 character ref → character placed into the scene · scene reproduced 1:1 + character identity locked
5-Frame Charsheet Generation · validated_2plus · Single ref → multi-angle / multi-expression / multi-outfit charsheet · a substitute for a turnaround
Multi-Ref Real-Minor IP · validated_2plus · Real child photo ref + multi-ref edit mode · passes the OpenAI minor filter via the ApiMart relay
Single-Image Landing-Page Composite · observed_external · 1 image containing 4 zones (nav / hero / cards / footer) · commercial UI ad pre-viz