7-Pose DSLR Character Sheet

Pure T2I · 7-panel 工业级 character bible (3 face profiles top · 4 body profiles bottom) · DSLR aesthetic · 直接进 IP refs/ · 适合 R2V multi-ref + character lock

项目内首验证 2026-05-01 · insmind_com 2-stage pipeline image part. Layout 严格命中 · 7 panel = 3 face (front/45°/side) 上行 + 4 body (front/45°/side/back) 下行. Outfit lock 完美跨 7 panel (olive bomber + white tee + 深 jeans + 白鞋). Identity 7/7 同人 (Asian male default · 因 prompt 没指定性别). DSLR aesthetic 兑现 (软光 · gradient 白底 · photoreal). Fault-tolerant: 'white tea' typo 仍渲染 white tee. KEY INSIGHT: pure T2I + 文字 anchor 已能产出工业级 charsheet · 不需要 ref input. LIMITATION: 没指定性别 → default 男 · multi-stage pipeline 必须 stage 1 提前 lock 角色 anchor (i.e., 加 'female named Sarah' 当 stage 2 video 角色是女). Sister · multiref-5frame-charsheet (5 panel · validated_2plus on kitsunebi anime aesthetic).

observed_once · ¥0.5 · aspect 4:3 · providers · gpt_image_2 · id · multiref-7pose-dslr-charsheet

sample · experiments/dslr_charsheet_test/dslr_7pose_charsheet_v1.png

Run Record

2026-05-01T03:30:22+08:00

✓ pass¥0.544.6s wall· 894KB

provider · gpt_image_2 (apimart) · aspect_ratio=4:3 · size=4:3 · n=1

prompt · inline (no library entry · 195 chars)

refs · none (pure T2I)

task_id · task_01KQFXWGHGSBBHXJZ48P2F9MME

script · experiments/dslr_charsheet_test/test_v1.py

view full sidecar yaml

date: '2026-05-01T03:30:22+08:00'
result: pass
prompt:
  text: >-
    DSLR Character Sheet on a white back drop wearing a white tea and a bomber jacket. With three
    face profiles (front, 45° and side) and four full body portrait profiles (front, 45°, side and
    back).
refs: []
provider:
  id: gpt_image_2
  relay: apimart
  config:
    aspect_ratio: '4:3'
    size: '4:3'
    'n': 1
output:
  path: ./dslr_7pose_charsheet_v1.png
  bytes: 915527
  wall_seconds: 44.6
  task_id: task_01KQFXWGHGSBBHXJZ48P2F9MME
script: experiments/dslr_charsheet_test/test_v1.py
cost_yuan: 0.5
notes: >-
  Stage 1 of insmind_com 2-stage pipeline (image → video). DSLR 7-pose character sheet · pure T2I ·
  no refs. Tests: (a) does gpt-image-2 honor 'three face profiles + four body profiles' = 7 panel
  layout · (b) does 'DSLR' aesthetic anchor pull toward photoreal lighting · (c) outfit lock (white
  tee + bomber jacket) across all 7 panels · (d) implicit identity consistency (no ref · model
  invents · should auto-lock). Verbatim contains typo 'white tea' (intended: 'white tee') · kept
  verbatim per discipline. Compare against existing multiref-5frame-charsheet (5 panel ·
  validated_2plus on kitsunebi).

7-Pose DSLR Character Sheet

Cross-Reference

Sister capabilities · same axis