Pure T2I · 7-panel production-grade character bible (3 face profiles on top · 4 body profiles on bottom) · DSLR aesthetic · goes straight into IP refs/ · suited to R2V multi-ref + character lock
First validated in this project 2026-05-01 · image part of the insmind_com 2-stage pipeline. Layout matched exactly · 7 panels = 3 face (front/45°/side) in the top row + 4 body (front/45°/side/back) in the bottom row. Outfit lock held across all 7 panels (olive bomber + white tee + dark jeans + white shoes). Identity: 7/7 panels show the same person (defaulted to an Asian male · because the prompt did not specify gender). DSLR aesthetic delivered (soft light · gradient white background · photoreal). Fault-tolerant: the 'white tea' typo still rendered a white tee. KEY INSIGHT: pure T2I + text anchors can already produce a production-grade charsheet · no ref input needed. LIMITATION: gender unspecified → defaults to male · a multi-stage pipeline must lock the character anchor up front in stage 1 (e.g., add 'female named Sarah' when the stage 2 video character is female). Sister recipe · multiref-5frame-charsheet (5 panels · validated_2plus on the kitsunebi anime aesthetic).
date: '2026-05-01T03:30:22+08:00'
result: pass
prompt:
text: >-
DSLR Character Sheet on a white back drop wearing a white tea and a bomber jacket. With three
face profiles (front, 45° and side) and four full body portrait profiles (front, 45°, side and
back).
refs: []
provider:
id: gpt_image_2
relay: apimart
config:
aspect_ratio: '4:3'
size: '4:3'
'n': 1
output:
path: ./dslr_7pose_charsheet_v1.png
bytes: 915527
wall_seconds: 44.6
task_id: task_01KQFXWGHGSBBHXJZ48P2F9MME
script: experiments/dslr_charsheet_test/test_v1.py
cost_yuan: 0.5
notes: >-
Stage 1 of insmind_com 2-stage pipeline (image → video). DSLR 7-pose character sheet · pure T2I ·
no refs. Tests: (a) does gpt-image-2 honor 'three face profiles + four body profiles' = 7 panel
layout · (b) does 'DSLR' aesthetic anchor pull toward photoreal lighting · (c) outfit lock (white
tee + bomber jacket) across all 7 panels · (d) implicit identity consistency (no ref · model
invents · should auto-lock). Verbatim contains typo 'white tea' (intended: 'white tee') · kept
verbatim per discipline. Compare against existing multiref-5frame-charsheet (5 panel ·
validated_2plus on kitsunebi).
recipes/image_gen/gpt_image_2/prompts/.
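
The LIMITATION recorded above (gender unspecified → defaults to male) suggests building the stage 1 prompt from an explicit character anchor. The following is a minimal sketch of that idea, not the code in `test_v1.py`: `build_charsheet_prompt`, `FACE_VIEWS`, `BODY_VIEWS`, and the `subject` and `outfit` parameters are hypothetical names introduced for illustration. It also writes 'white tee' and 'backdrop' in their intended spellings, so its output differs from the verbatim prompt recorded in this entry.

```python
# Hypothetical helper: assembles a 7-panel charsheet prompt with an
# optional text anchor for the character (assumption, not the recorded script).
FACE_VIEWS = ("front", "45°", "side")
BODY_VIEWS = ("front", "45°", "side", "back")
_COUNT_WORDS = {3: "three", 4: "four"}


def _join_views(views):
    """Join view names as 'a, b and c'."""
    return ", ".join(views[:-1]) + " and " + views[-1]


def build_charsheet_prompt(subject=None, outfit="a white tee and a bomber jacket"):
    """Return the charsheet prompt; `subject` locks gender/name when given."""
    who = f" of {subject}" if subject else ""
    return (
        f"DSLR Character Sheet{who} on a white backdrop wearing {outfit}. "
        f"With {_COUNT_WORDS[len(FACE_VIEWS)]} face profiles "
        f"({_join_views(FACE_VIEWS)}) and "
        f"{_COUNT_WORDS[len(BODY_VIEWS)]} full body portrait profiles "
        f"({_join_views(BODY_VIEWS)})."
    )


if __name__ == "__main__":
    # Lock the anchor in stage 1 so the stage 2 video character matches.
    print(build_charsheet_prompt("a female character named Sarah"))
```

Leaving `subject` unset yields the unanchored wording; whether a given anchor phrase actually steers gpt_image_2 has not been validated in this entry and would need its own run.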