V2 — Cross-section / topology
2D-view → 3D inference · cannot be shortcut by code · geometric
Question-answering task families generated by the BabyVision-v2 deterministic pipeline. Each family targets a distinct reasoning skill; every task ships with executable ground truth (no LLM-as-judge).
2D-view → 3D inference · cannot be shortcut by code · geometric
pixel-level GT · SSIM / LPIPS · defends against LLM-as-judge
rotate · slice · deform · geometry-grounded GT
multi-tool trace · scored on answer + trace fidelity
force control · MPM fracture · MPM granular · FEM cloth · P1/P2 — discover by acting on objects
M3D · IXI brain · distance / angle / topology / volume
Live task-instance data lands when QA-Gen artifacts begin shipping. See task-families.md for the canonical definitions.