Hi, training dataset prompt is like "Camera move right, pull out, then pan right. A whale breaching the ocean surface." which includes camera movement, but when performing VBench evaluation, the prompt would not include camera movement, just like "a drone flying over a snowy forest". So I do not understand why the model perform well in Vbench, because the model lose camera movement info and noise wrapper should not work
Hi, training dataset prompt is like "Camera move right, pull out, then pan right. A whale breaching the ocean surface." which includes camera movement, but when performing VBench evaluation, the prompt would not include camera movement, just like "a drone flying over a snowy forest". So I do not understand why the model perform well in Vbench, because the model lose camera movement info and noise wrapper should not work