Wrong Number of Frames in SkyreelsVideoPipeline #90
xh-liu-tech wants to merge 1 commit into SkyworkAI:main from
|
What's the rationale here? Latent frames are not the same as actual frames in the video. IIRC each latent frame represents 4 actual video frames, which checks out with 24 latent frames -> 96 video frames. Can you clarify what the issue was in the test and what this change resolved? |
It is because we should pass the original frame count to |
|
Interesting, thanks for elaborating. I will test it too. I ended up looking at this same block of code this weekend while debugging other issues, like multi-GPU runs hitting division errors at 720x720 or the frame count being soft-capped at 192 (after that it loops). I wonder if it's related. |
|
It seems to be shuffling problems from one area to another. A 49-frame test at 1920x1088 (previously working) now generates this exception. That said, before the code change it already had a different issue: the output turned into text2video after one second. Maybe this is exposing a deeper problem in other areas of the code. |
|
I just tried this setting on a single RTX 4090, and it seems to be working well.
python3 video_generate.py \
    --model_id "Skywork/SkyReels-V1-Hunyuan-I2V" \
    --task_type i2v \
    --guidance_scale 6.0 \
    --height 720 \
    --width 720 \
    --num_frames 289 \
    --prompt "FPS-24, An old lady is talking happily" \
    --embedded_guidance_scale 1.0 \
    --quant \
    --offload \
    --high_cpu_memory \
    --image "image.png" \
    --parameters_level \
    --sequence_batch
Could you share your running parameters so that I can test it from my side? |
|
Are you actually able to get 289 frames without sudden static noise on frame 193? I've been trying to debug that all week. Amazing if you did. I think my error is specific to multi-gpu. I am testing on a rented service with 8x H100s |
I checked the generated video and found that there was indeed static noise at 8s (maybe frame 193).
I'm sorry that I couldn't test it on multi-GPU. |
|
I did - it's just that one line changed to num_frames instead of the latent count, right? It actually says it is 13, but the script is expecting 49. Unfortunate to hear about the 8 sec mark - yeah, that's frame 193. Still trying to figure that one out. |
I think this means the temporal dimension of |
|
Yeah, this is messy - I spent much of the week going down a rabbit hole with latent model input and latent model input mismatches. |

It seems that num_latent_frames was incorrectly used in SkyreelsVideoPipeline because the number of latent frames is calculated inside prepare_latents. The current implementation will generate only 25 frames instead of 97 frames.
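
The frame counts quoted in this thread (97 -> 25, 49 -> 13) are consistent with a temporal compression ratio of 4 in which the first frame is encoded on its own. The sketch below is only an illustration of that arithmetic, not the repository's code: TEMPORAL_COMPRESSION, to_latent_frames and to_video_frames are hypothetical names, and the ratio of 4 is an assumption taken from the numbers above. It shows how converting an already-converted count a second time would shrink a 97-frame request to 25 output frames.

```python
# Assumption: temporal compression ratio of 4, with the first frame encoded
# separately. This matches the 97 -> 25 and 49 -> 13 pairs in the thread,
# but it is not read from the actual model config.
TEMPORAL_COMPRESSION = 4


def to_latent_frames(num_frames: int) -> int:
    """Hypothetical helper: video frame count -> latent frame count."""
    return (num_frames - 1) // TEMPORAL_COMPRESSION + 1


def to_video_frames(num_latent_frames: int) -> int:
    """Hypothetical helper: latent frame count -> decoded video frame count."""
    return (num_latent_frames - 1) * TEMPORAL_COMPRESSION + 1


requested = 97

# Converting once gives the intended latent length.
once = to_latent_frames(requested)
print(once, to_video_frames(once))    # 25 97

# Converting the already-converted count again (the suspected bug:
# a latent count passed where a video frame count is expected).
twice = to_latent_frames(once)
print(twice, to_video_frames(twice))  # 7 25
```

If this reading is right, passing the original num_frames and letting the conversion happen exactly once would restore the expected 97 frames.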