Working inference on a RTX4070 #40

frutiemax92 · 2025-01-10T18:22:26Z

It is possible to run inference under 12GB VRAM with those modifications. It's not really fast as you need to swap the transformer to cpu, then text_encoder to gpu, etc. I also had to remove the torch.compile directives due to the precision being bfloat16 everywhere.

Working inference on a RTX4070

aed4a3e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Working inference on a RTX4070 #40

Working inference on a RTX4070 #40

frutiemax92 commented Jan 10, 2025

Working inference on a RTX4070 #40

Are you sure you want to change the base?

Working inference on a RTX4070 #40

Conversation

frutiemax92 commented Jan 10, 2025