I'm totally new to using AI, and recently bought an RTX 4060 as a secondary GPU for stuff like this #1142

Answered by martindevans
Deus-nsf asked this question in Q&A

llama.cpp can do partial offloading, where some of the model runs on the GPU and the rest runs on the CPU. If you've got 4-5 GB of VRAM, then a smallish model (8B or less) at 4-bit quantisation should fit, as long as you keep the context size reasonably small.
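For illustration, here is a minimal sketch of what that looks like with the llama-cpp-python bindings (one common way to drive llama.cpp; LLamaSharp exposes a similar setting as `GpuLayerCount`). The model path, layer count, and context size below are placeholder assumptions to adapt to your own setup:

```python
# Sketch only: partial offloading via llama-cpp-python.
# The model path and numbers are illustrative assumptions; tune n_gpu_layers
# so that VRAM usage stays within your card's limit.
from llama_cpp import Llama

llm = Llama(
    model_path="models/llama-3-8b-instruct.Q4_K_M.gguf",  # ~4-5 GB at 4-bit quantisation
    n_gpu_layers=20,   # offload this many layers to the GPU; the rest run on the CPU
    n_ctx=2048,        # keep the context small to limit the KV cache's memory use
)

out = llm("Explain partial offloading in one sentence.", max_tokens=64)
print(out["choices"][0]["text"])
```

If generation fails with an out-of-memory error, lower `n_gpu_layers`; if you still have VRAM headroom, raise it (or set it to -1 to offload every layer).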

Answer selected by Deus-nsf