Need help #1385
Comments
Do you have a dedicated GPU? If you do, it will be much faster. Otherwise, try a smaller model, like this: https://huggingface.co/bartowski/L3-8B-Stheno-v3.2-GGUF/resolve/main/L3-8B-Stheno-v3.2-Q4_K_S.gguf?download=true
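For reference, a minimal sketch of fetching that quantized model with the huggingface_hub library; the repo ID and filename come from the link above, everything else (local caching behavior, printing the path) is just illustrative:

```python
# Sketch: download the suggested smaller GGUF with huggingface_hub.
# Repo ID and filename are taken from the link above.
from huggingface_hub import hf_hub_download

model_path = hf_hub_download(
    repo_id="bartowski/L3-8B-Stheno-v3.2-GGUF",
    filename="L3-8B-Stheno-v3.2-Q4_K_S.gguf",
)
print(model_path)  # local path to the downloaded .gguf file
```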
You might be able to run a 3B model at most. https://huggingface.co/models?search=3b%20gguf Try Qwen 2.5 3B or Llama 3.2 3B?
OK, you should definitely select Use CuBLAS; that card should support it. That should provide much faster speeds compared to CPU-only inference. Try running a 7B model partially offloaded (see the sketch below).
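A minimal sketch of what that launch could look like from the command line, assuming KoboldCpp's `--usecublas` and `--gpulayers` flags; the model filename and the layer count are placeholders to adjust for your available VRAM:

```python
# Sketch: launch KoboldCpp with CuBLAS and partial GPU offload.
# The model path and layer count (20) are placeholders, not recommendations.
import subprocess

subprocess.run([
    "python", "koboldcpp.py",
    "--model", "L3-8B-Stheno-v3.2-Q4_K_S.gguf",
    "--usecublas",            # enable CUDA/CuBLAS acceleration
    "--gpulayers", "20",      # offload this many layers to the GPU
])
```

Offloading more layers generally speeds up generation, as long as they still fit in the GPU's VRAM.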
Dell Precision 5540, 32 GB RAM.
Running an AI model is very laggy.
The model I use:
https://huggingface.co/allenai/OLMo-2-1124-13B-GGUF/blob/main/OLMo-2-1124-13B-Q5_K_M.gguf