We believe the benefits of AI are too great to miss, and the risks too serious to ignore. Whether we like it or not, AI is here to stay, but its current iterations reflect a failure to learn from the past. That’s why we built Lumo, a private AI assistant that works for you, not the other way around. With no logs kept and every chat encrypted, Lumo keeps your conversations confidential and your data fully under your control: never shared, sold, or stolen.
You can start using Lumo today for free, even if you don’t have a Proton Account. Just go to lumo.proton.me and type in a query.
You know what’s even more confidential? Running models locally on my machine without connecting to some third party’s servers.
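For anyone who wants to try it, here's a minimal sketch using the llama-cpp-python bindings (the model path and prompt are placeholders; any local GGUF file works):

```python
# Minimal local inference with llama-cpp-python; nothing leaves your machine.
from llama_cpp import Llama

llm = Llama(
    model_path="./models/llama-3-8b-instruct.Q4_K_M.gguf",  # placeholder path
    n_ctx=4096,        # context window
    n_gpu_layers=-1,   # offload all layers to the GPU, if your build has a GPU backend
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Why is local inference more private?"}]
)
print(out["choices"][0]["message"]["content"])
```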
Which is great, but you’re limited to smaller models with slower response times (provided you have a GPU, ofc)
Hey, I have AMD so I can only run CPU-based inference, and I’ll have you know that I can absolutely still run models with even slower response times!
You can run models on AMD GPUs though
Really?
When I was looking into Ollama, I could have sworn it was Nvidia or CPU only. Can you point me to the docs for making it work on AMD? Running Bazzite, if it matters.
Ollama only ships with some of llama.cpp’s backends, for unknown reasons.
https://github.com/ggml-org/llama.cpp?tab=readme-ov-file#supported-backends
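On AMD specifically, the Vulkan backend on that list is usually the least painful route. If you go through the llama-cpp-python bindings, you can probe whether your build can offload to a GPU at all; this is a sketch, and the install line in the comment is the pattern documented in that project's README:

```python
# Check whether this llama-cpp-python build supports GPU offload at all.
# For AMD, you'd typically build the bindings against Vulkan first, e.g.:
#   CMAKE_ARGS="-DGGML_VULKAN=on" pip install llama-cpp-python --force-reinstall --no-cache-dir
from llama_cpp import llama_supports_gpu_offload

print("GPU offload available:", llama_supports_gpu_offload())
```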
The only hard limits are your RAM and time.
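Rough math on the RAM side, assuming roughly 4-bit quantization plus some headroom for the KV cache and runtime (numbers are purely illustrative):

```python
# Back-of-envelope memory estimate: parameters x bits-per-weight, plus ~20%
# headroom for the KV cache and runtime overhead.
def approx_ram_gb(params_billions: float, bits_per_weight: float = 4.0,
                  overhead: float = 1.2) -> float:
    return params_billions * 1e9 * bits_per_weight / 8 * overhead / 1e9

print(f"7B  @ 4-bit: ~{approx_ram_gb(7):.1f} GB")   # fits in 8 GB of RAM
print(f"70B @ 4-bit: ~{approx_ram_gb(70):.1f} GB")  # swap territory on most desktops
```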
With the new Swap™ technology, you are no longer* limited** by your RAM. Our*** brand-new**** Swap™ technology turns your unused disk space into usable***** memory at almost****** no******* perceivable performance impact. When combined with our ZSwap™ compression technology, you can now achieve an up to 5000% or better******** unused-disk-to-memory conversion ratio********* than many RAM-downloading services.**********