Do you host your own AI?

SuspiciousCarrot78@aussie.zone · 2 days ago

Do you host your own AI?

PetteriPano@lemmy.world · 1 day ago

Running qwen3.6 27b through llama.cpp.

It’s about as capable as sonnet 3.5.

I use it for light scripting, but real coding is done by cloud models.

I’m also using it as the brain for my Hermes agent. It sends me digests of news, subreddits, chats that I’d like to read but don’t have time for. It does a great job researching things on the web for me, too.

PapaSkwat@lemmy.wtf · 1 day ago

That’s a great model and it’s the one I use too.

SuspiciousCarrot78@aussie.zone · 1 day ago

Do you mean Sonnet 4.5?

I don’t have the rig to run it at real speeds but I’ve played with it over API. Seems pretty good.

PetteriPano@lemmy.world · 13 hours ago

No, it needs a lot more babysitting than 4.5 does. 3.5 was on the same level of mistakes, at least on the quants I have to use.