> they are going to want to monetize LLMs more and more Not only can you run rea...

vjk800 · 2026-01-23T09:21:26 1769160086

Can you? I imagine e.g. Google is using material not available to the public to train their models (unsencored Google books, etc.). Also, the chat bots, like Gemini, are not just pure LLMs anymore, but they also utilize other tools as part of their computation. I've asked Gemini computationally heavy questions and it successfully invokes Python scripts to answer them. I imagine it can also use other tools than Python, some of which might not even be publicly known.

I'm not sure what the situation is currently, but I can easily see private data and private resources leading to much better AI tools, which can not be matched by open source solutions.

fennecbutt · 2026-01-24T15:06:27 1769267187

Yes, because local models can run Internet search tools. Even the big boys like openai etc I prefer the results quality when it's made a search - and they seem to have realised this too, the majority of my queries now kick off searches.

croon · 2026-01-23T11:19:57 1769167197

While they will always have premiere models that only run on data center hardware at first, the good news about the tooling is that tool calls are computationally very minimal and no problem to sandbox/run locally, at least in theory, we would still need to do the plumbing for it.

So I agree that open source solutions will likely lag behind, but that's fine. Gemini 2.5 wasn't unusable when Gemini 3 didn't exist, etc.

OGEnthusiast · 2026-01-23T06:32:49 1769149969

How do you verify the models you download also aren't trying to get you to buy stuff?

tvink · 2026-01-23T06:55:29 1769151329

I guess you.. ask them a bunch of recommendations? I would imagine this would not be incredibly hard to test as a community

ben_w · 2026-01-23T12:50:57 1769172657

Before November 30, 2022 that would have worked, but I think it stopped being reliable sometime between the original ChatGPT and today.

As per dead internet theory, how confident are we that the community which tells us which LLM is safe or unsafe is itself made of real people, and not mostly astroturfing by the owners of LLMs which are biased to promote things for money?

Even DIY testing isn't necessarily enough, deceptive alignment has been shown to be possible as a proof-of-concept for research purposes, and one example of this is date-based: show "good" behaviour before some date, perform some other behaviour after that date.

awesome_dude · 2026-01-23T06:43:43 1769150623

Proudly bought to you by Slurm