snats · 2026-01-14T20:57:02 1768424222

personal website: snats.xyz weblog: weblog.snats.xyz

snats · 2025-09-28T02:45:34 1759027534

I also did a couple of experiments on pruning LLMs[1] using genetic algorithms, and you can keep removing a surprising number of layers from big models before they start to have a stroke.

[1] https://snats.xyz/pages/articles/pruningg.html
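
For anyone curious what "pruning layers with a genetic algorithm" can look like, here is a minimal sketch. It is not the method from the linked article: the function names, the keep/drop mask encoding, and especially `toy_score` are assumptions made up for illustration. In a real experiment the score would come from evaluating the pruned model on a benchmark.

```python
import random


def evolve_layer_mask(n_layers, score_fn, generations=30, pop_size=20,
                      mutation_rate=0.1, seed=0):
    """Search for a keep/drop mask over layers with a simple genetic algorithm."""
    rng = random.Random(seed)
    # Each individual is a list of bits: 1 keeps a layer, 0 removes it.
    population = [[rng.randint(0, 1) for _ in range(n_layers)]
                  for _ in range(pop_size)]
    population[0] = [1] * n_layers  # always include the unpruned model
    for _ in range(generations):
        # Keep the better half unchanged (elitism), breed the rest from it.
        ranked = sorted(population, key=score_fn, reverse=True)
        parents = ranked[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_layers)  # single-point crossover
            child = a[:cut] + b[cut:]
            # Flip each bit with a small probability.
            child = [bit ^ 1 if rng.random() < mutation_rate else bit
                     for bit in child]
            children.append(child)
        population = parents + children
    return max(population, key=score_fn)


def toy_score(mask):
    # Hypothetical stand-in for "run the pruned model on a benchmark".
    # Assumption: the first two and last two layers are essential, and
    # every other removed layer earns a small reward.
    n = len(mask)
    if any(mask[i] == 0 for i in (0, 1, n - 2, n - 1)):
        return -1.0
    return sum(1 for bit in mask if bit == 0) / n


best = evolve_layer_mask(n_layers=12, score_fn=toy_score)
print(best, toy_score(best))
```

The expensive part in practice is `score_fn`, since every candidate mask means a model evaluation, so the population size and generation count above are toy values.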

kridsdale1 · 2025-09-28T05:09:26 1759036166

I suspect this applies to human beings as well.

TeMPOraL · 2025-09-28T14:25:55 1759069555

There's https://en.wikipedia.org/wiki/Hydrocephalus# and there are cases of people living normal lives, not realizing they're missing most of their brains, until this gets discovered on some unrelated medical test. Or people who survived an unplanned brain surgery by rebar or bullet. Etc.

littlestymaar · 2025-09-28T07:01:16 1759042876

Well, humans don't have “layers” in the first place…

Legend2440 · 2025-09-28T08:00:21 1759046421

Of course not, that’s ogres.

snats · 2025-07-17T00:50:06 1752713406

i get it. i want one of those. the problem is that most cellphones are not actual cellphones, they are entertainment machines. they are a pocket tv / social media feed place. most usage for my normal friends is for that.

snats · 2025-04-27T23:19:53 1745795993

I am working on https://moviemovie.club/about, a tiny website for film reviews.

It works like a run club, where you have to make a review first to see other people's reviews.

I am currently implementing watchlists, comments and a mural to make it feel a bit less lonely. Right now I like the UI but it feels too lonely.
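
A rough sketch of the "review first, then see others" rule, in case it helps picture it. This is not the site's actual code: the `Review` type, the `visible_reviews` function, and the choice to gate per film (rather than site-wide) are all assumptions for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Review:
    author: str
    film: str
    text: str


def visible_reviews(viewer, film, reviews):
    """Return other people's reviews of `film`, only if `viewer` reviewed it too."""
    # Assumption: the gate is per film, not a single site-wide unlock.
    has_contributed = any(r.author == viewer and r.film == film for r in reviews)
    if not has_contributed:
        return []
    return [r for r in reviews if r.film == film and r.author != viewer]


# Hypothetical sample data.
reviews = [
    Review("ana", "Stalker", "Slow and hypnotic."),
    Review("ben", "Stalker", "Worth the patience."),
]

print(visible_reviews("cy", "Stalker", reviews))   # cy has not reviewed it yet
print(visible_reviews("ana", "Stalker", reviews))  # ana sees ben's review
```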

dewey · 2025-04-28T05:16:12 1745817372

This seems like it would only work if “reviews” were something rare to come by. Like some forums where you have to contribute to be able to download attachments, or see higher level subforums.

But reviews are everywhere, good ones too, so it will be a hard chicken-and-egg problem to solve.

snats · 2025-03-14T21:20:25 1741987225

Not an insider, but imo the work on diffusion language models like LLaDA is really exciting. It's pretty obvious that LLMs are good, but they are pretty slow. And in a world where people want agents, a lot of the time you want something that might not be that smart but can run and search really fast. You only need to solve search in a specific domain for most agents. You don't need to fit the entire knowledge of human history into a single set of weights.

snats · 2025-03-14T16:51:53 1741971113

It's pretty funny to test AI models in-distribution, but they fail horribly once you push them a bit[1].

I recently made LLMs play Minesweeper, and ALL the LLMs that I tested had a pretty bad win-to-loss ratio. The only model that won more than 3 times was R1 (mind you, there were 50 games).

[1] https://snats.xyz/pages/articles/minesweeper_bench.html

snats · 2025-02-13T23:09:13 1739488153

yup, if i went to do a PhD, interpretability is the only interesting subject for academia IMO right now

snats · on Dec 30, 2024

It's more of a distilled model, not a fair 1:1 comparison

snats · on Dec 3, 2024

I use .XYZ because it was pretty cheap when I bought it

aidenn0 · on Dec 4, 2024

I use .xyz because I have a very common first and last name, and nearly all of the permutations of them were taken on .net, .com, .org, and .us; .xyz seems to price based on how desirable they think the name is, so I still couldn't get $FIRST-$LAST.xyz for a reasonable price, but I got something close.

carbine · on Dec 3, 2024

I do too, aesthetically it's great. Unfortunately the rise in phishing from xyz domains means if you use it to send email your deliverability is likely to suck.

snats · on Nov 20, 2024

I post about ML/DL and SWE at https://bsky.app/profile/snats.xyz