Egregoros · Phoenix Framework

Post

Remote status

Context

Shadowman311

I think the funniest part of this computer memory shortage is that all of it is just being bought up by OpenAI to give the illusion of a growing company, who is then immediately shelving it in warehouses and never using it. When this industry crashes the amount of brand new GPUs flooding the secondary market is going to be nuts

john_darksoul

@john_darksoul@ariamispainted.world remote

@Shadowman311 >illusion of a growing company

They were just trying to starve competitors of ram. Everyone was already aware of their tenuous situation. They have first mover advantage and are perceived as the “brand name” LLM. They’re trying to maintain this status and advance it by kneecapping everyone else. It won’t work. As long as the Chinese keep getting the results of these models for free no advancement made by spending billions will matter in the face of a competitor 6 months behind getting it for free.

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

@john_darksoul >getting the results of these models for free
>getting it for free
What do you mean here exactly?

john_darksoul

@john_darksoul@ariamispainted.world remote

@WandererUber They’re getting all the weights through espionage no? I’m sure they’re still spending on hardware, but if a new version comes out with more parameters they get those models by stealing them. The new Chinese open source model that just came out straight up answered as Claude.

bronze

@bronze@pl.kitsunemimi.club remote

@john_darksoul @WandererUber >the new Chinese open source model that just came out straight up answered as Claude
uhh where can I find this model? I need to download it for research purposes

john_darksoul

@john_darksoul@ariamispainted.world remote

@bronze @WandererUber I think it’s called Kim K2, but deepseek is still open and is still updating iirc. I don’t think these are easy to run though. You’ll probably need a lower parameter one.

hazlin at rest

@hazlin@shortstacksran.ch remote

@john_darksoul @bronze @WandererUber
I remember it being k2.5 that someone claimed identified as Claude when you said hi. I didn't have access to it at that time, and its output is quite different when I compare it to Claude 4.5 Opus.

Maybe the screenshot was fake? idk Kimi K2.5 Reasoning sure is a lot cheaper than Claude 4.5 Opus

john_darksoul

@john_darksoul@ariamispainted.world remote

@hazlin @bronze @WandererUber They’re doing everything cheaper. I wonder if the models are actually cheaper to run or if they’re just trying to be disruptive. If they can maintain their pace and cost we’ll likely see legislation here soon to ban their use in the US.

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

@john_darksoul @bronze @hazlin you need more technical insight tbqh
I already told you guys what they did in this thread, and also it's easily googleable, plus it pops up when you google any of the claims you made.

Nobody knows anything about the theoretical limits of efficiency of tokens/watt, not even on the same hardware.
A smaller model will be cheaper though. And you can "distill" models. Which is what they do. And I said that they did.

john_darksoul

@john_darksoul@ariamispainted.world remote

@WandererUber @bronze @hazlin You need to be less naive about the Chinese man

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

@john_darksoul @bronze @hazlin I don't need to believe in some QAnon tier conspiracy of them somehow stealing the weights when what they actually did also easily explains it and better

john_darksoul

@john_darksoul@ariamispainted.world remote

@WandererUber @bronze @hazlin Homie, they steal EVERYTHING. I just watched a video on their burgeoning RAM business that started with them getting in trouble for stealing tech. Then when that company was blacklisted, a new company started where they left off. There is no conspiracy. It's SOP for any chinese tech company to "accelerate" by getting info wherever they can. The idea that any of this would be found scandalous is laughable.

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

@john_darksoul @bronze @hazlin you're missing the entirely niche technical thing I'm saying and it's pissing me off

john_darksoul

@john_darksoul@ariamispainted.world remote

@WandererUber @bronze @hazlin I understand what you said about the deepseek. They trained that model on the output of ChatGPT instead of stealing the tech. I’m saying they’re probably doing both

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

@john_darksoul @bronze @hazlin and yet you still have zero evidence they pulled that off. Because they haven't.
When they distilled, openAI made this into a public incident. Had they stolen the model, for an open source one, which is what you said, this would also have happened. It didn't. So it's not true.
really not that complicated

BroDrillard

@BroDrillard@nicecrew.digital remote

I have no proof they have. But apparently, they wiretapped everybody from bri'ish ministers to the FBI. How much harder can AI weights be?

https://www.zerohedge.com/geopolitical/next-level-spying-how-china-read-wests-wiretaps-years

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

@BroDrillard @john_darksoul @bronze @hazlin kind of a red string in the thread that the answer is really obvious if you google some technical details first before asking the questions

Replying to @WandererUber@poa.st

bronze

@bronze@pl.kitsunemimi.club remote

@WandererUber @john_darksoul @BroDrillard @hazlin can all you guys chill out im trying to figure out how to download this kimi k2.5 model and make it code for me

Replies

Replying to @bronze@pl.kitsunemimi.club

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

@bronze @john_darksoul @BroDrillard @hazlin faustian

Replying to @WandererUber@poa.st

bronze

@bronze@pl.kitsunemimi.club remote

@WandererUber @john_darksoul @BroDrillard @hazlin next up, i hope some chinese company downloads that 300TB spotify torrent and makes an open source music model thats better than suno

Replying to @bronze@pl.kitsunemimi.club

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

@bronze I need an sfx model btw, there's no good ones I'm aware of