I think the funniest part of this computer memory shortage is that all of it is just being bought up by OpenAI to give the illusion of a growing company, who is then immediately shelving it in warehouses and never using it. When this industry crashes the amount of brand new GPUs flooding the secondary market is going to be nuts
Timeline
Post
Remote status
Context
17
@Shadowman311 >illusion of a growing company
They were just trying to starve competitors of ram. Everyone was already aware of their tenuous situation. They have first mover advantage and are perceived as the “brand name” LLM. They’re trying to maintain this status and advance it by kneecapping everyone else. It won’t work. As long as the Chinese keep getting the results of these models for free no advancement made by spending billions will matter in the face of a competitor 6 months behind getting it for free.
They were just trying to starve competitors of ram. Everyone was already aware of their tenuous situation. They have first mover advantage and are perceived as the “brand name” LLM. They’re trying to maintain this status and advance it by kneecapping everyone else. It won’t work. As long as the Chinese keep getting the results of these models for free no advancement made by spending billions will matter in the face of a competitor 6 months behind getting it for free.
@john_darksoul >getting the results of these models for free
>getting it for free
What do you mean here exactly?
>getting it for free
What do you mean here exactly?
@WandererUber They’re getting all the weights through espionage no? I’m sure they’re still spending on hardware, but if a new version comes out with more parameters they get those models by stealing them. The new Chinese open source model that just came out straight up answered as Claude.
@john_darksoul @WandererUber >the new Chinese open source model that just came out straight up answered as Claude
uhh where can I find this model? I need to download it for research purposes
uhh where can I find this model? I need to download it for research purposes
@bronze @WandererUber I think it’s called Kim K2, but deepseek is still open and is still updating iirc. I don’t think these are easy to run though. You’ll probably need a lower parameter one.
@john_darksoul @bronze @WandererUber
I remember it being k2.5 that someone claimed identified as Claude when you said hi. I didn't have access to it at that time, and its output is quite different when I compare it to Claude 4.5 Opus.
Maybe the screenshot was fake? idk Kimi K2.5 Reasoning sure is a lot cheaper than Claude 4.5 Opus
I remember it being k2.5 that someone claimed identified as Claude when you said hi. I didn't have access to it at that time, and its output is quite different when I compare it to Claude 4.5 Opus.
Maybe the screenshot was fake? idk Kimi K2.5 Reasoning sure is a lot cheaper than Claude 4.5 Opus
@hazlin @bronze @WandererUber They’re doing everything cheaper. I wonder if the models are actually cheaper to run or if they’re just trying to be disruptive. If they can maintain their pace and cost we’ll likely see legislation here soon to ban their use in the US.
@john_darksoul @bronze @hazlin you need more technical insight tbqh
I already told you guys what they did in this thread, and also it's easily googleable, plus it pops up when you google any of the claims you made.
Nobody knows anything about the theoretical limits of efficiency of tokens/watt, not even on the same hardware.
A smaller model will be cheaper though. And you can "distill" models. Which is what they do. And I said that they did.
I already told you guys what they did in this thread, and also it's easily googleable, plus it pops up when you google any of the claims you made.
Nobody knows anything about the theoretical limits of efficiency of tokens/watt, not even on the same hardware.
A smaller model will be cheaper though. And you can "distill" models. Which is what they do. And I said that they did.
@john_darksoul @bronze @hazlin I don't need to believe in some QAnon tier conspiracy of them somehow stealing the weights when what they actually did also easily explains it and better
@WandererUber @bronze @hazlin Homie, they steal EVERYTHING. I just watched a video on their burgeoning RAM business that started with them getting in trouble for stealing tech. Then when that company was blacklisted, a new company started where they left off. There is no conspiracy. It's SOP for any chinese tech company to "accelerate" by getting info wherever they can. The idea that any of this would be found scandalous is laughable.
@john_darksoul @bronze @hazlin you're missing the entirely niche technical thing I'm saying and it's pissing me off
@WandererUber @bronze @hazlin I understand what you said about the deepseek. They trained that model on the output of ChatGPT instead of stealing the tech. I’m saying they’re probably doing both
@john_darksoul @bronze @hazlin and yet you still have zero evidence they pulled that off. Because they haven't.
When they distilled, openAI made this into a public incident. Had they stolen the model, for an open source one, which is what you said, this would also have happened. It didn't. So it's not true.
really not that complicated
When they distilled, openAI made this into a public incident. Had they stolen the model, for an open source one, which is what you said, this would also have happened. It didn't. So it's not true.
really not that complicated
I have no proof they have. But apparently, they wiretapped everybody from bri'ish ministers to the FBI. How much harder can AI weights be?
https://www.zerohedge.com/geopolitical/next-level-spying-how-china-read-wests-wiretaps-years
https://www.zerohedge.com/geopolitical/next-level-spying-how-china-read-wests-wiretaps-years
@BroDrillard @john_darksoul @bronze @hazlin kind of a red string in the thread that the answer is really obvious if you google some technical details first before asking the questions
@WandererUber @john_darksoul @BroDrillard @hazlin can all you guys chill out im trying to figure out how to download this kimi k2.5 model and make it code for me
Replies
3
@bronze @john_darksoul @BroDrillard @hazlin faustian
@WandererUber @john_darksoul @BroDrillard @hazlin next up, i hope some chinese company downloads that 300TB spotify torrent and makes an open source music model thats better than suno
@bronze I need an sfx model btw, there's no good ones I'm aware of