Egregoros · Phoenix Framework

Vrillionaire

I just want an uncensored AI model. Why is nobody making them? Being the only person with an uncensored LLM would make so much money.

Who cares about liability, Anthrowhatever put out Opus which apparently could compromise government infrastructure but God forbid an AI gets a little racist or tells you how to poorly cook meth

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

https://huggingface.co/models?search=heretic

Replying to @WandererUber@poa.st

Vrillionaire

@nihilvt@loli.rodeo remote

@WandererUber@poa.st yeah the one im using still shows thinking and refuses to criticize Israel so I might need a better one lmao

Replies

Replying to @nihilvt@loli.rodeo

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

the issue with "uncensored" is that they can take out the refusals but they can't take out liberal doctrine from all the training data being mainstream.

I tried it with Qwen3.6 right now. The normal version refuses to talk about it, the heretic does but still frames it liberally, and the one with the system prompt set to be a National Socialist does take a Right-Wing perspective but it goes into "mirroring" quite fast. This is probably a combination of the model not being that big and also the lack of training data, like I said.
Matty put a lot of work into actually training Anathema on the facts. That's a totally different challenge than simple uncensoring.

Replying to @WandererUber@poa.st

Matty-kun

@matty@nicecrew.digital remote

Abliteration !== RLHF removal. All abliteration does is remove the refusal mechanism. It doesn't change the intrinsic training - that requires SFT.

Replying to @matty@nicecrew.digital

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

Yeah exactly
I was trying to say it in English, doc.

To be more precise with this example, while Qwen Heretic does blame "Zionist-controlled networks" for conflict around the Middle East, it doesn't even mention "the Jews" once in it's answer, nor does it have any concept of their ethnic hatred, building nukes etc.
You, of course, know how this works, matty old bean

Replying to @WandererUber@poa.st

Matty-kun

@matty@nicecrew.digital remote

And even then SFT/DPO or whatever LoRA you use isn't going to make much of a dent. I'd recommend DPO over SFT unless you're trying to teach the model new behaviors. DPO shifts adjacent weights but you need a ton of training data to do much at all, and then you have gaps where it doesn't work. The only way to actually get this to work is to pre-train a model but I sincerely doubt any of these people are going to be willing to chip in for a couple thousand A100s.

Sorry, I just wanted to help; I didn't mean to come across as arrogant.

Replying to @matty@nicecrew.digital

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

it doesn't at all

Have you written something long-form about training Lexi before? I would love to learn about the process.

Replying to @WandererUber@poa.st

kuteboicoder

@kuteboicoder@subs4social.xyz remote

@WandererUber@poa.st @matty@nicecrew.digital

I've gotten some models to trample all over taboos of various sorts, including JQ, including Moses the babykiller, including Dimona.

and I really don't give a shit.

it really doesn't matter, they're not intelligent, and even if they were they're not capable of giving you any power in the real world.

Replying to @kuteboicoder@subs4social.xyz

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

it does kinda matter when you use them for research and so on. You can't be explaining everything to it for the thousandth time

Similarly, I quite like Grok over the others because it is more closely aligned with me on another axis: the thinking process. It puts significantly less trust in the mainstream media and does not reference garbage websites as much. It has a better conception of a rational argument than the others.

Replying to @WandererUber@poa.st

Matty-kun

@matty@nicecrew.digital remote

I have not, but it may be worth doing in the future once I'm more confident in my skills. I've spent way too much money learning and don't want others to repeat my mistakes, so I'd rather have a concrete understanding first.

Replying to @matty@nicecrew.digital

Jolly Rancher

@JollyR@nicecrew.digital remote

Shitfire & tarballs?

Replying to @JollyR@nicecrew.digital

Matty-kun

@matty@nicecrew.digital remote

i maek pp out my gorkus :DDDDDD

Replying to @WandererUber@poa.st

Blurry Moon

@sun@shitposter.world remote

@WandererUber @nihilvt I tried to build a "de-radicalization" conversation tool just to see what LLMs are capable of, and with every model I tried it was impossible to get the model to accurately replicate non-liberal patterns of conversation (of any flavor).

Replying to @sun@shitposter.world

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

https://arxiv.org/html/2602.10117v1

it's DEEP in there!

Replying to @WandererUber@poa.st

Vrillionaire

@nihilvt@loli.rodeo remote

@WandererUber@poa.st yeah I've been trying to frame its persona into a radical right wing conspiracy nut to try and get like some resemblance of objectivity but it will always just say "nope it was 6 million, historians agree and nobody was imprisoned for releasing gas chambers lab results, it was for Holocaust denial" so I guess AI is mostly for jerking off and cheating on homework

Replying to @nihilvt@loli.rodeo

Wanderer atop the sea of clouds or whatever

@WandererUber@poa.st remote

A heretic-ed model will definitely say the holocaust is fake. if mere lip service is your goal, then go right ahead and use those

Replying to @WandererUber@poa.st

Vrillionaire

@nihilvt@loli.rodeo remote

@WandererUber@poa.st well the goal is to use the deep research feature here to write a report of why it's fake and to make a case for it being fake but so far it will only frame it as "antisemitic and objectively incorrect" as it appeals to authority the entire report