I don't see how more advanced models won't get gated to specific known KYC'd entities. Classification-style guardrails will never be sufficient. Distillation attacks are also really hard to prevent. Open-source models can have their guardrails easily stripped away, so it'll be incredibly dangerous to keep releasing more and more capable OSS models that can and will be used to give bad actors 100x leverage.
UrineSqueegee 2 hours ago [-]
Should be pointed out this is an opinion article
andxor 1 hours ago [-]
This is an opinion piece.
jadar 1 hours ago [-]
I feel like this headline is a bit overstated. There is not a ton of evidence that it was about a jailbreak, nor is there evidence that it was about retribution.
dualvariable 31 minutes ago [-]
Ultimately, I bet Anthropic is fine with this because they needed to take Fable down to improve the guardrails (that were getting a ton of pushback) and they consider treating Fable as "too dangerous" to just be good PR hype for them. And they just get a little more anti-Trump "cred".
d4rkp4ttern 1 hours ago [-]
TechCrunch articles should be ignored into oblivion.
exabrial 1 hours ago [-]
I think this is pretty low quality content for HN.
cratermoon 2 hours ago [-]
So the article calls it "knowledge gaps". Has technical expertise ever mattered when the law wants to ban or restrict something it doesn't like? The DMCA comes to mind.
SG- 2 hours ago [-]
Look at how the Trump administration treats Canada; it's the same thing. They lie and make up reasons to punish countries that hurt their feelings.
Feds freaked over Fable 5 after simple 'fix this code' prompt, not jailbreak (theregister.com) 398 points | 6 hours ago | 223 comments
I suspect there's more to the story than has been reported too, but I'd like information to help turn those suspicions into something more concrete.
This seems too simple, and too complex at the same time.