Need to let loose a primal scream without collecting footnotes first? Have a sneer percolating in your system but not enough time/energy to make a whole post about it? Go forth and be mid: Welcome to the Stubsack, your first port of call for learning fresh Awful youāll near-instantly regret.
Any awful.systems sub may be subsneered in this subthread, techtakes or no.
If your sneer seems higher quality than you thought, feel free to cutānāpaste it into its own post ā thereās no quota for posting and the bar really isnāt that high.
The post Xitter web has spawned soo many āesotericā right wing freaks, but thereās no appropriate sneer-space for them. Iām talking redscare-ish, reality challenged āculture criticsā who write about everything but understand nothing. Iām talking about reply-guys who make the same 6 tweets about the same 3 subjects. Theyāre inescapable at this point, yet I donāt see them mocked (as much as they should be)
Like, there was one dude a while back who insisted that women couldnāt be surgeons because they didnāt believe in the moon or in stars? I think each and every one of these guys is uniquely fucked up and if I canāt escape them, I would love to sneer at them.
(Semi-obligatory thanks to @dgerard for starting this)
Itās almost completely ineffective, sorry. Itās certainly not as effective as exfiltrating weights via neighborly means.
On Glaze and Nightshade, my prior rant hasnāt yet been invalidated and thereās no upcoming mathematics which tilt the scales in favor of anti-training techniques. In general, scrapers for training sets are now augmented with alignment models, which test inputs to see how well the tags line up; your example might be rejected as insufficiently normal-cat-like.
I think that āforce-feedingā is probably not the right metaphor. At scale, more effort goes into cleaning and tagging than into scraping; most of that āforcedā input is destined to be discarded or retagged.
yeah this is the thing Iāve been thinking a lot about
fucking reCaptcha is literally mass-weaponising users for data filtration, and there is no good counter besides just not using reCaptcha (which is something one canāt easily pull off without things like regulatory action, massive reputational problems that make people gtfo, etc)
I have similar worries about cloudflare being such a massive chokepoint and using that position to enable āai bot filterā services. feels extremely monopolistic, but ianal and Iām not entirely sure what the case grounds/structure on that would be (if any)
the only other viable strategy at the moment is fully breaking contact with any potential bad traffic systems, and thatās extremely fucking dire because thatās yet another nail in the coffin of the increasingly less open internet
The whole Cloudflare bot detection is so weird and eerie. Iāve had issues where I canāt get past it presumably just because Iām using some in-application browser just to get a login cookie, but other times it just lets fucking curl through no questions asked.
Fucking what. Iāve heard of sites blocking curl and Iāve been able to get around it by copying user agent and sometimes cookies from the browser. Now Iām cursed with the knowledge that I could probably just scrape stuff from everywhere