Need to let loose a primal scream without collecting footnotes first? Have a sneer percolating in your system but not enough time/energy to make a whole post about it? Go forth and be mid: Welcome to the Stubsack, your first port of call for learning fresh Awful youā€™ll near-instantly regret.

Any awful.systems sub may be subsneered in this subthread, techtakes or no.

If your sneer seems higher quality than you thought, feel free to cutā€™nā€™paste it into its own post ā€” thereā€™s no quota for posting and the bar really isnā€™t that high.

The post Xitter web has spawned soo many ā€œesotericā€ right wing freaks, but thereā€™s no appropriate sneer-space for them. Iā€™m talking redscare-ish, reality challenged ā€œculture criticsā€ who write about everything but understand nothing. Iā€™m talking about reply-guys who make the same 6 tweets about the same 3 subjects. Theyā€™re inescapable at this point, yet I donā€™t see them mocked (as much as they should be)

Like, there was one dude a while back who insisted that women couldnā€™t be surgeons because they didnā€™t believe in the moon or in stars? I think each and every one of these guys is uniquely fucked up and if I canā€™t escape them, I would love to sneer at them.

(Semi-obligatory thanks to @dgerard for starting this, and happy new year in advance.)

  • YourNetworkIsHaunted@awful.systems
    link
    fedilink
    English
    arrow-up
    9
    Ā·
    edit-2
    3 days ago

    Nobody outside the company has been able to confirm whether the impressive benchmark performance of OpenAIā€™s o3 model represents a significant leap in actual utility or just a significant gap in the value of those benchmarks. However, they have released information showing that the most ostensibly-powerful model costs orders of magnitude more. The lede is in that first graph, which shows that for whatever performance gain o3 costs over ~$10 per request with the headline-grabbing version costing ~$1500 per request.

    I hope theyā€™ve been able to identify a market willing to pay out the ass for performance that, even if it somehow isnā€™t over hyped, is roughly equivalent to an average college graduate.

    • JFranek@awful.systems
      link
      fedilink
      English
      arrow-up
      7
      Ā·
      3 days ago

      Iā€™m wondering about the benchmark too. Itā€™s way above my level to figure out how it can be gamed. But, buried in the article:

      Moreover, ARC-AGI-1 is now saturating ā€“ besides o3ā€™s new score, the fact is that a large ensemble of low-compute Kaggle solutions can now score 81% on the private eval.

      The most expensive o3 version achieved 87.5%

    • skillissuer@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      10
      Ā·
      edit-2
      3 days ago

      if all of that $1500 cost is electricity, and at arbitrarily chosen but probably high electricity price of $0.2/kWh, thatā€™s 7.5MWh per request. could be easily twice that. this is approx how much electricity four 4-person households consume in a year in poland. or about half of american one. six tons of TNT equivalent, or almost 2/3 ton of oil equivalent if you prefer

      • YourNetworkIsHaunted@awful.systems
        link
        fedilink
        English
        arrow-up
        7
        Ā·
        3 days ago

        Actually wait Iā€™m pretty sure itā€™s even worse because Iā€™m terrible at reading logarithmic scales. Itā€™s roughly halfway between $1,000 and $10,000 on their log scale, which if I do the math while actually awake works out closer to $3,000.