Hacker Newsnew | past | comments | ask | show | jobs | submit | user43928's commentslogin

I cannot comment on the likelihood of xAI being competitive as an AI lab in the future.

However, their "overbuying" of compute means they can now rent it out for $2.32B/month.

That seems like solid business to me, and raises the question whether your claim of xAI needing 'ongoing enormous funding' is accurate.


Most of that is from Anthropic which is IPO'ing?

Prior to IPO they want to show the best growth numbers possible, so how long they continue with the deal post IPO remains to be seen.


54% is Anthropic.

Considering their models had outages with the message 'model overloaded' just a few days ago, I doubt that their compute demand is ending soon.


It clearly shows that the billions investedin xAI are not returning what they should with the product they invested in.

No it does not. The Late 2024 M4 Mac mini benchmarks x1.6 faster in ST and 2x in MT.

The Mac mini costs $600.


The mac mini is a wonder but it's not a great gaming machine[1]. You can see that these stats are about 1/2 of what the Steam Machine does, so I think the comparison is pretty apt.

[1] https://www.xda-developers.com/mac-mini-m4-gaming-hands-on/


Yes, for gaming specifically, the Mac mini is often limited to 2D games.

However, at the price of $1130 for Steam Machine + controller, you might as well buy the Mac mini and a PS5 on top for $1250.

It just seems like a poor deal.

The best argument I have heard is that people already have large Steam libraries, but then again, those people typically already own a gaming PC.


Neither of those options is good value. PS5 charges more for games and you have to pay extra to connect to the Internet. The value proposition of PC gaming is openness. You can play on whatever you want, on anything that can run it; you're not locked into one hardware vendor or game store. This means competitive pricing across all market segments, except where consoles sell at a loss to buy lock-in.

>PS5 charges more for games and you have to pay extra to connect to the Internet

To be clear, you have to pay to play certain games online. A lot of popular ones are free and general internet access does not cost anything.

A lot of folks also aren't all that interested in playing games online anyway.


You're right, I was imprecise. Additionally, I wasn't aware that some multiplayer games are free. I'm obviously not a PS5 user.

Buying a gaming pc is always a bad deal compared to a PS5! Even though anyone buying a gaming PC is getting a "bad deal" - many people prefer it. You can do lots of things on a PC that you can't on a PS5 - and there are reasons someone might want a 6" cube instead of a full PS5 and a mac mini. None of them are low price but they are reasons nonetheless.

A great example of the target audience are the people who've been playing games on the Steam Deck, but want something with a bit more oomf without the hassle of building a PC. I am not in that demographic! But I have a friend who is. He's quite happy to pay more for convenience. He already has a gaming laptop, but I can see him getting this to replace his ancient Steam Link.


> Buying a gaming pc is always a bad deal compared to a PS5!

If you only compare the hardware, that's true. Even if you don't consider all the other functionality that a PC has vs. a console, add all the different ways to get free and heavily discounted games on Steam/PC, and the results of that calculation might start to look very different.


Your response is essentially OPs reasoning, read it again :)

Anyways, just wanted to add that the steam machine and PCs killer differentiator: a truly open platform that no mac, ps5 and other consoles can offer. Do whatever you want, install whatever software you want, whatever OS you want. Break the rules, face the consequences. Live life like a living being, not as a slave to some corpo.


> Your response is essentially OPs reasoning

I take it you meant GP (as in, the post I was responding to - which to this post is actually GGP but I digress).

I don't think it is. Their reasoning is:

> there are reasons someone might want a 6" cube instead of a full PS5 and a mac mini. None of them are low price but they are reasons nonetheless.

Mine is that it is indeed price, only not the price of the hardware alone but rather the price of the ecosystem as a whole. Another aspect that I didn't cover is that a game that you buy today for PC will likely still work on whatever PC you have 20, 30 years from now. The same cannot be said for consoles.

I do agree with your second paragraph though! :)


what makes a great gaming machine? It plays Dwarf Fortress amazingly. And nethack. I'm running factorio on my Mac. The older Mac mini's run windows. Game of Thrones a ton of money at proton so anything that runs Linus and X86 has a shit ton of games yeah even if it doesn't have your pet game.

> what makes a great gaming machine?

A piece of hardware that runs a basket of popular higher-end games at close to 60fps is generally what people look for. If you know you wanna run DF you can use much cheaper hardware, but if you wanna run "games" you wanna check that your target pc performs good enough on a selection of games.


What makes a gaming machine...

Good - at parity with a PS5 Pro or XBSX in the latest AAA titles.

Great - better than PS5 Pro or XBSX in the latest AAA titles.


The lowest spec M4 Mac Mini on apple.com is $799 today. The next generation Mini will likely be more expensive due to memory pricing, and as the Steam Machine already includes current higher memory pricing, that would be a fairer comparison, no?

You are correct. I see now that the offer I saw for $600 is largely out of stock, and only available in Florida.

Now play games on it and show me the benchmarks.

I mean that for real: I’ve been impressed by the performance of the M4 Mini I own, but a gaming machine it is not


That is true.

Maybe in the future. There should be a new generation of Mac Mini's soon, further extending the performance lead of Apple chips.

Maybe once Fable is back or the next OpenAI model releases, we could take a look at implementing a compatibility layer to translate DirectX games to Metal.

Even if that should yet be out of reach, such a project may become more feasible if AI progress keeps up.


We've been hearing about future Apple gaming wins longer than we've been waiting for Star Citizen launch.

At some point you need to face the reality of it not happening.


I doubt it will ever be because Apple doesn’t understand the non casual gaming market.

I agree. And at work it has been producing some of the worst GUI test cases I have ever seen.

What is tested often makes no sense at all, completely implausible edge cases are tested on internals, while it doesn't create tests for the overall application using user events.

And some things in these test cases are downright ridiculous: instead of instantiating your classes, it sets up some barebones fake objects reimplementing some of the behavior of your actual class, then ignores the TypeScript errors via force cast or similar.

Then it proceeds to slap some test ids on the output, stubs components and dependencies more or less randomly, adds some assertions on test ids and calls it a day.

Apparently that's good enough for many colleagues to open a MR for that garbage.

That said, at home with SOTA models I happily hand large units of work to it, outsource much of the thinking, and get workable results. I think this is the future.


The products generally work just fine on my MacBook.

I have not encountered major issues in either the Claude Code CLI, the Codex Desktop app, or Claude Desktop app.

They generally get the job done. I don't measure disk writes or analyze the GPU usage.


Isn't it arguably the opposite?

A CORS header in the response tells your browser to relax CORS restrictions.


I previously gave this author and the bun rewrite the benefit of the doubt. But an obvious slop PR to the WebKit repository?

I'd tap out here too if I was a maintainer. Even if the change was perfect, if you could not be bothered to write the PR description, I am not going to waste my time with it.

Edit: My bad, the PR is to a fork, in that case it's not our business how the PR description is written.


It’s a PR on their private fork, they’re not expecting to have this accepted upstream.

GitHub Copilot in vscode has two ways to access Opus: the Copilot harness or the Claude Code Agent SDK within Copilot.

And that's if we assume that the vscode GHCP default Agent ("Local") is the same as the "Copilot CLI" one that is also selectable in vscode. I have not tried that one.

A few weeks ago the Claude Code Agent SDK was much better than the default Copilot Agent, but nowadays I am not sure.


Has there been any evidence of a well known provider rerouting to lower quality models?

Last I saw, engineers working at OpenAI denied this on HN.

I saw that someone set up a tracker that aims to record the performance of the models, and so far it has not shown any statistically significant deviation in performance for Codex, and not yet enough data for Claude: https://marginlab.ai/trackers/codex/


Yes, OpenAI admits they silently reroute sensitive requests to different models for user welfare at least: https://openai.com/index/building-more-helpful-chatgpt-exper...

The implementation was so borked, SamA went back on Reddit and apologised: https://old.reddit.com/r/ChatGPT/comments/1o6jins/updates_fo...

Model re-routing happens for coding tasks too. For example, in OpenAI support pages used to (at least 1 month ago when I checked) mention that if they automatically use a cheaper -mini to accomplish the task behind the scenes, you’ll be charged -mini prices even if you selected a more expensive model. I just checked again and they’ve removed it, but there’s probably archives.

Finally, even if they’re the same weights, you don’t know what quantisation you’re running at. Adaptive quantisation based on load (given workday peaks), or similar techniques, have been happening since the ChatGPT 3.5 days; the techniques are probably more advanced now.


What you linked appears to be related to GPT-5's Auto router in the ChatGPT app back then, and that it supposedly would choose the 'good' model over the pretty bad Instant model for mental health requests.

That's pretty far from the hypothesis that either OpenAI or Anthropic is using adaptive quantisation based on load for their professional coding agent tools.

This is what I think engineers working at OpenAI explicitly denied, and for which we have seen zero evidence yet.

Many people seem to believe it anyway, but the non-deterministic nature of the tools appears to be the more plausible explanation for perceived degradation, in my opinion.


> Has there been any evidence of a well known provider rerouting to lower quality models?

The firm [Anthropic] would deliberately degrade the model’s performance in ways that were invisible to the user.

https://news.ycombinator.com/item?id=48485958


I also don't think I need to prompt Claude differently than Codex.

The most important thing to be aware of in my opinion would be that Claude is better at UI design, and leaves a lot more comments in the code.

Other than that the results seem similar, at least functionally. I do not usually review the code style.


Are people really going to perform invasive procedures over mere concern if there are no symptoms and the doctor recommends against it?

There are numerous comments here from experienced people addressing this. Yes, that happens and a doctor who dismisses the concern can be sued for malpractice if something actually does show up, so they are put in a difficult position. For some reason you just assume that doctors will recommend against an invasive procedure when there is a positive tomography result.

Review the numerous comments that address this as a statistical issue -- which it very much is when talking about the scale that Midjourney is claiming.


People take horse dewormer against COVID so yes they will do all kinds of irrational things

Oh we're still doing the "horse dewormer" thing despite 250 million humans taking it each year?

Yes, because it's nonsense and those 250m humans need to get off Twitter.

250M people take it as an anti-parasitic, as they should.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: