Hacker Newsnew | past | comments | ask | show | jobs | submit | helmsb's commentslogin

They also announced new APIs for native Linux containerization on MacOS with a specific focus on security and performance. This seems like it may be in support of that as well. Anything they can do to improve the performance of containers is a huge win. https://youtu.be/JvQtvbhtXmo?si=3OphClGvylHggmSW


Lest we forget, Jobs LOVED the original iMac Hockey Puck Mouse and wanted the iPhone to only run a handful of Apple-designed apps.


The original iPhone was supposed to be used for PWAs. Job's vision actually made way more sense in retrospect although what Apple produced was ultimately much more profitable for them.


I do, I use Fastmail and create aliases for every service. It's interesting to see how fast companies will "lose" or sell your email address.

I've seen it as fast as 24 hours my unique email address is being used by others even though their privacy policy says that they will never share your info.


I did a few tests and asked it some legal questions. 4o gave me the correct answer immediately.

o1 preview gave a much more in depth but completely wrong answer. It took 5 follow ups to get it to recognize that it hallucinated a non-existent law


That is very interesting. Would you mind testing the same prompt with Claude Sonnet 3.5 and Opus? If not available to you, would you be willing to share the prompt/question? Thank you.


This is interesting since they claim it does well on STEM questions, which I’d assume would be a similar level of reasoning complexity for a human.


This is an interesting one because math is doing so much of the heavy lifting. And symbolic math has a far smaller representational space than numerical math.

There is one other wonderful thing about symbolic math, the glorious '=' sign. It's structured everywhere from top-to-bottom, left-to-right, which is amenable to the next token prediction behavior and multi-attention heads of transformer based LLMs.

My guess is that problem statement formation into an equation is as difficult of a problem for these as actually running through the equations. However, having taken the Physics GRE, and knowing they try for parity of difficulty between years (even though they normalize it), the problems are fairly standard and have permutations of a problem type between the years.

This is not to diminish how cool this is, just that standardized tests do have an element of predictability to them. I find this result actually neat though; it's an actual qualitative improvement over non-CoT LLMs, even if things like Mathematica can do the steps more reliably post problem formation. I think that judiciously used, this is a valuable feature.


A difficult to guess fraction of all of these results are training to the test in various forms


Perhaps the smaller model used in o1 is over trained on arxiv and code relative to 4o (or undertrained on legal text)


But the AI generated article from the front page yesterday said I shouldn’t use Screens anymore… /s


They first introduced a product called Snowball which is a big briefcase of hard drives that would ship you, you transfer up to a petabyte of data and send back. This was the extension of that so they went with another snow related name. Also their data archival service called Glacier which all imply "cold storage."


I do this all the time. Change a character so I can quickly see that I’m running the build that I thought when iterating internally.

This theory makes the most sense to me.


You still have to go through security with PreCheck. It's usually faster and requires fewer steps that non-PreCheck passengers have to go through.

I used to fly up the East Coast and back every other week and I couldn't imagine doing it without it. Seeing the 1-hour standard line and instead making it through PreCheck in less than 5 minutes. I would totally use a self-service system.

When people get PreCheck the normal reaction is "That was easy, why doesn't everyone do that?" I joke that the real "test" when you sign up for PreCheck is can you successfully book the appointment and show up?


I do PreCheck as well. Perhaps I could have phrased my comment better. Currently the only non-automated/non-self service part of it is the person checking your id and boarding pass.


No you’re NOT a moron, just a minor mistake. Have a wonderfully Wednesday.


I switched to Kagi and really like it. It really shows you what's possible when you're not optimizing for ads.

Lots of cool functionality like:

Lenses: Allows you to run a search to prioritize certain domains to give you a particular "lens" to view it through.

Personalized results: Allows you to give additional weight or decrease the weight of certain domains.

Search Bangs: Shortcuts for executing the search somewhere else like Reddit, Wikipédia, Google, etc.

I decided to pay for it not because of the ads or privacy or anything else but because it allows me to make "search" work better for my needs.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: