More

bremac · 2025-02-04T09:09:54 1738660194

Location: San Francisco, CA

Remote: Yes

Willing to relocate: Yes

Technologies: Java, C, C++, SQL, Python, Kubernetes, Linux, perf, PostgreSQL, OCaml

Résumé/CV: http://macdonell.net/resume.pdf

Email: brendan@macdonell.net

Staff engineer with over a decade of experience leading products to successful delivery and beyond. Recently focused on high-performance streaming analytics at Sight Machine (I led the design and implementation of their stream processing engine, and before that, their data acquisition product), I can also work as a generalist backend developer or team lead.

Shoot me an email if you want to chat!

bremac · on Jan 2, 2025

  Location: San Francisco, CA
  Remote: Yes
  Willing to relocate: Yes
  Technologies: Java, C, C++, SQL, Python, Kubernetes, Linux, perf, PostgreSQL, OCaml
  Résumé/CV: http://macdonell.net/resume.pdf
  Email: brendan@macdonell.net

Staff engineer with over a decade of experience leading products to successful delivery and beyond. Recently focused on high-performance streaming analytics at Sight Machine (I led the design and implementation of their stream processing product, and before that, their data acquisition product), I can also work as a generalist backend developer or team lead.

Shoot me an email if you want to chat.

No management positions, please!

bremac · on Dec 22, 2024

Unfortunately none of the hardware used for testing supports FP16 arithmetic. Between Intel and AMD, the only platform that supports AVX512-FP16 is currently Sapphire Rapids.

Cold_Miserable · on Dec 24, 2024

Alderlake supports AVX512-FP16. Still only 9.6x faster than div. Most likely reciprocal is just too slow.

bremac · on Oct 28, 2024

Value types still require allocation for types larger than 128 bits if the value is either nullable or atomic — that seems like a reasonable trade-off to me.

pizlonator · on Oct 28, 2024

Oh! Reasonable trade-off indeed. Thank you for clarifying.

bremac · on Sept 26, 2024

Keep in mind that you still need send a print job to the fake printer to trigger the exploit. If you send the job to your real printer, nothing happens.

crote · on Sept 27, 2024

The exploit allows an attacker to overwrite your real printer with their fake printer.

bremac · on March 16, 2024

Per the bug report, all versions since Java 8 are affected.

CharlesW · on March 16, 2024

> Affected Version: 8,11,17,21,22

This has changed (they added 21) since I posted the comment above, so it looks like they’re still getting a handle on it.

bremac · on Jan 17, 2024

Unfortunately, unless the JIT can prove that the address you are accessing via the segment is non-negative, it can't elide the bounds check. See https://mail.openjdk.org/pipermail/panama-dev/2023-July/0194... for a bit more information.

bremac · on Jan 17, 2024

As the comment you replied to indicates, both of those APIs perform bounds-checking. In certain tight loops, this can add up to quite a bit of overhead [1]. However, it's not documented, but if you really know what you are doing you can convince the JIT to elide the bounds checks for MemorySegments [2].

[1] https://mail.openjdk.org/pipermail/panama-dev/2023-July/0193...

[2] https://mail.openjdk.org/pipermail/panama-dev/2023-July/0194...

kaba0 · on Jan 17, 2024

As pron mentioned, you can use the internal unsafe API without bound checks, you just need a flag for that

bremac · on May 2, 2023

Reading between the lines, it sounds as if they're using mmap. There is no "append" operation on a memory mapping, so the file would need to be preallocated before mapping it.

If the preallocation is done using fallocate or just writing zeros, then by default it's backed by blocks on disk, and readahead must hit the disk since there is data there. On the other hand, preallocating with fallocate using FALLOC_FL_ZERO_RANGE or (often) with ftruncate() will just update the logical file length, and even if readahead is triggered it won't actually hit the disk.

EE84M3i · on May 2, 2023

For the file being entirely pre-allocated case I understand, but for the file hole case I'm not sure I understand why you'd get such high disk activity.

If the index block also got evicted from the page cache, then could reading into a file hole still trigger a fault? Or is the "holiness" of a page for a mapping stored in the page table?

loeg · on May 3, 2023

I suspect page size/aligned file holes could be backed by a read-only zero page via PTE as an optimization, but they might not be (I'm not as familiar with Linux mmap/filesystems as with FreeBSD).

It is quite possible the filesystem caches, e.g., the file extent tree (including holiness) separately from the backing inode/on-disk sectors for the tree.

ayende · on May 3, 2023

Using _ftruncate_ or FALLOC_FL_ZERO_RANGE is a bad idea for a database. The problem is that you may get an out of disk space error mid operation.

If you are using mmap, that will express itself as a segmentation fault, which you really don't want.

You _need_ to allocate the file ahead of time, so you can properly behave there.

bremac · on April 9, 2023

I think you may have linked to the wrong graph — while that graph does have a spike on it, the spike happens when the index was broadened in May 2020 to include savings and money market accounts.