Remix.run Logo
zokier 13 hours ago

iirc those were oriented more towards large HPC clusters rather than computation on single node?

Jtsummers 12 hours ago | parent | next [-]

Chapel, at least, aims for both. You can write loops that it will try to compile to use SIMD instructions, or even for the GPU: https://chapel-lang.org/docs/technotes/gpu.html

convolvatron 12 hours ago | parent | prev [-]

the distinction matters less and less. Inside the GPU there is already plenty of locality to exploit (catches, schedulers, warps). nvlink is a switch memory access network, so that already gets you some fairly large machines with multiple kinds of locality.

throwing infiniband or IP on top is really structurally more of the same.

Chapel definitely can target a single GPU.