I never could. What helps me is to consider the C-language heritage of Python. There, the beginner also runs into the slightly confusing some_var[y][x] = some_value, which is caused by the computer's memory model underneath. Consequently, I'm always looking for the hierarchy of fastest-changing indices. This explains (and justifies) a lot of the design decisions made for numpy as well.
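To make that concrete, here is a minimal sketch with a toy array of my own (plain numpy, nothing else assumed) showing the row-major layout:

    import numpy as np

    # Row-major (C-order) layout: the last index varies fastest in memory,
    # which is why a[y][x] reads "row y, column x" and why the first axis
    # is the slowest-changing one.
    a = np.arange(6).reshape(2, 3)   # [[0, 1, 2], [3, 4, 5]]
    print(a.ravel(order="C"))        # [0 1 2 3 4 5] -- rows are contiguous in memory
    print(a.strides)                 # e.g. (24, 8) for int64: one step along axis 1
                                     # moves 8 bytes, one step along axis 0 moves a whole row
    print(a[1][2], a[1, 2])          # 5 5 -- C-style and numpy-style indexing agree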
It's not simple because there are two axes of interest: the axis along which you sum, and the axes that are preserved (i.e., the ones that determine the shape of the returned array).
Both of them matter, and once you have been confused about which of the two to supply to axis=..., there is no easy way back out of that confusion. With einsum there is no confusion.
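To illustrate with a throwaway 3x4 array (B is just a name I picked here):

    import numpy as np

    B = np.arange(12).reshape(3, 4)   # 3 rows, 4 columns

    # axis=... names the axis that is summed away, not the one that survives:
    print(B.sum(axis=0).shape)        # (4,) -- rows collapsed, one sum per column
    print(B.sum(axis=1).shape)        # (3,) -- columns collapsed, one sum per row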
If you have a high-dimensional array, then under your latter convention, where you specify the "remaining" axes, you'd need to provide a lengthy list. That'd be user-hostile.
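A sketch of that contrast with a made-up 5-D array (the keep= keyword below is hypothetical, not real numpy):

    import numpy as np

    A = np.random.rand(2, 3, 4, 5, 6)
    print(A.sum(axis=2).shape)   # (2, 3, 5, 6) -- you name the one axis you drop
    # A hypothetical "name the axes you keep" convention would instead need
    # something like A.sum(keep=(0, 1, 3, 4)), and that list grows with ndim.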
On the other hand, einsum is crystal clear and not prone to confusion:
> np.einsum("ij -> j", B) # sum along rows to create one column-like array
> np.einsum("ij -> i", B) # sum along columns to create one row-like array
Edit: More Einstein sum fun at https://stackoverflow.com/questions/26089893/understanding-n...