I don't know exactly what this code does, but it looks like it is trying to access the second src (`src[1]`) of an `rms_norm` tensor, which only has one src (`src[0]`): https://github.com/ggml-org/ggml/blob/fcc2a5c0cfd81ee0517ee42f1acdc371ec92d598/src/ggml-cuda/ggml-cuda.cu#L2904
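
To illustrate the concern, here is a rough, self-contained sketch of the kind of guard I have in mind. None of this is the real ggml code: `Tensor`, `Op`, `kMaxSrc`, `num_srcs` and `checked_src` are hypothetical stand-ins, and the sketch assumes that unused src slots are null.

```cpp
#include <cassert>

// Hypothetical stand-ins for illustration only, not the real ggml definitions.
constexpr int kMaxSrc = 4;

enum class Op { RmsNorm, Mul };

struct Tensor {
    Op op;
    const Tensor * src[kMaxSrc];  // assumption: unused slots are nullptr
};

// Number of inputs each op takes in this sketch: rms_norm has one, mul has two.
inline int num_srcs(Op op) {
    switch (op) {
        case Op::RmsNorm: return 1;
        case Op::Mul:     return 2;
    }
    return 0;
}

// Returns src[i] only if the op actually has an i-th input, otherwise nullptr.
inline const Tensor * checked_src(const Tensor & t, int i) {
    assert(i >= 0 && i < kMaxSrc);  // index must stay inside the src array
    if (i >= num_srcs(t.op)) {
        return nullptr;             // e.g. src[1] of an rms_norm node
    }
    return t.src[i];
}
```

With something like this, asking for the second src of an `rms_norm` node yields `nullptr` that the caller has to handle, instead of being dereferenced directly.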