What is the usage of vhadd_s8 in Neon intrinsics?

Question

I find the behavior of the halving-add intrinsics quite strange. For example, int8x8_t vhadd_s8(int8x8_t a, int8x8_t b):

Signed Halving Add. This instruction adds corresponding signed integer values from the two source SIMD&FP registers, shifts each result right one bit, places the results into a vector, and writes the vector to the destination SIMD&FP register.
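To make the description above concrete, here is a minimal scalar sketch of what it seems to say for each lane, assuming the sum is formed at a wider precision before the shift so that it cannot overflow. The names hadd_s8_model and hadd_lanes_model are hypothetical helpers for illustration, not part of any Neon API, and the code runs on any target because it does not call the intrinsic itself.

```rust
/// Hypothetical scalar model of one lane of a signed halving add.
fn hadd_s8_model(a: i8, b: i8) -> i8 {
    // Widen to i16 so the intermediate sum cannot overflow (assumption
    // taken from the instruction description: add first, then shift).
    let sum = a as i16 + b as i16;
    // Arithmetic shift right by one bit; the result always fits in i8.
    (sum >> 1) as i8
}

/// Applies the model to all eight lanes of a 64-bit vector of i8.
fn hadd_lanes_model(a: [i8; 8], b: [i8; 8]) -> [i8; 8] {
    let mut out = [0i8; 8];
    for i in 0..8 {
        out[i] = hadd_s8_model(a[i], b[i]);
    }
    out
}

fn main() {
    // Same inputs as in the question: (8 + 1) >> 1 = 4 in every lane.
    println!("{:?}", hadd_lanes_model([8; 8], [1; 8]));
    // The widened sum 200 does not fit in i8, but its half (100) does.
    println!("{}", hadd_s8_model(100, 100));
    // A plain 8-bit add wraps first, so halving afterwards gives -28.
    println!("{}", 100i8.wrapping_add(100) >> 1);
}
```

Under this model the operation behaves like a per-lane average rounded toward negative infinity, and the interesting cases are the ones where a + b would not fit in 8 bits.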

Can anyone explain its usage scenario? The following is an example in Rust:

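    // Requires an AArch64 target; the Neon intrinsics live in std::arch::aarch64.
    use std::arch::aarch64::*;

    let a_v: Vec<i8> = vec![8; 8];
    let b_v: Vec<i8> = vec![1; 8];
    unsafe {
        // Load eight i8 lanes from each buffer.
        let a = vld1_s8(a_v.as_ptr());
        let b = vld1_s8(b_v.as_ptr());
        // Signed halving add on each pair of lanes.
        let c = vhadd_s8(a, b);
        println!("{:?}", c);
    }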

The sum 8 + 1 = 9, shifted right by one bit, becomes 4, so every lane of the result is 4. In which scenario would users expect 4 as the result?

