Change in apple metal kernel does not change execution time

Question

I modified the kernel code in the above project https://developer.apple.com/documentation/metal/basic_tasks_and_concepts/performing_calculations_on_a_gpu?preferredLanguage=occ

from this


kernel void add_arrays(device const float* inA,
                       device const float* inB,
                       device float* result,
                       uint index [[thread_position_in_grid]])
{
    // the for-loop is replaced with a collection of threads, each of which
    // calls this function.
     result[index] = inA[index] + inB[index] ;
    
}

to this just embedding same calculation inside a for loop


kernel void add_arrays(device const float* inA,
                       device const float* inB,
                       device float* result,
                       uint index [[thread_position_in_grid]])
{
    // the for-loop is replaced with a collection of threads, each of which
    // calls this function.
    for(int i=0;i<1000000;i++){ // added
        result[index] = inA[index] + inB[index] ;
    }
}

but the execution time of the program does not change , am I doing something wrong

Change in apple metal kernel does not change execution time

Answers (1)

Related Questions