Optimally finding the index of the maximum element in BASH array

Question

I am using bash in order to process software responses on-the-fly and I am looking for a way to find the index of the maximum element in the array.

The data that gets fed to the bash script is like this:

And so I create two arrays. There is

arr1 = [ 25 72 3 0 0 ]
arr2 = [ 9   0 3 4 7 ]

And what I need is to find the index of the maximum number in arr1 in order to use it also for arr2. But I would like to see if there is a quick - optimal way to do this.

Would it maybe be better to use a dictionary structure [key][value] with the data I have? Would this make the process easier?

I have also found [1] (from user jhnc) but I don't quite think it is what I want.

My brute - force approach is the following:


function MAX {

   arr1=( 25 72 3 0 0 )
   arr2=( 9   0 3 4 7 )

   local indx=0
   local max=${arr1[0]}
   local flag

   for ((i=1; i<${#arr1[@]};i++)); do

       #To avoid invalid arithmetic operators when items are floats/doubles
       flag=$( python <<< "print(${arr1$[${i}]} > ${max})")    

       if [ $flag == "True" ]; then

           indx=${i}
           max=${arr1[${i}]}

       fi

    done

    echo "MAX:INDEX = ${max}:${indx}"
    echo "${arr1[${indx}]}"
    echo "${arr2[${indx}]}"

}

This approach obviously will work, BUT, is it the optimal one? Is there a faster way to perform the task?

arr1 = [ 99.97 0.01 0.01 0.01 0 ]
arr2 = [ 0 6 4 3 2 ]

In this example, if an array contains floats then I would get a

syntax error: invalid arithmetic operator (error token is ".97)

So, I am using

flag=$( python <<< "print(${arr1$[${i}]} > ${max})")

In order to overcome this issue.

chepner · Accepted Answer

Finding a maximum is inherently an O(n) operation. But there's no need to spawn a Python process on each iteration to perform the comparison. Write a single awk script instead.

awk 'BEGIN {
   split(ARGV[1], a1);
   split(ARGV[2], a2);
   max=a1[1];
   indx=1;
   for (i in a1) {
     if (a1[i] > max) {
       indx = i;
       max = a1[i];
     }
   }
   print "MAX:INDEX = " max ":" (indx - 1)
   print a1[indx]
   print a2[indx]
}' "${arr1[*]}" "${arr2[*]}"

The two shell arrays are passed as space-separated strings to awk, which splits them back into awk arrays.

Optimally finding the index of the maximum element in BASH array

Answers (2)

Related Questions