Jig232
Jig232

Reputation: 37

Hadoop Pig Max Command

I have one file that contains data of all the countries from all over the world.

I want to find out the country which has maximum airport.

I have written below code:

A = load 'airports.dat' USING PigStorage (',') AS(AirportID:int,Name:chararray,City:chararray,Country:chararray,IATA:chararray,IATAothers:chararray,Latitude:float,Longitude:float,Altitude:float,Timezone:float,DST:chararray,Zone:chararray);
B= GROUP A BY Country;
C= FOREACH B GENERATE A.Country, COUNT(A) AS Count;

but after this I am not getting how to find the maximum.

Can anybody please help.

Upvotes: 0

Views: 230

Answers (1)

Ran Locar
Ran Locar

Reputation: 561

You have created the number of airports per country. What you need to do now, is take the row with the highest number:

D = order C by $1 DESC;
E = limit D 1;
dump E;

Upvotes: 1

Related Questions