How does 'group as' work in Pig?

Question

I'm having trouble understanding how group by group_name works in a foreach loop.

Let's say we already have a variable named grouped_data that was defined as:

grouped_data = group dataset by (emp_id, dept_id);

And then we want to iterate over each record in grouped_data with an aggregated column added in. So the following is written:

with_hours_worked = FOREACH grouped_data 
                    GENERATE group AS grp, 
                             SUM(dataset.worked_hours) AS hours ;

I'm confused as to what is going on in that last line, especially the group AS grp part. Is grp a tuple? Is the line from grouped_data converted back into a group? If so, why?

How does 'group as' work in Pig?

Answers (1)

Related Questions

How does &#39;group as&#39; work in Pig?

Answers (1)

Related Questions

How does 'group as' work in Pig?