user3046061
user3046061

Reputation: 353

How do I work globally on a complex data structure using subroutines?

Specifically my data structure looks like this

{
  "SomeGuy" => {
    date_and_time => "11-04-2013",
    Id => 7,
    nr => 52,
    picks => [
      { "This is an option" => "Option3" },
      { "This is another option" => "Option4" },
      { "This is another option" => "Option1" },
      { "And another one" => "Something" },
    ],
    time_of => "06:11 AM",
    total => 1,
  },
  "Another Guy" => { ... },
}

This is output via Data::Dump. The actual data structure contains a lot more records like "SomeGuy". The structure is identical for all of them.

I populate this data structure this way:

$guys->{$profile}{options}[$total++]{$var} = $var2;
$guys->{$profile}{Id} = $i;
$guys->{$profile}{date_and_time} = get_date($Time[0]);
$guys->{$profile}{time_of} = $Time[1];
$guys->{$profile}{total} = keys (% {$guys->{$profile}{options}[0]});
$guys->{$profile}{nr} = $pNr;

Having all this, what I want to do next is operate on this data structure. I repeat that there many many records in the data structure.

When I output the contents of it I get it in jumbled order, as to say, not in the order that it was populated. I've tried this with Data::Dumper, Data::Dump and iterating through the records myself by hand.

I know that the methods in the Data namespace are notorious for this, that's why Data::Dumper provides a way to sort through a subroutine and Data::Dump provides a default one.

So I have the data structure. It looks like I expect it to be, I have all the data as I knew it should look in it, valid. I want to sort the records according to their Id field. My thinking is that I have to use a subroutine and essentially pass a reference of the data structure to it and do the sorting there.

 sub sortt {
    my $dref = shift @_;
    foreach my $name ( sort { $dref->{$a}{Id} <=> $dref->{$b}{Id} } keys %$dref ) {
      print "$data->{$name}{Id}: $name \n";
    }
 }

This gets called with (in the same scope where the structure is populated, so no worries there):

 sortt(\$guys);

The error is:

 Not a HASH reference at perlprogram.pl line 452

So I go and use ref in the subroutine to make sure I'm passing an actual reference. And it says REF.

Next I go into desperate mode and try some stupid things like calling it with:

sortt(\%$guys)

But if I'm not mistaking this just sends a copy to the subroutine and just sorts that copy locally, so no use there.

It's no use if I make a copy and return it from the subroutine, I just want to pass a reference of my data structure and sort it and have it reflect those changes globally (or in the calling scope per se). How would I do this?

Upvotes: 0

Views: 185

Answers (5)

Borodin
Borodin

Reputation: 126732

With your heart set on a sorted data structure, I recommend the following. It is a simple array of hashes and, rather than using the name string as the key for a single-element hash, it adds a new name key to each guy's data.

I hope that, with the Data::Dump output, it is self-explanatory. It is sorted by the Id field as you requested, but it still has the disadvantage that a separate index array would allow any ordering at all without modifying or copying the original hash data.

use strict;
use warnings;

use Data::Dump;

my $guys = {
  "SomeGuy " => {
    date_and_time => "11-04-2013",
    Id => 7,
    nr => 52,
    picks => [
      { "This is an option" => "Option3" },
      { "This is another option" => "Option4" },
      { "This is another option" => "Option1" },
      { "And another one" => "Something" },
    ],
    time_of => "06:11 AM",
    total => 1,
  },
  "Another Guy" => { Id => 9, nr => 99, total => 6 },
  "My Guy" => { Id => 1, nr => 42, total => 3 },
};

my @guys_sorted = map {
  my $data = $guys->{$_};
  $data->{name} = $_;
  $data;
}
sort {
  $guys->{$a}{Id} <=> $guys->{$b}{Id}
} keys %$guys;

dd \@guys_sorted;

output

[
  { Id => 1, name => "My Guy", nr => 42, total => 3 },
  {
    date_and_time => "11-04-2013",
    Id => 7,
    name => "SomeGuy ",
    nr => 52,
    picks => [
      { "This is an option" => "Option3" },
      { "This is another option" => "Option4" },
      { "This is another option" => "Option1" },
      { "And another one" => "Something" },
    ],
    time_of => "06:11 AM",
    total => 1,
  },
  { Id => 9, name => "Another Guy", nr => 99, total => 6 },
]

Upvotes: 0

Borodin
Borodin

Reputation: 126732

Perl hashes are inherently unordered. There is no way you can sort them, or reorder them at all. You have to write code to to access the elements in the order you want.

Your sortt subroutine does nothing but print the ID and the name of each hash element, sorted by the ID. Except that it doesn't, because you are trying to use the variable $data when you have actually set up $dref. That is likely to be the cause of your Not a HASH reference error, although unless you show your entire code, or at least indicate which is perlprogram.pl line 452, then we cannot help further.

The best way to do what you want is to create an array of hash keys, which you can sort in whatever order you want. Like this

my @names_by_id = sort { $guys->{$a}{Id} <=> $guys->{$b}{Id} } keys %$guys;

Then you can use this to access the hash in sorted order, like this, which prints the same output as your sortt is intended to, but formatted a little more nicely

for my $name (@names_by_id) {
  printf "%4d: %s\n", $guys->{$name}{Id}, $name;
}

If you want to do anything else with the hash elements in sorted order then you have to use this technique.

Upvotes: 1

DVK
DVK

Reputation: 129433

Leaving aside your syntax problem with "Not a ref", you are approaching the problem from the wrong end in the first place. I'll leave small syntactic details to others (see Ikegami's comment).

You can NOT sort them at all, because $guys is a hash, not an array. Hashes are NOT ever sorted in Perl. If you want to sort it, your have three solutions:

  1. Store an ordered list of names as a separate array.

    my @ordered_names = sort { $guys->{$a}{Id} <=> $guys->{$b}{Id} } keys %$guys

    Then you use the array for ordering and go back to hashref for individual records.

  2. Add the name to the individual guy's hash, instead of outer hash reference. $guys should be an array reference. The downside to this approach is that you can't look up a person's record by their name any more - if that functionality is needed, use #1.

    @codnodder's answer shows how to do that, if you don't care about accessing records by name.

  3. Use a Tie::* module. (Tie::IxHash, Tie::Hash::Sorted). NOT recommended since it's slower.

Upvotes: 4

David W.
David W.

Reputation: 107040

That's a pretty complex data structure. If you commonly use structures like this in your program, I suggest you take your Perl programming skills up a notch, and look into learning a bit of Object Oriented Perl.

Object Oriented Perl is fairly straight forward: Your object is merely that data structure you've previously created. Methods are merely subroutines that work with that object. Most methods are getter/setter methods to set up your structure.

Initially, it's a bit more writing, but once you get the hang of it, the extra writing is easily compensated by the saving is debugging and maintaining your code.

Object Oriented Perl does two things: It first makes sure that your structure is correct. For example:

$some_guy->{Picks}->[2]->{"this is an option"} = "Foo!";

Whoops! That should have been {picks}. Imagine trying to find that error in your code.

In OO-Perl, if I mistyped a method's name, the program will immediately pick it up:

$some_guy->picks(
    {
        "This is an option" -> "Foo!",
        "This is option 2"  => "Bar!",
    }
)

If I had $some_guy->Picks, I would have gotten a runtime error.

It also makes you think of your structure as an object. For example, what are you sorting on? You're sorting on your Guys' IDs, and the guys are stored in a hash called %guys.

# $a and $b are hash keys from `%guys`.
# $guys{$a} and $guys{$b} represent the guy objects.
# I can use the id method to get they guys' IDs
sort_guys_by_id {
    guys{$a}->id cmp guys{$b}->id;  #That was easy!
}

Take a look at the tutorial. You'll find yourself writing better programs with fewer errors.

Upvotes: 0

codnodder
codnodder

Reputation: 1675

$guys is already a hash ref, so you just need sortt($guys)

If you want a sorted data structure, you need something like this:

my @guys_sorted =
    map  { { $_ => $guys->{$_} } }
    sort { $guys->{$a}{Id} <=> $guys->{$b}{Id} } keys %$guys;

print(Dumper(\@guys_sorted));

Or, in a sub:

sub sortt {
    # returns a SORTED ARRAY of HASHREFS
    my $ref = shift;
    return map  { { $_ => $ref->{$_} } }
           sort { $ref->{$a}{Id} <=> $ref->{$b}{Id} } keys %$ref;
}

print(Dumper([sortt($guys)]));

Upvotes: 0

Related Questions