Extracting words from file but each word once

Question

I want to write a perl program which reads the file and extracts the dates in it. However if a date passes more than one times I will print it only once. For example:

On 01/10/2011 I went home. On 02/02/2012, I
went to my school. On 02/02/2012, I went
to London.

The output should be:

01/10/2011
02/02/2012

I can do it by adding the dates to an array and control it in every time I read an new date. But I am asking for a more efficient way. Is there a logical way to do it? or any data structure in perl?

mpapec · Accepted Answer

It will scan line by line looking for dates in \d\d/\d\d/\d{4} format and save them in hash as keys.

When file reading is done, it prints these unique keys.

perl -nE '$s{$_}++ for m| (\d\d/\d\d/\d{4}) |xg;}{say for sort keys %s' file

It can be translated to more readable form (plus some checks)

use strict;
open my $fh, "<", "file" or die $!;

my %s;
while (my $line = <$fh>) {

  my @dates = $line =~ m| (\d\d/\d\d/\d{4}) |xg;

  for my $date (@dates) {
    $s{$date} += 1;
  }
}

for my $date (sort keys %s) {

  print $date, "
";
}

Extracting words from file but each word once

Answers (2)

Related Questions