Search and Replace using Perl

Question

I have some tags with values like below,


is The human nervous system?
A tag is a keyword or label that categorizes your question with other, similar questions

Terms for anatomical directions in the nervous system
A tag is a keyword or label that categorizes your question with other, similar questions


Anatomical terms: is referring to directions
.
.
.

The output I need is like below,


Is the Human Nervous System?
A tag is a keyword or label that categorizes your question with other, similar questions


Terms for Anatomical Directions in the Nervous System
A tag is a keyword or label that categorizes your question with other, similar questions

Anatomical Terms: Is Referring to Directions
.
.

how could I do this using perl. Here all prepositions and articles will be in lower case. Now the condition is slightly differs as below

condition is if a word that is in @lowercase (suppose is) and it is the first word of the and is in lower case then it should be upper case. Again if any @lowercase word after colon in the should be in upper case.

jimtut · Accepted Answer

New answer to match the updated question (sample input and desired output changed since the original question). Updated again on Mar 9, 2014, per the op's request to always uppercase the first word in a title tag.

#!/usr/bin/perl

use strict;
use warnings;

# Add your articles and prepositions here!!!
my @lowercase = qw(a an at for in is the to);

# Use a hash since lookup is easier later.
my %lowercase;
# Populate the hash with keys and values from @lowercase.
# Values could have been anything, but it needs to match the number of keys, so this is easiest.
@lowercase{@lowercase} = @lowercase;

open(F, "foo.txt") or die $!;
while() {
  if (m/^ tags
    my $titleTag = $line;
    $titleTag =~ s/^(<[^>]*>).*/$1/;
    # Remove any tags in <brackets>
    $line =~ s/<[^>]*>//g;
    # Uppercase the first letter in every word, except for those in a certain list.
    my $first = 1;
    foreach my $word (split(/\s/, $line)) {
      if ($first) {
        $first = 0;
        push(@words, ucfirst($word));
        next;
      }
      if ($first || exists $lowercase{$word}) { push(@words, "$word") }
      else { push(@words, ucfirst($word)) }
    }
    print $titleTag . join(" ", @words) . "
";
  }
  else {
    print $_;
  }
}
close(F)

This code does make 2 assumptions:

Each ... is on a single line. It never wraps to more than one line in the file.
The opening </code> tag is at the beginning of the line. This can be easily be changed in the code if desired though.</li> </ol>

Search and Replace using Perl

Answers (2)

Related Questions