Find a string with space and without space from the INI file

Question

My Input *INI file:


leftarrow = {
ot\leftarrow}

rightarrow = {
ot
ightarrow}

leftrightarrow = {
ot\leftrightarrow}

I need to find the ot\leftarrow and replace with the string leftarrow. The things is here ot\leftarrow we need to check whether the space is available in the finding string eg: ot \leftarrow. Both the things we need to replace the string.

In Main file:

 $str=~s/\not\leftarrow/\nleftarrow/g;
 $str=~s/\not \leftarrow/\nleftarrow/g;

My Code:

foreach my $repStr(@tags) #storing all the `*INI` lines in the Array
{
    my ($findStr, $replaceStr) = ($repStr =~ /^([^\s]*)\s*=\s*\{([^\{\}]*)\}/i);
    $str=~s/$replaceStr/$findStr/g;

    #I need to check the string with space

}

Could anyone can guide me the way to code the request.

Sinan &#220;n&#252;r · Accepted Answer

To fix ideas, the following are given:

A configuration file

This maps TeX/LaTeX macro names to sequences of TeX/LaTeX commands. It is in macro name = command sequence format. E.g.:


leftarrow = {
ot\leftarrow}

rightarrow = {
ot
ightarrow}

leftrightarrow = {
ot\leftrightarrow}

It is not clear what function is served by the braces which enclose the replacements. Do they appear in the source file (see below)? Or, do they delimit whitespace? In short, are they supposed to be stripped?

It would have been much better if you had provided just a paragraph of sample text instead of requiring us to make assumptions about it.

A TeX/LaTeX source file

In this, there are command sequences which need to be replaced with macro names based on the mapping in the configuration file mentioned above. However, because TeX eats spaces after macro invocations, one needs to consider the possibility of a space character in between every command in the replacement sequence. In the example above, there are only two commands in each replacement sequence, but it is not hard to imagine more.

It would have been much better if you had provided just a paragraph of sample text instead of requiring us to construct it.

You seem to have cooked up a custom way of parsing the mapping of replacements. I would recommend using a decent parser instead, e.g. Config::INI::Reader.

#!/usr/bin/env perl

use strict;
use warnings;

use Config::INI::Reader;

my $ini_contents = <<'EO_INI';

leftarrow = {
ot\leftarrow}

rightarrow = {
ot
ightarrow}

leftrightarrow = {
ot\leftrightarrow}
EO_INI

my $tex_source = <<'EO_TEX';
Lorem ipsum dolor 
ot\leftarrow{} sit amet, ea quem idque senserit eum, in

ot 
ightarrow{} duo amet recusabo sensibus. Mei velit suavitate ei, ferri
consequuntur vis eu, qui unum volumus an. Rebum democritum no nec, et 
ot
\leftrightarrow{} eam natum patrioque, mentitum evertitur reprimique nec te.
Usu et docendi 
ot
ightarrow{} partiendo, eos ut assum errem simul.
EO_TEX

# Helper function to deal with matches with spaces
# because our mapping does not have sequences
# containing spaces.
sub match_to_key {
    my ($s) = @_;
    $s =~ s/\s+//g;
    return $s;
}

# Assume mappings appear in a single global section only
my $macro_definition = Config::INI::Reader->read_string($ini_contents)->{_};

# Assuming { and } need to be removed
for (values %$macro_definition) {
    s/^\{//;
    s/\}\z//;
}

# map command sequences to replacement macros
$macro_definition = { reverse %$macro_definition };

my $command_sequence_pat = join '|',
    sort { length($b) <=> length($a) }
    map join('\s?', map quotemeta, m{ (\\w+) }gx),
    keys %$macro_definition
;

print "Text before replacement:
";

print ">>>$tex_source<<<

";

$tex_source =~ s/($command_sequence_pat)/$macro_definition->{match_to_key($1)}/g;

print "Text after replacement:
";

print ">>>$tex_source<<<

";

Note that the wrapping of original text might get messed up.

Output:

Text before replacement:
>>>Lorem ipsum dolor 
ot\leftarrow{} sit amet, ea quem idque senserit eum, in

ot 
ightarrow{} duo amet recusabo sensibus. Mei velit suavitate ei, ferri
consequuntur vis eu, qui unum volumus an. Rebum democritum no nec, et 
ot
\leftrightarrow{} eam natum patrioque, mentitum evertitur reprimique nec te.
Usu et docendi 
ot
ightarrow{} partiendo, eos ut assum errem simul.
<<<

Text after replacement:
>>>Lorem ipsum dolor 
leftarrow{} sit amet, ea quem idque senserit eum, in

rightarrow{} duo amet recusabo sensibus. Mei velit suavitate ei, ferri
consequuntur vis eu, qui unum volumus an. Rebum democritum no nec, et 
leftrightarrow{} eam natum patrioque, mentitum evertitur reprimique nec te.
Usu et docendi 
rightarrow{} partiendo, eos ut assum errem simul.
<<<

Find a string with space and without space from the INI file

Answers (2)

A configuration file

A TeX/LaTeX source file

Related Questions