How to properly replace long strings in Perl

Question

Let's say I want to replace every occurrence of 12345 in file1 with the contents of file2. The following code works:

use strict;
use warnings;

local $/;

open my $fh, 'file1' or die "$!";
my $file1 = <$fh>;
close $fh;

open $fh, 'file2' or die "$!";
my $file2 = <$fh>;
close $fh;

my $file3 = $file1 =~ s/12345/$file2/gr;

open $fh, '>>', 'file3' or die "$!";
print $fh $file3;
close $fh;

I'm probably overthinking this but from a theoretical standpoint what would be the proper, efficient way to do this without causing a lot of copying in memory? Also, I'm new to Perl so feel free to point out anything here that goes against Perl best practices.

Nothing wrong with this. Can clean it up a little and add checks, and/or use libraries like File::Slurper (or Path::Tiny, specially if have other use of it elsewhere in the program). — zdim, Commented Sep 24 at 21:15
I assume you know this but let me state it just in case -- the value of local is that it holds inside the nearest scope and what it affected gets undone on scope exit (previous values restored). Read the link. So we want to make sure that it is scoped as tightly as possible so normally you'd enclose local and then the <> filehandle manipulation in a block. If there is no further use of fh put it all in a block and then the filehandle gets closed on scope exit, too. See for instance this post — zdim, Commented Sep 25 at 20:02
use autodie 'open', ':default'; is nice if you aren't going to use File::Slurper — ysth, Commented Sep 26 at 13:54

ikegami · Accepted Answer · 2025-09-24 22:03:05Z

If you're writing out to a different file, and if the pattern doesn't span lines, you could read the file to modify one line at a time.

But reading the entire file into memory is also perfectly acceptable (and faster) unless you have concerns about the size of the file. And it's simpler if you want to write back to the same file.

Adding some whitespace makes of a huge improvement in readability (which @Miller did by editing your question).

But a module like File::Slurper can make it even cleaner.

use File::Slurper qw( read_text write_text );

my $to_insert_qfn = ...;
my $file_qfn      = ...;

my $to_insert = read_text( $to_insert_qfn );

my $file = read_text( $file_qfn );
$file =~ s/12345/$to_insert/g;
write_text( $file_qfn, $file );

Gilles Quénot · Accepted Answer · 2025-09-24 22:13:21Z

What I would do, is to use the power of a Perl one-liner:

$ cat file1
ok
12345
ok
12345

$ cat file2
FELEBLEB

$ perl -spe 's/\b12345\b/$file2/g' -- -file2=$(<file2) file1
ok
FELEBLEB
ok
FELEBLEB

If you need to replace in place:

$ perl -i -spe 's/\b12345\b/$file2/g' -- -file2=$(<file2) file1

Breakdown:

-i
Edit files in place (changes are written directly into file1). (You could use -i.bak to keep a backup copy.)

-s
Enables parsing of command-line options into Perl variables. This allows passing -file2=... and then using $file2 inside the script.

-p
Wraps the given code ('s/.../.../g') in a loop that reads each line, applies the code, and prints the line back.

-e 's/\b12345\b/$file2/g'
The actual Perl code: s/.../.../g is a global regex substitution.
\b12345\b = match the exact number 12345 as a whole word (\b: word boundary).

$file2: Perl variable whose value is passed via the command line.

--:
End of Perl options. Everything after is either -s variables or file names.

-file2=$(<file2):
Sets the Perl variable $file2.

$(<file2):
Shell syntax that reads the entire contents of the file file2.

So $file2 (in the Perl script) gets that content.

file1:
The input file where Perl will search for 12345 and replace it with the content of $file2.

In short: This command reads the contents of file2, assigns them to the Perl variable $file2, then searches through file1 for whole-word occurrences of 12345 and replaces them with $file2, editing file1 in place.

ysth · Accepted Answer · 2025-09-29 22:58:09Z

First of all, because you asked for general Perl advice, strict and warnings are great, so great that they are automatic when you specify a sufficient minimum perl version: if you say if use v5.12; or higher you get strict, if use v5.36; or higher you get both. Also, new default features will be enabled and some legacy misfeatures disabled based on the version. So decide on your minimum version, and read up on what great modern perl stuff you can use.

If file1 is large, you can avoid lots of copying by looping and only reading up to the string you are replacing (assuming it is a fixed string, not a pattern). (If file1 is small, the extra overhead of this will make it slightly slower.). You do this by setting the input record separator to the string, then using chomp to remove it (and incidentally detect whether it was in fact there or you just reached the end of the file):

$/ = '12345';
while (my $chunk = readline($fh1)) {
    if (chomp $chunk) {
        print $fh3 $chunk, $file2;
    }
    else {
        print $fh3 $chunk;
    }   
}

Collectives™ on Stack Overflow

How to properly replace long strings in Perl

3 Answers 3

1 Comment

Breakdown:

Comments

2 Comments

Your Answer

Post as a guest

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

1 Comment

Breakdown:

Comments

2 Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Linked

Related