Reputation: 420
How do I write code in Python to do this? I want to read two strings that are multiple lines and much text, mainly to compare how similar they are (qualitatively.)
s1 = 'I want to read these texts side by side and see how similar they really are'
s2 = 'I really want to read these here texts side by side to see how similar they are (qualitatively)'
print_side_by_side(s1,s2)
Output:
I want to read these texts side by side and see ho I really want to read these here texts side by sid
w similar they really are e to see how similar they are (qualitatively)
Upvotes: 3
Views: 6213
Reputation: 56895
Here's an approach using slicing:
def print_side_by_side(a, b, size=30, space=4):
while a or b:
print(a[:size].ljust(size) + " " * space + b[:size])
a = a[size:]
b = b[size:]
s1 = 'I want to read these texts side by side and see how similar they really are'
s2 = 'I really want to read these here texts side by side to see how similar they are (qualitatively)'
print_side_by_side(s1, s2)
Output:
I want to read these texts sid I really want to read these he
e by side and see how similar re texts side by side to see h
they really are ow similar they are (qualitati
vely)
This can be generalized to work on any number of strings:
def side_by_side(strings, size=30, space=4):
strings = list(strings)
result = []
while any(strings):
line = []
for i, s in enumerate(strings):
line.append(s[:size].ljust(size))
strings[i] = s[size:]
result.append((" " * space).join(line))
return "\n".join(result)
if __name__ == "__main__":
strings = "aaaaaaaa", "bbbbbbbbbbbbbb", "ccccccc"
print(side_by_side(strings, size=5, space=1))
Output:
aaaaa bbbbb ccccc
aaa bbbbb cc
bbbb
If you want to handle strings with newlines, either pre-split them in advance of calling this algorithm or try something like:
def side_by_side(strings, size=30, space=4):
strings = list(strings)
result = []
while any(strings):
line = []
for i, s in enumerate(strings):
buf = s[:size]
try:
n = buf.index("\n")
line.append(buf[:n].ljust(size))
strings[i] = s[n+1:]
except ValueError:
line.append(buf.ljust(size))
strings[i] = s[size:]
result.append((" " * space).join(line))
return "\n".join(result)
Note that if you're looking for more capability than this, Python has an industrial-strength solution for general diffing called difflib.
Upvotes: 7
Reputation: 7846
Another approach in which you slice and zip the two strings in order to print them side by side is this:
s1 = 'I want to read these texts side by side and see how similar they really are'
s2 = 'I really want to read these here texts side by side to see how similar they are (qualitatively)'
maxChars = 40
maxLength = max(len(s1),len(s2))
s1 = s1.ljust(maxLength," ")
s2 = s2.ljust(maxLength," ")
s1 = [s1[i:i+maxChars] for i in range(0,len(s1),maxChars)]
s2 = [s2[i:i+maxChars] for i in range(0,len(s2),maxChars)]
for elem1, elem2 in zip(s1,s2):
print(elem1.ljust(maxChars," "), end=" ")
print(elem2)
Output:
I want to read these texts side by side I really want to read these here texts s
and see how similar they really are ide by side to see how similar they are
(qualitatively)
Upvotes: 0
Reputation: 420
This is what I've done. Let me know if there is a better way!
def print_side_by_side(sa,sb):
def populate_line(donor, receiver, lim):
while donor and len(receiver)<lim:
new_char = donor.pop(0)
if new_char =='\n':
break
receiver.append(new_char)
while len(receiver) < lim:
receiver.append(' ')
la = list(sa)
lb = list(sb)
# number of chars per line; may have to tweak
nline = 100
na = nline//2
nb = nline-na
lines_a = []
lines_b = []
while la or lb:
line_a = []
line_b = []
populate_line(la,line_a,na)
populate_line(lb,line_b,nb)
lines_a.append(line_a)
lines_b.append(line_b)
while len(lines_a) > len(lines_b):
lines_b.append([' ' for k in range(nb)])
while len(lines_b) > len(lines_a):
lines_a.append([' ' for k in range(na)])
assert len(lines_a) == len(lines_b)
lines_a = [''.join(l) for l in lines_a]
lines_b = [''.join(l) for l in lines_b]
lines = [lines_a[k] + ' ' + lines_b[k] for k in range(len(lines_a))]
print('\n'.join(lines))
Upvotes: 0