Reputation: 117

What is an easy way to remove duplicates from only part of the string in Python?

I have a list of strings that goes like this:

I would like to remove all duplicates where the last two numbers are the same. After running the list through the program, I would get something like this:

But something like

would also be correct.

Upvotes: 0

Answers (3)

One Lyner

Reputation: 2004

Here is a nice and fast trick you can use (assuming l is your list):

list({ s.split(';', 1)[1] : s for s in l }.values())

No imports are needed, and it runs in a single pass over the list. Note that for each key the dict keeps the last string seen, not the first.

In general you can define:

def custom_unique(L, keyfunc):
    return list({ keyfunc(li): li for li in L }.values())
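As a rough usage sketch, here is custom_unique run on the sample strings from the third answer. The helper custom_unique_first is a hypothetical variant, not part of the original answer, that keeps the first occurrence of each key instead of the last by using dict.setdefault.

```python
def custom_unique(L, keyfunc):
    # Later items overwrite earlier ones, so the LAST item per key survives.
    return list({keyfunc(li): li for li in L}.values())

def custom_unique_first(L, keyfunc):
    # Hypothetical variant: setdefault only stores a value when the key is
    # new, so the FIRST item per key survives.
    seen = {}
    for li in L:
        seen.setdefault(keyfunc(li), li)
    return list(seen.values())

l = ['7;213;164', '8;213;164', '9;145;112', '10;145;112', '11;145;112']
key = lambda s: s.split(';', 1)[1]  # everything after the first ';'

last_kept = custom_unique(l, key)
first_kept = custom_unique_first(l, key)
print(last_kept)   # ['8;213;164', '11;145;112']
print(first_kept)  # ['7;213;164', '9;145;112']
```

Both variants rely on dicts preserving insertion order, which is guaranteed from Python 3.7 onward.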

Upvotes: 1

Tom Ron

Reputation: 6181

You can group the items by this key and then use the first item in each group (assuming l is your list).

import itertools
keyfunc = lambda x: x.split(";", 1)[1]
[next(g) for k, g in itertools.groupby(sorted(l, key=keyfunc), keyfunc)]
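A small sketch of what this approach returns, using the sample strings from the third answer. The wrapper name first_per_group is made up for illustration. Because sorted() is stable, each group still starts with its earliest original item, but the groups themselves come out ordered by the key string rather than in input order.

```python
import itertools

def first_per_group(items, keyfunc):
    # groupby only merges ADJACENT equal keys, so the list is sorted first.
    # The sort is stable, so next(g) is the earliest item with that key.
    ordered = sorted(items, key=keyfunc)
    return [next(g) for _, g in itertools.groupby(ordered, keyfunc)]

l = ['7;213;164', '8;213;164', '9;145;112', '10;145;112', '11;145;112']
keyfunc = lambda x: x.split(";", 1)[1]

result = first_per_group(l, keyfunc)
print(result)  # ['9;145;112', '7;213;164']
```

The key '145;112' sorts before '213;164' as a string, which is why the order of the two survivors is swapped relative to the input.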

Upvotes: 1

snatchysquid

Reputation: 1352

Here is some code that runs on the first few items; just swap in your own list:

x = [
'7;213;164',
'8;213;164',
'9;145;112',
'10;145;112',
'11;145;112',
]
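new_list = []
for i in x:
    # Everything from the first ';' onward is the part that must be unique.
    s_part = i[i.find(';'):]
    # Compare against the same part of each kept string; a plain substring
    # test would also match e.g. ';13;16' inside '1;13;164'.
    if not any(j[j.find(';'):] == s_part for j in new_list):
        new_list.append(i)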

print(new_list)

Output:

['7;213;164', '9;145;112']

Upvotes: 0
