I need to find the most efficient way to remove duplicates from a list in Python.
I am doing it this way:
```python
mj2 = []
for i in mj:
    if i not in mj2:
        mj2.append(i)
```
where `mj` is a list like `[2, 4, 4, 4, 4, 4, 9, 9]` and the output `mj2` is of the form `[2, 4, 9]`.
Is there a more efficient way that doesn't involve loops? I have to process large lists.
The simplest is to use `set()`:
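A minimal sketch of that approach, reusing `mj` from the question:

```python
mj = [2, 4, 4, 4, 4, 4, 9, 9]

# set() discards the duplicates; list() turns the result back into a list
mj2 = list(set(mj))
print(mj2)  # [2, 4, 9] (element order is not guaranteed)
```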
If you want to keep the order (a `set` is an unordered collection of elements), you can apply `sorted()` at the end:
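For example:

```python
mj = [2, 4, 4, 4, 4, 4, 9, 9]

# sorted() already returns a list, so no extra conversion is needed
mj2 = sorted(set(mj))
print(mj2)  # [2, 4, 9]
```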
Another option, if your list is originally ordered and you want to maintain that order, is to use the `OrderedDict` class and leverage it to preserve the order:
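For example:

```python
from collections import OrderedDict

mj = [2, 4, 4, 4, 4, 4, 9, 9]

# fromkeys() keeps only the first occurrence of each element,
# in insertion order
mj2 = list(OrderedDict.fromkeys(mj))
print(mj2)  # [2, 4, 9]
```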
`OrderedDict` is an implementation of dictionaries that "remembers" the order in which its elements were inserted. You can therefore use the `fromkeys` dictionary method to take the elements of `mj` as the keys of the dictionary; since the elements of `mj` are already ordered, the order is preserved. You can test the performance with the following line of code:
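A sketch using the standard `timeit` module (the statement being timed and the repetition count are illustrative choices):

```python
from timeit import timeit

mj = [2, 4, 4, 4, 4, 4, 9, 9]

# time the order-preserving variant; number=1000000 is an arbitrary count
print(timeit('sorted(set(mj))', globals=globals(), number=1000000))
```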
although applying `sorted()` consumes some extra resources. If you don't have problems with the order, you can use the following:
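For example:

```python
from timeit import timeit

mj = [2, 4, 4, 4, 4, 4, 9, 9]

# dropping sorted() skips the extra O(n log n) pass
print(timeit('list(set(mj))', globals=globals(), number=1000000))
```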
If the original list is very large and already ordered, it's much more efficient to use `itertools.groupby`,
which creates an iterator without creating new lists:
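A sketch of that idea: `groupby()` yields one `(key, group)` pair per run of equal elements, so on a sorted list the keys are exactly the distinct values:

```python
from itertools import groupby

mj = [2, 4, 4, 4, 4, 4, 9, 9]

# taking the keys lazily gives the distinct values without
# building any intermediate container
unique = (k for k, _ in groupby(mj))
print(list(unique))  # [2, 4, 9]
```

Since `groupby` is lazy, it is possible to get the first elements without processing the whole list, for example with `itertools.islice`:

```python
from itertools import groupby, islice

mj = [2, 4, 4, 4, 4, 4, 9, 9]

# islice() stops after two keys; the rest of mj is never consumed
first_two = [k for k, _ in islice(groupby(mj), 2)]
print(first_two)  # [2, 4]
```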
For the groups themselves you can also do it like this:
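For instance, materializing each run of equal elements as its own list (`groups` is a hypothetical name):

```python
from itertools import groupby

mj = [2, 4, 4, 4, 4, 4, 9, 9]

# each group is an iterator over one run of equal elements;
# it must be converted to a list before the next group is consumed
groups = [list(g) for _, g in groupby(mj)]
print(groups)  # [[2], [4, 4, 4, 4, 4], [9, 9]]
```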