How to remove duplicate rows of a Numpy array

Question 1

In a Numpy array, there are several duplicate rows. How can I remove those duplicate rows?

E.g.

array([[1, 2, 3],
[4, 5, 6],
[7, 8, 9],
[1, 2, 3],
[4, 5, 6]])

Question 2

You can use numpy .unique() to select only unique rows. Since you want unique rows, you need to use parameter axis=0 .

>>> import numpy as np
>>> data=np.array([[1,2,3],[4,5,6],[7,8,9],[1,2,3],[4,5,6]])
>>> data
array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9],
       [1, 2, 3],
       [4, 5, 6]])
>>> np.unique(data, axis=0)
array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])

pkumar81 · Answer 1 · 2019-10-16T11:47:12+0000

You can use numpy .unique() to select only unique rows. Since you want unique rows, you need to use parameter axis=0 .

>>> import numpy as np
>>> data=np.array([[1,2,3],[4,5,6],[7,8,9],[1,2,3],[4,5,6]])
>>> data
array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9],
       [1, 2, 3],
       [4, 5, 6]])
>>> np.unique(data, axis=0)
array([[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]])

How to remove duplicate rows of a Numpy array

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Related questions

Categories