[Pandas] Removing duplicate rows from a DataFrame / removing duplicate rows based on a specific column

활연개랑

Python

승해tmdhey 2021. 7. 15. 14:25

To remove duplicate rows from a DataFrame,

use DataFrame.drop_duplicates().

Suppose the DataFrame is named fifa:

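import pandas as pd

# fifa is assumed to be an already-loaded DataFrame.
# drop_duplicates() returns a new DataFrame, so assign the result to keep it.
fifa = fifa.drop_duplicates()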

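Called this way, drop_duplicates() keeps the first occurrence of each set of duplicated rows and removes the rest (the default is keep='first'). It returns a new DataFrame and leaves the original unchanged unless inplace=True is passed.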
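As a rough illustration of this behavior, the sketch below builds a small stand-in for fifa. The column names (player, team) and the values are hypothetical and are not taken from the original data; only the keep parameter of drop_duplicates() is being demonstrated.

```python
import pandas as pd

# Hypothetical sample data standing in for the fifa DataFrame.
# Rows 0 and 2 are identical, and rows 1 and 4 are identical.
fifa = pd.DataFrame({
    "player": ["A", "B", "A", "C", "B"],
    "team": ["X", "X", "X", "Y", "X"],
})

# Default (keep="first"): the first occurrence of each duplicate is kept.
deduped = fifa.drop_duplicates()

# keep="last": the last occurrence of each duplicate is kept instead.
keep_last = fifa.drop_duplicates(keep="last")

# keep=False: every row that has a duplicate is dropped.
drop_all = fifa.drop_duplicates(keep=False)

print(deduped)
print(keep_last)
print(drop_all)
```

Note that the original index labels are preserved in the result; passing ignore_index=True or calling reset_index(drop=True) afterwards renumbers the rows if that is preferred.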

To remove duplicate rows based on a specific column,

use the subset parameter.

subset can be used as follows.

import pandas as pd

# Replace 'column_name' with the column to compare; the result is a new DataFrame.
fifa = fifa.drop_duplicates(subset=['column_name'])

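Passing the column you want to compare in the subset parameter of drop_duplicates() makes pandas check for duplicates in that column only.

Rows that share a value in that column are treated as duplicates even if their other columns differ, and only the first such row is kept.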
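A minimal sketch of subset in action, again using made-up data: the columns player, team, and goals are assumptions for illustration and do not come from the original fifa DataFrame.

```python
import pandas as pd

# Hypothetical sample data; no two rows are fully identical.
fifa = pd.DataFrame({
    "player": ["A", "B", "C", "D"],
    "team": ["X", "X", "Y", "Y"],
    "goals": [3, 5, 2, 2],
})

# Compare only the "team" column: one row per team survives (the first one).
by_team = fifa.drop_duplicates(subset=["team"])

# subset also accepts several columns: rows are duplicates only if
# they match on both "team" and "goals".
by_team_goals = fifa.drop_duplicates(subset=["team", "goals"])

print(by_team)
print(by_team_goals)
```

Because the whole row is kept or dropped, the surviving rows still carry all of their columns, not just the ones listed in subset.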
