To remove duplicates in a dataframe, use distinct()
. Duplicates are rows that are replicated at least twice.
# With the pipe operator
df <- df %>%
distinct()
# Without the pipe operator
df <- distinct(df)
Documentation
Subset distinct/unique rows — distinct
Select only unique/distinct rows from a data frame. This is similarto unique.data.frame() but considerably faster.
