本次總結(jié)來(lái)源網(wǎng)絡(luò)肃续,有多處參考
在R語(yǔ)言中黍檩,去掉重復(fù)數(shù)據(jù)的函數(shù)是:duplicated
刪掉所有列中數(shù)據(jù)一樣的:
>test <- data.frame(
x1 = c(1,2,3,4,5,1,3,5),
x2 = c("a","b","c","d","e","a","b","e"),
x3 = c("a","b","c","d","e","a","c","e"))
> test
x1 x2 x3
1 1 a a
2 2 b b
3 3 c c
4 4 d d
5 5 e e
6 1 a a
7 3 b c
8 5 e e
> test[!duplicated(test),] #刪掉所有列上都重復(fù)的
x1 x2 x3
1 1 a a
2 2 b b
3 3 c c
4 4 d d
5 5 e e
7 3 b c
選擇性的刪除重復(fù)的
> test[!duplicated(test[,c(2,3)]),]
x1 x2 x3
1 1 a a
2 2 b b
3 3 c c
4 4 d d
5 5 e e
7 3 b c