Mercurial > repos > davidvanzessen > extract_duplicates
annotate extract_duplicates.r @ 1:a3c4e3e62e10 draft
Uploaded
| author | davidvanzessen |
|---|---|
| date | Tue, 01 Sep 2015 07:49:47 -0400 |
| parents | 02cf2dd19564 |
| children | 1f1640608245 |
| rev | line source |
|---|---|
| 0 | 1 args <- commandArgs(trailingOnly = TRUE) |
| 2 | |
| 3 input=args[1] | |
| 4 column=as.numeric(args[2]) | |
| 5 header=(args[3] == "yes") | |
| 6 out_file=args[4] | |
| 7 | |
| 1 | 8 dat = read.table(input, header=header, sep="\t", fill=T, stringsAsFactors=F, quote="") |
| 0 | 9 |
| 10 duplicates = dat[duplicated(dat[,column]),column] | |
| 11 | |
| 12 dat = dat[dat[,column] %in% duplicates,] | |
| 13 | |
| 14 dat = dat[order(dat[,column]),] | |
| 15 | |
| 16 write.table(dat, out_file, sep="\t", row.names=F, col.names=header, quote=F) |
