-- remove rows without gene_id (duplicated entries) --- May 31st 2019
grep 'gene_id' /project/umw_nathan_lawson/Annotation/GRCz11/refseq_GRCz11/GRCz11refSeqUcsc.fixed.gtf > /project/umw_nathan_lawson/Annotation/GRCz11/refseq_GRCz11/GRCz11refSeqUcsc.fixed2.gtf
grep -v 'gene_id' shows that the rows without gene_id each have a duplicate row with gene_id, so the rows without gene_id were removed.
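
One possible way to double-check the duplicate claim before filtering is sketched below. It is not the check that was actually run for this entry. The helper name count_unmatched is hypothetical, and it assumes that "duplicate" means a row with identical values in the first eight GTF columns (seqname, source, feature, start, end, score, strand, frame); the notes do not define it more precisely.

```shell
#!/bin/sh
# Hypothetical helper (not part of the original notes).
# Counts distinct coordinate keys that occur ONLY on rows lacking gene_id,
# i.e. rows that grep 'gene_id' would drop without leaving a counterpart.
# Assumption: two rows are "duplicates" when their first 8 tab-separated
# GTF columns are identical.
count_unmatched() {
    awk -F'\t' '
        { key = $1 FS $2 FS $3 FS $4 FS $5 FS $6 FS $7 FS $8 }
        /gene_id/ { seen[key] = 1; next }   # rows the grep would keep
        { missing[key] = 1 }                # rows the grep would drop
        END {
            n = 0
            for (k in missing) if (!(k in seen)) n++
            print n
        }
    ' "$1"
}

# Usage (input.gtf is a placeholder path):
#   count_unmatched input.gtf
# A result of 0 means every row without gene_id has a counterpart with gene_id.
```

If the count is not 0, the grep filter would discard features that exist only on rows without gene_id, so those rows would need a closer look before removal.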