Sabiia Seb
Full record
Data provider:  Nature Precedings
Country:  United Kingdom
Title:  GENCODE: Creating a Validated Manually Annotated Geneset for the Whole Human Genome
Authors:  A. Bignell
A. Frankish
B. Aken
M. Diekhans
F. Kokocinski
M. Lin
M. Tress
J. Van Baren
I. Barnes
T. Hunt
D. Carvalho-Silva
C. Davidson
S. Donaldson
J. Gilbert
E. Hart
M. Kay
R. Kinsella
D. Lloyd
J. Loveland
J. E. Mudge
C. Snow
J. Vamathevan
L. Wilming
M. Brent
M. Gerstein
R. Guigó
R. Harte
M. Kellis
S. Searle
J. Harrow
T. Hubbard
Date:  2009-04-23
Year:  2009
Keywords:  Genetics & Genomics
Bioinformatics
Abstract:  The Human and Vertebrate Analysis and Annotation (HAVANA) group at the Wellcome Trust Sanger Institute produced the manually annotated geneset for the Encyclopedia of DNA Elements (ENCODE) pilot project and, as part of the Gencode subgroup, is reprising this role in the scale-up to cover the whole human genome. Our manual annotation is checked computationally and validated experimentally. Loci and transcripts predicted to be absent from the initial annotation are identified by comparison with a number of state-of-the-art algorithms for identifying exons, splice sites, transcripts and pseudogenes. Where novel features are confirmed, the annotation is updated. Annotated coding transcripts are analysed to assess their coding potential by investigating patterns of conservation within the coding sequence (CDS) and comparing predicted secondary structures of annotated CDSs to similar proteins with solved structures. Annotated coding transcripts are also compared with the current set of human Consensus CDSs (CCDS) to check agreement with other participating centres (EBI, NCBI and UCSC).

An initial round of annotation and analysis of chromosomes 21 and 22 has shown that while HAVANA annotation is both comprehensive and robust, it has benefitted from computational review. 13 novel non-coding loci, 27 novel splice variants and 6 extensions to existing variants were identified, many of which were found using supporting EST/mRNA sequences that were not present at the time of initial annotation. Fewer than 10 annotated CDSs required reclassification, no CCDS sequences required updating, and 26 novel pseudogenes were added. The annotation of human chromosome 2 is complete and we are currently annotating chromosomes 3 and 7. Data from all members of Gencode is distributed via DAS and is now visible in our Zmap annotation interface, allowing computational predictions to be assessed at the same time as first-pass gene annotation.
Type:  Poster
Identifier:  http://precedings.nature.com/documents/3155/version/1

oai:nature.com:10.1038/npre.2009.3155.1

http://dx.doi.org/10.1038/npre.2009.3155.1
Source:  Nature Precedings
Rights:  Creative Commons Attribution 3.0 License