Full record
Data provider:  Nature Precedings
Country:  United Kingdom
Title:  Examining the uses of shared data
Authors:  Heather A. Piwowar
Douglas B. Fridsma
Date:  2007-07-11
Year:  2007
Keywords:  Bioinformatics
Abstract:  Background
 Many initiatives and repositories exist to encourage the sharing of research data, and thousands of microarray gene expression datasets are publicly available. Many studies reuse this data, but it is not well understood which datasets are reused and for what purpose.

 Materials and Methods
 We trained a machine-learning algorithm to automatically classify full-text gene expression microarray studies into two classes: those that generated original microarray data (n=900) and those that only reused data (n=250). We then compared the Medical Subject Heading (MeSH) terms of the two classes to identify MeSH topics that were over- or under-represented among publications with reused data.

 Results
 Studies on humans, mice, chordata, and invertebrates were equally likely to be conducted using original or shared microarray data, whereas shared data was used in a relatively high proportion of studies involving fungi (odds ratio (OR)=2.4), and a relatively low proportion involving rats, bacteria, viruses, plants, or genetically-altered or inbred animals (OR<0.05). Unsurprisingly, when we looked at Major MeSH terms to represent the primary purpose of the studies, statistical and computational methods clearly dominated. The only biomedical Major MeSH topics with a relatively high proportion of data reuse were Promoter Regions, Evolution, and Protein Interaction Mapping.
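
 The abstract does not say how the odds ratios were computed. As a minimal sketch, assuming the standard 2x2 contingency-table definition, the odds ratio for one MeSH term could be calculated as below. The function name `odds_ratio` and the per-term counts are hypothetical and chosen only for illustration; only the class sizes (250 reuse studies, 900 original-data studies) come from the abstract.

```python
def odds_ratio(reuse_with_term, reuse_total, original_with_term, original_total):
    """Odds of a MeSH term among data-reuse studies divided by its odds
    among original-data studies (standard 2x2 table; an assumption, since
    the abstract does not state the exact method)."""
    a = reuse_with_term                        # reuse studies tagged with the term
    b = reuse_total - reuse_with_term          # reuse studies without the term
    c = original_with_term                     # original-data studies with the term
    d = original_total - original_with_term    # original-data studies without it
    if b == 0 or c == 0:
        raise ValueError("odds ratio is undefined when a denominator cell is zero")
    return (a * d) / (b * c)


# Class sizes taken from the abstract.
REUSE_TOTAL = 250
ORIGINAL_TOTAL = 900

# Hypothetical term counts, NOT taken from the study.
print(odds_ratio(50, REUSE_TOTAL, 90, ORIGINAL_TOTAL))  # 2.25
```

 An odds ratio above 1 indicates a term over-represented among studies that reused data, a value below 1 indicates under-representation, and a value near 1 indicates no difference between the two classes.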

 Discussion
 Identifying areas of particularly successful microarray data reuse—such as Saccharomyces cerevisiae datasets and studies of promoter regions and evolution—can highlight best practices to be used when developing research agendas, tools, standards, repositories, and communities in areas which have yet to receive major benefits from shared data.

Type:  Poster
Identifier:  http://precedings.nature.com/documents/425/version/1

oai:nature.com:10.1038/npre.2007.425.1

http://dx.doi.org/10.1038/npre.2007.425.1
Source:  Nature Precedings
Rights:  Creative Commons Attribution 3.0 License