Sabiia Seb
PortuguêsEspañolEnglish
Embrapa
        Busca avançada

Botão Atualizar


Botão Atualizar

Ordenar por: 

RelevânciaAutorTítuloAnoImprime registros no formato resumido
Registros recuperados: 1
Primeira ... 1 ... Última
Imagem não selecionada

Imprime registro no formato completo
TEXT CATEGORIZATION USING ONLY FRAGMENTS OF DOCUMENTS AgEcon
Pilaszy, Istvan; Dobrowiecki, Tadeusz.
In this paper we presented a lot of experiments that examine how the particular parts of the documents do contribute to the performance of a classifier. We evaluated text classifiers on two very different text corpora. We conclude that some parts of the text are more important from the point of text classification performance. Giving higher weights to more important parts can increase the performance of the classifier. The question, that which parts are more or less important depends on the nature of the documents in the corpora. Some tasks that remains to be done: − More text corpora should be investigated. − In section 6.4 we optimized the number of features to be kept independent from the section. However, it could be optimized for each section. −...
Tipo: Journal Article Palavras-chave: Machine learning; Text categorization; Classifier ensembles; Research and Development/Tech Change/Emerging Technologies.
Ano: 2007 URL: http://purl.umn.edu/58927
Registros recuperados: 1
Primeira ... 1 ... Última
 

Empresa Brasileira de Pesquisa Agropecuária - Embrapa
Todos os direitos reservados, conforme Lei n° 9.610
Política de Privacidade
Área restrita

Embrapa
Parque Estação Biológica - PqEB s/n°
Brasília, DF - Brasil - CEP 70770-901
Fone: (61) 3448-4433 - Fax: (61) 3448-4890 / 3448-4891 SAC: https://www.embrapa.br/fale-conosco

Valid HTML 4.01 Transitional