Naïve bayes web page classification with HTML mark-up enrichment

  • Víctor Fresno Fernández
  • , Raquel Martínez Unanue
  • , Soto Montalvo Herranz
  • , Arantza Casillas Rubio

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

11 Citations (Scopus)

Abstract

In text and web page classification, Bayesian prior probabilities are usually based on term frequencies, term counts within a page and among all the pages. However, new approaches in web page representation use HTML mark-up information to find the term relevance in a web page. This paper presents a Naive Bayes web page classification system for these approaches.

Original languageEnglish
Title of host publicationProceedings of the International Multi-Conference on Computing in the Global Information Technology, ICCGI'06
PublisherIEEE Computer Society
Pages48-53
Number of pages6
ISBN (Print)0769526292, 9780769526294
DOIs
Publication statusPublished - 2006
Externally publishedYes
EventInternational Multi-Conference on Computing in the Global Information Technology, ICCGI'06 - Bucharest, Romania
Duration: 1 Aug 20063 Aug 2006

Publication series

NameProceedings of the International Multi-Conference on Computing in the Global Information Technology, ICCGI'06

Conference

ConferenceInternational Multi-Conference on Computing in the Global Information Technology, ICCGI'06
Country/TerritoryRomania
CityBucharest
Period1/08/063/08/06

Fingerprint

Dive into the research topics of 'Naïve bayes web page classification with HTML mark-up enrichment'. Together they form a unique fingerprint.

Cite this