April 14, 2008   Sign In |  About ebizQ |  Contact Us |  Join ebizQ Gold Club

ITGumbo: spicing IT up

IT Copywrite

Technology and application of technology.

ebizQ presents ITGumbo: a spicy blog network where vendors and IT professionals share ideas about creating Business Agility.

November 2007 Archives

Web Resource Classification for Search Engine Taxonomy

As more data on WWW is made available with semantic annotation of web resources the categorization of web resources based on classification with characteristics identifiable by normative metadata shall be the key to development of semantic web applications. The inclusion of normative metadata from standard ontology as web resource descriptors in POWDER DR or embedded in the web content with RDFa shall provide data for search engine indexing to build SE taxonomy.

  • Characteristics are properties of a 'thing'.
  • Classification is the identification of characteristics for categorization.
  • Categorization is grouping of resources with same characteristics.
  • Taxonomy is the process of classification.

The value of HTML language elements and attributes such as content attribute in META element with attribute name='keywords', heading elements, alt, name, title, hreflang and media, etc. are used by search engines to classify web content. While these attributes may still be used for web resource classification, the normative metadata shall extend the vocabulary that is used for web resource classification. The difference is that HTML element and attribute tag value and not tag name are used for web resource classification, the RDF/OWL class property (predicate) as well as values (object) shall be used for classification of semantic web resources (subject). It is important to note that HTML tag names do not mean name attribute value but tag names such as META, TH, A, H1-H6, etc.

» Continue reading Web Resource Classification for Search Engine Taxonomy.