<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>IT Copywrite</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/" />
    <link rel="self" type="application/atom+xml" href="http://itgumbo.com/itcopywrite/atom.xml" />
    <id>tag:itgumbo.com,2008-01-07:/itcopywrite//26</id>
    <updated>2008-05-01T17:35:05Z</updated>
    <subtitle>Technology and application of technology.</subtitle>
    <generator uri="http://www.sixapart.com/movabletype/">Movable Type Publishing Platform 4.01</generator>

<entry>
    <title>Simple steps to add RDFa to web content</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2008/05/simple_steps_to_add_rdfa_to_we.php" />
    <id>tag:itgumbo.com,2008:/itcopywrite//26.1553</id>

    <published>2008-05-01T10:42:36Z</published>
    <updated>2008-05-01T17:35:05Z</updated>

    <summary><![CDATA[ In many web content publishing tools it is not possible to modify the contents of &lt;html&gt; and &lt;head&gt; element. Therefore it may not be possible to mention correct Document Type (DOCTYPE) and Content Type (@http-equiv) values. However it is...]]></summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Search Engine" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="google" label="Google" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="normativemetadata" label="normative metadata" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="rdf" label="RDF" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="rdfa" label="RDFa" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="semanticweb" label="semantic web" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="yahoo" label="Yahoo!" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[ <div xmlns:dcterms="http://purl.org/dc/terms/" about="">

<p>In many web content publishing tools it is not possible to modify the contents of &lt;html&gt; and &lt;head&gt; element. Therefore it may not be possible to mention correct Document Type (DOCTYPE) and Content Type (@http-equiv) values. However it is still possible to add <acronym title="Resource Description Framework">RDF</acronym> normative metadata to your web content. Web content annotated with RDFa is scanned for RDF triples by major Search Engines such as Google and Yahoo!.</p>
</div>]]>
        <![CDATA[<div xmlns:dcterms="http://purl.org/dc/terms/" about="">

<p style="padding-left:10ex;padding-right:10ex;">"<em>Adding semantics to the web involves two things: allowing documents which have information in machine-readable forms, and allowing links to be created with relationship values</em>" 
(<a rel="dcterms:reference" title="The need for Semantics in the Web" href="http://www.w3.org/Talks/WWW94Tim/">TimBL, WWW1994</a>).</p>

<dl>
<dt style="font-weight:bold">Vocabulary Namespace - </dt>  <dd>Add namespace declaration <strong>xmlns:</strong> to &lt;div&gt;, &lt;p&gt;, &lt;a&gt; etc. elements that contain the annotated content.</dd>
<dt style="font-weight:bold">Subject - </dt>  <dd>Add <strong>about=""</strong> to the &lt;div&gt;, &lt;p&gt;, etc. elements that contain the annotated content. The @about value is the <span style="color:blue">subject</span> in the RDF triple and the value "" of @about suggests that the <span style="color:blue">subject</span> for enclosed annotated content is 'this' document and hence the value of <span style="color:blue">subject</span> <acronym title="Internationalized Resource Identifier">IRI</acronym> shall be IRI of 'this' document as displayed in the browser address bar.</dd>


<dt style="font-weight:bold">Predicate - </dt><dd>The value of @rel and @property is the <span style="color:blue">predicate</span> in a RDF triple.
Add RDF properties i.e. normative metadata from vocabularies such as <a rel="dcterms:reference" title="FOAF Vocabulary Specification" href="http://xmlns.com/foaf/spec/">FOAF</a>, <a rel="dcterms:reference" title="An Ontology for vCards" href="http://www.w3.org/2006/vcard/ns">vCard</a>, <a rel="dcterms:reference" title="DCMI Metadata terms" href="http://dublincore.org/documents/dcmi-terms/">Dublin Core</a> etc. in the following manner:</dd>
</dl>
<ul style="padding-left:10ex">
<li>If &lt;a&gt; element is present then add @rel to &lt;a&gt; element with value as property name from the vocabulary.</li>
<li>If &lt;a&gt; element is not present then enclose the data to be annotated in &lt;span&gt; element and include @property to &lt;span&gt; element with value as property name from the vocabulary.</li>
</ul>

<dl>
<dt  style="font-weight:bold">Object - </dt><dd>The value of @href in &lt;a&gt; element and the literal value enclosed within &lt;span&gt; elements are the <span style="color:blue">object</span> values. In brief, @rel must be used if <span style="color:blue">object</span> is a web resource with valid <acronym title="Internationalized Resource Identifier">IRI</acronym> and @property  is used when <span style="color:blue">object</span> is a plain literal text. @content is used when the plain literal text value of <span style="color:blue">object</span> rendered by the user agent such as browser is different from the machine processable value.</dd>

</dl>

<table border="1" bordercolor="navy" style="border-collapse:collapse;background-color:lightcyan;font-size:8pt;" width="100%">
<caption style="font-weight:bold">Example: Semantic Annotation of web content</caption>
<tr style="background-color:yellow;font-weight:bold;"><th>Plain Content</th><th>Semantic Annotation with RDFa</th></tr>
<tr><td>
&lt;body&gt;<br />
&lt;h3&gt;Title of this post&lt;/h3&gt;<br />
OR<br />
&lt;h3&gt;&lt;a href="IRI of this blog post"&gt;Title of this post&lt;/a&gt;&lt;/h3&gt;<br /><br />
&lt;p&gt;Content of this post&lt;/p&gt;<br /><br />
&lt;a href="home page IRI">Ila Nivas&lt;/a&gt;<br />
OR<br />
Ila Nivas<br /><br />
Date: April 15, 2008<br />
&lt;/body&gt;<br />
</td>
<td>
&lt;div xmlns:dc="http://purl.org/dc/elements/1.1" about=""&gt;<br />
&lt;h3 property="dc:title"&gt;Title of this post&lt;/h3&gt;<br />
OR<br />
&lt;h3 property="dc:title"&gt;&lt;a href="IRI of this blog post"&gt;Title of this post&lt;/a&gt;&lt;/h3&gt;<br />
OR<br />
&lt;h3&gt;&lt;a href="IRI of this blog post" property="dc:title"&gt;Title of this post&lt;/a&gt;&lt;/h3&gt;<br /><br />
&lt;p&gt;Content of this post&lt;/p&gt;<br /><br />
&lt;a rel="dc:creator" href="home page IRI">Ila Nivas&lt;/a&gt;<br />
OR<br />
&lt;span property="dc:creator"&gt;Ila Nivas&lt;/span&gt;<br /><br />
Date: &lt;span property="dc:date" content="20080415T1600-0500"
            datatype="xsd:datetime"&gt;April 15, 2008&lt;/span&gt;<br />
&lt;/div&gt;<br />
</td></tr>
<tr>
<td>No RDF triples are generated without annotation.</td>
<td>
RDF triples: <br /><br />
&lt;IRI of this blog post&gt; dc:title "Title of this post" .<br />
&lt;IRI of this blog post&gt; dc:creator "Ila Nivas" .<br />
&lt;IRI of this blog post&gt; dc:date "20080415T1600-0500"^^xsd:datetime .<br />
</td>
</tr>
</table>

<h4>View RDFa on this web page</h4>
<ul>
<li>Drag the 'RDFa Highlight' bookmarklet from <a title="RDFa Bookmarklets" href="http://www.w3.org/2006/07/SWD/RDFa/impl/js/">here</a> to the bookmarks bar.</li>
<li>Open this blog post in the browser.</li>
<li>Now click the 'RDFa Highlight' bookmark, the embedded RDFa is highlighted with <span style="color:red">red</span> border rectangle around the <span style="color:blue">object</span>.</li>
<li>The RDF triple can be viewed if you hover mouse over the rectangle.</li>
</ul>

<h4>Validate RDFa</h4>
<p>You can test the compliance of your web page RDFa <a title="W3C Markup Validator" href="http://validator.w3.org/">here</a>. If it is <a title="W3C RDFa syntax" rel="dcterms:reference" href="http://www.w3.org/TR/rdfa-syntax/">valid RDFa</a> you can display the W3C XHTML+RDFa seal <img
        src="http://www.w3.org/Icons/valid-xhtml-rdfa"
        alt="Valid XHTML + RDFa"   />on your web page. For compliance with W3C standards the following are correct values:</p>
<dl>
<dt>DocType -</dt>
<dd>
&lt;!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN"
    "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd"&gt;
</dd>
<dt>Content-Type - </dt>
<dd>
application/xhtml+xml is the correct value to allow processing of RDFa attributes by XHTML user applications. 
</dd>
</dl>

<p><strong>Conclusion:</strong> XML contortion is not required to add semantic annotation to web content. RDFa has well-defined syntax, only correct syntax must be used to structure the data with RDFa. In this <a rel="dcterms:reference" title="Consumer requirement for open market" href="http://itgumbo.com/itcopywrite/2008/01/consumer_requirement_for_open.php">example</a> it is illustrated how RDFa can be used to annotate credit card data. Similarly RDFa can be used to add semantic annotation to other product descriptions. Many time-saving tools can be built with semantic annotation of web content.</p>



</div>
]]>
    </content>
</entry>

<entry>
    <title>Search strategies to explore the invisible web</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2008/04/search_strategies_to_explore_t.php" />
    <id>tag:itgumbo.com,2008:/itcopywrite//26.1552</id>

    <published>2008-04-13T08:22:52Z</published>
    <updated>2008-05-01T17:30:56Z</updated>

    <summary> Google has announced a new search strategy to explore the invisible web. Invisible web comprises web documents that have not been indexed yet. Google&apos;s new search strategy is to retrieve invisible information from the web sites that use HTML...</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Search Engine" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="ajax" label="AJAX" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="atom" label="Atom" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="dublincore" label="Dublin Core" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="google" label="Google" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="grddl" label="GRDDL" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="html" label="HTML" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="http" label="HTTP" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="normativemetadata" label="normative metadata" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="rdf" label="RDF" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="rdfa" label="RDFa" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="robots" label="robots" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="rss10" label="RSS 1.0" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="rss20" label="RSS 2.0" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="rssmicro" label="RSSMicro" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="semanticweb" label="semantic web" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="sioc" label="SIOC" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="web20" label="web 2.0" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="web30" label="web 3.0" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="xml" label="XML" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="yahoo" label="Yahoo!" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<div xmlns:dcterms="http://purl.org/dc/terms/" about="">

<p>Google has announced a <a title="Crawling through HTML forms" href="http://googlewebmastercentral.blogspot.com/2008/04/crawling-through-html-forms.html">new search strategy</a> to explore the invisible web. Invisible web comprises web documents that have not been indexed yet. Google's new search strategy is to retrieve invisible information from the web sites that use <acronym title="HyperText Markup Language">HTML</acronym> forms by providing input data to HTML input controls. As mentioned in the post that announces this search strategy they shall honor the privacy policy of a web site by processing <a rel="dcterms:reference" title="HTML specification" href="http://www.w3.org/TR/html401/appendix/notes.html#h-B.4.1">robots.txt file and META tag 'ROBOTS'</a> and avoid <acronym title="HyperText Transfer Protocol">HTTP</acronym> requests that require user information processing. </p>

</div>]]>
        <![CDATA[<div xmlns:dcterms="http://purl.org/dc/terms/" about="">

<p>Another Search Engine that explores the invisible web is <a href="http://www.rssmicro.com/aboutus.web">RSSMicro Search Engine</a>, this Search Engine crawls the XML feeds and collects syndication data.</p>
<p style="padding-left:10ex;padding-right:10ex">"<em>In other words, a new web is evolving which has its roots in XML based feeds rather than HTML pages. Some may refer to Web 2.0 or 3.0 but nevertheless the data is mainly comprised of user generated XML feeds. Unknowingly, we might be building a new web which is a duplicate of the original in terms of the content but is being transformed into XML</em>" <a rel="dcterms:reference" href="http://www.rssmicro.com/aboutus.web">RSSMicro</a>.</p>

<p><acronym title="Extensible Markup Language">XML</acronym> feeds provide good amount of semantically annotated data, the normative metadata used is specified in syndication specifications such as <acronym title="RDF Site Summary">RSS 1.0</acronym>, <acronym title="Really Simple Syndication">RSS 2.0</acronym> and Atom. Due to this normative metadata it is possible to collect keywords for intelligent search, e.g. 'category' element in RSS 2.0 and Atom specification provides the keywords used in the post. As mentioned on RSSMicro web site they collect "millions of keywords" and apply clustering algorithms to answer user queries. This normative metadata that is included in the RSS feed is not present in the web page of the news item published in the RSS feed. The difference between the two can be observed by viewing the source code of the <a title="ITGumbo RSS feed" href="http://itgumbo.com/rss.xml">RSS feed file</a> and the source code of any <a title="ITGumbo blog" href="http://itgumbo.com/itcopywrite/2008/03/linked_data_for_search_engine.php">post published on this feed</a>.  The 'category' element present in the XML feed is not present in the blog web page, instead the keywords are included as anchor text for HTML 'a' element with @rel=tag.</p>
 
<p>XML is the basis of all semantic technology programming languages such as <acronym title="Resource Description Framework">RDF</acronym>, RDFa and <acronym title="Web Ontology Language">OWL</acronym>. With RDFa and microformats it is possible to annotate the web pages so that normative metadata can be found on the HTML web pages also. It is possible to avoid the duplication of information by using same normative metadata in web syndication and web content publishing tools. This issue is addressed in <a href="http://web.resource.org/rss/1.0/">RSS 1.0 syndication specifications</a> to some extent, by inclusion of <a rel="dcterms:reference" href="http://web.resource.org/rss/1.0/modules/dc/">Dublin Core metadata in RSS 1.0</a> RDF description of items.  The keywords can be annotated with 'dc:subject' property in both RSS 1.0 XML feed and the HTML web page. Since this duplication is being performed by an application that is generating the feed there is no loss of effort or information. Most syndication include only first few lines of the blog post and more data with semantic annotation may be embedded in the content. Example: use of vocabularies vCard, iCal, <acronym title="Friend-of-a-Friend">FOAF</acronym>, etc. inside the content.</p>

<p>Most blog tools also include links to related, popular or previous posts, categories, archives and tag clouds on the blog web page. No HTML form is to be filled to access this data. A Search Engine such as RSSMicro that reads XML files to collect data can collect some amount of data from the one time syndication. More data shall be collected by the Search Engine that crawls all the @href and @src links on the web page, this shall cover the tag cloud, archives and categories. At this moment in web 2.0 the XML file crawlers have advantage because of normative metadata made available by syndication protocols such as <a rel="dcterms:reference" href="http://web.resource.org/rss/1.0/">RSS 1.0</a>, <a rel="dcterms:reference" href="http://cyber.law.harvard.edu/rss/rss.html">RSS 2.0</a> and <a rel="dcterms:reference" href="http://tools.ietf.org/html/rfc4287">Atom</a>.</p>

<p><strong>Conclusion:</strong> <a rel="dcterms:reference" href="http://www.ysearchblog.com/archives/000527.html">Indexing of semantic data by Yahoo!</a>, <a rel="dcterms:reference" href="http://googlewebmastercentral.blogspot.com/2008/04/crawling-through-html-forms.html">HTML forms by Google</a> and <a rel="dcterms:reference" href="http://www.rssmicro.com/aboutus.web">XML data by RSSMicro</a> are steps into invisible web. Invisible web now encompasses not only documents that have not been indexed but also information (i.e. data+context) that has not been found. Information present in the web content published with <acronym title="Asynchronous Javascript and XML">AJAX</acronym> tools has data that can be explored with a combination of search strategies. RDF/XML corresponding to a web page derived with <acronym title="Gleaning Resource Descriptions from Dialects of Languages">GRDDL</acronym> transformation and stored on the web server with <acronym title="Paltform for Privacy Preferences">P3P</acronym> controls shall also solve the data accessibility issues with AJAX applications.
</p>

<u>Update</u>
<p>To view RDFa embedded in this page:</p>
<ul>
<li>Drag the 'RDFa Highlight' bookmarklet from <a href="http://www.w3.org/2006/07/SWD/RDFa/impl/js/">here</a> to the bookmarks bar.</li>
<li>Open this blog post in the browser.</li>
<li>Now click the 'RDFa Highlight' bookmark, the embedded RDFa is highlighted with <font color="red">red</font> border rectangle around the <font color="blue">object</font>.</li>
<li>The RDF triple can be viewed if you hover mouse over the rectangle.</li>
</ul>

</div>

]]>
    </content>
</entry>

<entry>
    <title>Linked Data for Search Engine Optimization</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2008/03/linked_data_for_search_engine.php" />
    <id>tag:itgumbo.com,2008:/itcopywrite//26.1543</id>

    <published>2008-03-30T13:01:43Z</published>
    <updated>2008-04-13T10:07:36Z</updated>

    <summary> Search Engine Optimization (SEO) is a technique applied to increase the rank of a page in Search Engine Result Page (SERP). SERP is the list of URIs found for the user query and rendered in descending PageRank order on...</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Search Engine" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="commonvocabulary" label="common vocabulary" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="normativemetadata" label="normative metadata" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="owl" label="OWL" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="rdf" label="RDF" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="rdfa" label="RDFa" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="semanticweb" label="semantic web" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="sparql" label="SPARQL" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[ <p>Search Engine Optimization (SEO) is a technique applied to increase the rank of a page in Search Engine Result Page (SERP). SERP is the list of <acronym title="Uniform Resource Identifier">URIs</acronym> found for the user query and rendered in descending <a title="Google technology" href="http://www.google.com/technology/">PageRank</a> order on a single web page. Since most web users shall traverse the URIs on the first SERP or may be next few, most web content providers would want that their web page URI is included in the first few SERPs.</p>
]]>
        <![CDATA[<p>The web page URI relative weightage that determines the URI position in <acronym title="Search Engine Result Page">SERP</acronym> is called PageRank. One Search Engine may have a different PageRank algorithm than the other. Most Search Engines use relevance of  keywords mentioned in <em>link</em> element, values of @name, @title and @id in <acronym title="HyperText Markup Language">HTML</acronym> elements, number of inlinks and outlinks, PageRank of the inlink & outlink web pages, etc to determine the value of PageRank.</p>

<p>Therefore for a high PageRank it is important to add popular and relevant outlinks to a web page. Outlinks are URIs of other web pages that have been embedded with <em>a</em> element and @href in <font color="blue">this</font> web page. Inlinks are the URIs of the web page that have included the URI of <font color="blue">this</font> web page in their content. Some web authors also use <em>cite</em> element to refer to a web resource. Both inlinks and citations are counted by the Search Engine.</p>

<p>Inserting outlinks is one of the primary steps in creation of <a title="Linked Data explained by Tim Berners-Lee" href="http://www.w3.org/DesignIssues/LinkedData.html">Linked Data</a> because it identifies the 'things' that are connected with <font color="blue">this</font> web page. However repetitive insertion of the same outlink in a web page may not enhance the PageRank, because an intelligent Search Engine may count unique URIs only. Sometimes it is difficult to read a web document with too many outlinks to comprehend every reference, in such cases inserting a '<a title="Short phrase of information that shall pop-up when mouse hovers close to a link. " href="http://www.usability.gov/pdfs/chapter7.pdf">gloss</a>' is a better idea to enhance readability. To add a gloss add @title in HTML <em>a</em> element with the text to be displayed. In web 3.0 repetition of outlinks may be avoided with <a title="@about description in RDFa specification" href="http://www.w3.org/2006/07/SWD/RDFa/syntax/#id103742">@about</a> and <a title="Blank Node description in RDFa specification" href="http://www.w3.org/2006/07/SWD/RDFa/syntax/#id104592">blank node</a>.</p>

If page 'A' refers to page 'B' then the outlink is embedded in page 'A' as follows:<br />
<table border="1" bordercolor="navy" style="border-collapse:collapse;" width="70%">
<tr style="background-color:yellow;color:navy;"><th>web 2.0</th><th>web 3.0</th></tr>
<tr><td>&lt;a href="page_B_uri"&gt;page_B_anchor_text&lt;/a&gt;</td>
<td>&lt;a rel="dcterms:references href="page_B_uri"&gt;page_B_anchor_text&lt;/a&gt;</td>
</tr>
</table>

<p>The difference in the two representations is the addition of <font color="blue">property</font> dcterms:references in the latter. The syntax for this semantic annotation is given in <a title="RDFa syntax" href="http://www.w3.org/2006/07/SWD/RDFa/syntax/">RDFa specifications</a>. This property identifies the relationship between 'thing' 'A' and 'thing' 'B', which in this case is 'A' references 'B'. It is not necessary that all outlinks are references, the relationship may be identified by any other metadata term.</p>

<p>E.g. consider this <acronym title="Resource Description Framework">RDF</acronym> description of 'albert' copied from <a title="Linked Data explained by Tim Berners-Lee" href="http://www.w3.org/DesignIssues/LinkedData.html">Linked Data</a> architecture note:</p>
<pre style="padding-left:5px;color:green;font-size:9pt;">
&lt;rdf:Description about="#albert"&gt;
  &lt;fam:child rdf:Resource="#brian"&gt;
  &lt;fam:child rdf:Resource="#carol"&gt;
&lt;/rdf:Description&gt;
</pre>

These RDF triples may be derived from the following web 3.0 web content, the equivalent web 2.0 content is also given here <br />
<table border="1" bordercolor="navy" style="border-collapse:collapse;" width="80%">
<tr style="background-color:yellow;color:navy;"><th>web 2.0</th><th>web 3.0</th></tr>
<tr>
<td>&lt;p&gt;The name of albert's children is &lt;a href="#brian">brian&lt;/a&gt; and &lt;a href="#carol">carol&lt;/a&gt;.</td>
<td>&lt;p 
about="#albert"&gt;The name of albert's children is &lt;a rel="fam:child" href="#brian"&gt;brian&lt;/a&gt; and &lt;a rel="fam:child" href="#carol"&gt;carol&lt;/a&gt;.&lt;/p&gt;</td>
</tr>
<tr>
<td><p>The name of albert's children is <a href="#brian">brian</a> and <a href="#carol">carol</a>.</p></td>
<td><p about="#albert">The name of albert's children is <a rel="fam:child" href="#brian">brian</a> and <a rel="fam:child" href="#carol">carol</a>.</p>
</td>
</tr>
</table>

Here the <font color="blue">property</font> fam:child is the relationship between albert and brian. <br />

<p>Some web 2.0 Search Engines are powerful to scan the complete document for any word or phrase in the text. These Search Engines may infer the relationship between Albert and Brian by reading the text preceding/succeeding the <em>a</em> element. The advantage of adding RDFa annotation in web 3.0 is that it highlights relationship information that is extractable into RDF triple form.</p>
&lt;#albert&gt; fam:child &lt;#brian&gt; .<br />
&lt;#albert&gt; fam:child &lt;#brian&gt; .<br />
<p>This RDF triple is stored in a database that is usable by many other web applications.<strong>Search is not the only application on the <acronym title="World Wide Web">WWW</acronym>, but is the basis of many other practical uses of web data</strong>. Search for Albert's children may return only the names of Brian and Carol in web 2.0, but in web 3.0 may also give additional information about Brian and Carol such as school name, home address, etc. The additional information about Brian and Carol may or may not be present on the same web page, but with Linked Data in web 3.0 this information can be fetched from <font color="blue">objects</font> Brian URI and Carol URI.</p>

<p>RDFa provides a syntax that can be used to publish information i.e. (data + context) so that machine extraction and processing of relevant information is possible and many different inferences can be derived from this data. Niche Search Engines may use the value of @rel to render the relationship in audio. A marketing agent may find the names of Albert's children in a particular locality and may approach their guardians with special discounts on uniforms, teaching aids, new books, etc. Guardian of a new admission to school may find other students in his or her locality. The WWW is considered as a media for publishing information and RDFa/RDF+OWL are the technologies to represent this information in a structured manner so that it can be used in different contexts. </p>

<p>In recent past new web applications have been launched that either provide the metadata for semantic annotation or perform the annotation for specified vocabularies. <a title="Calais web site" href="http://opencalais.com/">Calais</a> and <a title="TextWise web site" href="http://www.textwise.com/">TextWise</a> are popular applications and hand annotation is another method.</p>

<h4>Open Issues:</h4>
<ul style="list-style:square outside">
<li>A <a title="Growth of common vocabulary" href="http://itgumbo.com/itcopywrite/2007/04/growth_of_common_vocabulary_pa_1.php">common vocabulary</a> or a vocabulary repository is required that is one single place to find relevant normative metadata for semantic annotation. </li>
<li>A procedure is required to allow any web user to suggest an addition of a new metadata term in a vocabulary. Attribution of new metadata term or vocabulary may not be necessary.</li>
<li>A procedure is required to eliminate redundant normative metadata as new vocabularies are defined and new metadata terms are introduced.</li>
</ul>

<p><b>Conclusion</b>: In web 3.0 Search Engines that focus on natural language processing and intelligent search may reconsider PageRank algorithms. Search Engines shall use SPARQL filters and solution sequence modifiers to <a title="Intelligent search example" href="http://itgumbo.com/itcopywrite/2007/05/intelligent_search_example.php">select and arrange information</a>. Since Linked Data enhances authenticity of data therefore semantic annotation shall be a contributing factor in PageRank determination. Hence semantic annotation is a Search Engine Optimization technique. Linked Data shall be used to generate data mash-ups for many different <a title="Analysis of consumer sentiments" href="http://itgumbo.com/itcopywrite/2008/01/analysis_of_consumer_sentiment.php">contexts</a>.</p>


<div xmlns:dcterms="http://purl.org/dc/terms" about="">
References:<br />
<a rel="dcterms:references" title="Web Ontology Language" href="http://www.w3.org/2004/OWL/">OWL</a>, <a rel="dcterms:references" title="RDF specifications" href="http://www.w3.org/RDF/">RDF</a>, <a rel="dcterms:references" title="RDFa syntax specification" href="http://www.w3.org/2006/07/SWD/RDFa/syntax/">RDFa</a>, <a rel="dcterms:references" title="SPARQL standard" href="http://www.w3.org/TR/rdf-sparql-query/">SPARQL</a><br />
</div>

]]>
    </content>
</entry>

<entry>
    <title>FOAF a factor for strong authentication</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2008/03/foaf_a_factor_for_strong_authe.php" />
    <id>tag:itgumbo.com,2008:/itcopywrite//26.1520</id>

    <published>2008-03-06T10:04:37Z</published>
    <updated>2008-03-06T10:48:28Z</updated>

    <summary>A FOAF file is a mechanism to publish one&apos;s social and professional profile. It is also used to publicize one&apos;s social and professional network, the people one knows besides work duties and the people one collaborates with on project or...</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Security" scheme="http://www.sixapart.com/ns/types#category" />
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    <category term="foaf" label="FOAF" scheme="http://www.sixapart.com/ns/types#tag" />
    <category term="strongauthentication" label="strong authentication" scheme="http://www.sixapart.com/ns/types#tag" />
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>A <acronym title="Friend-of-a-Friend">FOAF</acronym> file is a mechanism to publish one's social and professional profile. It is also used to publicize one's social and professional network, the people one knows besides work duties and the people one collaborates with on project or in an organization. FOAF vocabulary properties foaf:knows and foaf:person are used to document these relationships on the web.</p>

<p style="padding-left:15ex;padding-right:15ex">&quot;<em>The foaf:knows property relates a foaf:Person to another foaf:Person that he or she knows</em>&quot; <a href="#foaf_a_fact_ref1">[1]</a>.</p>

<p>If one suggests in his/her FOAF file that he/she knows this person (linked with foaf:knows property), the other person is under no obligation to accept that he/she is socially or professionally associated with the FOAF file owner (value of property foaf:name in FOAF file).</p> 

<p style="padding-left:15ex;padding-right:15ex">&quot;<em>If someone foaf:knows a person, it would be usual for the relation to be reciprocated. However this doesn't mean that there is any obligation for either party to publish FOAF describing this relationship</em>&quot; <a href="#foaf_a_fact_ref1">[1]</a>.</p>

<h4 title="Advantages & risks of FOAF">Advantages & risks of FOAF file</h4>

<p style="padding-left:15ex;padding-right:15ex">&quot;<em>You probably know hundreds of people, yet might only list a few in your public FOAF file. That's OK. Or you might list them all. It is perfectly fine to have a FOAF file and not list anyone else in it at all</em>&quot; <a href="#foaf_a_fact_ref1">[1]</a>.</p>

<p>Is it okay to create a long list of acquaintances in the FOAF file by adding foaf:knows? It may be okay to do so until no other web application is developed that may use foaf:knows links for purposes other than being informed about one's social & professional standing. As soon as FOAF becomes popular amongst masses it may be used to establish credibility of a person, to establish trust and to authenticate a person. The name FOAF is expanded to "Friend-of-a-Friend", FOAF specification suggests:</p>

<p style="padding-left:15ex;padding-right:15ex">&quot;<em>The name was chosen to reflect our concern with social networks and the Web, urban myths, trust and connections</em>&quot; <a href="#foaf_a_fact_ref1">[1]</a>.</p>

<p>A hyperlink to one's FOAF file may be embedded by anybody in the web content in order to add a reference to the person. The properties such as foaf:Project and foaf:Organization may be used to find a list of collaborators for a project or an organization. It is not necessary to include these people in the FOAF file with foaf:knows property, a SPARQL query on foaf:Project may be used to find people working on the same project. Many tools can be built to take advantage of this personal information in order to <strong>build and find trustworthy connections</strong>.</p>

<h4 title="Strong authentication with FOAF">Strong authentication with FOAF</h4> 

<p>If two people trust each other they can project this information in their respective FOAF file by including each other's FOAF file IRI with foaf:knows property. This <u>reciprocation is a way of establishing trust</u> in each other and this trustworthy connection may be used for authentication. It may be used for responsibility delegation, endorsement and collaboration, etc.</p>

<p>A digital certificate that is used for <a href="http://en.wikipedia.org/wiki/Strong_authentication">strong authentication</a> constitutes more than one factor. FOAF Identity (i.e. FOAF file IRI) may be used <u>as one of the factors</u> in the construction of a digital certificate. If 'A' and 'B' trust each other and agree to endorse each other's communications on a certain matter they may get a digital certificate with 'A' and 'B' FOAF Identify as one of the factor for strong authentication. The certification authority shall validate the authenticity of the suggested relationship between the two with the presence of reciprocated FOAF Id with foaf:knows property in 'A' and 'B' FOAF files respectively. The suggested relationship shall be verified by the certificate authority every time it receives a request to verify and decrypt the message encrypted with the issued private key. This verification shall ensure that the suggested trust relationship between 'A' and 'B' is not broken, i.e. digital certificate is valid only until this suggested trust relationship is present in both FOAF files. Example: If either 'A' or 'B' removes foaf:knows entry for the other then the 'knows' relationship is broken; if either 'A' or 'B' changes foaf:Project value then they are not working on the same project anymore.</p>

<p>The relationship may be suggested by specifying other FOAF properties as well, such as: foaf:Group, foaf:Project, foaf:member, etc. The relationship may also be rendered by the user agents such as browsers and e-mail clients in the same manner as <a href="http://www.verisign.com">VeriSign</a> <a href="http://www.verisign.com/ssl/ssl-information-center/faq/extended-validation-ssl-certificates.html">EV SSL</a> shows certification authority in the green highlighted browser address bar. In e-mail client 'From' bar the relationships may be shown in a drop-down menu.</p>

<p><strong>Conclusion:</strong> FOAF data is public information it may be used for strong authentication. If FOAF data is included in the construction of a digital certificate, other private factors must also be included for autentication of FOAF data. The other possible factors that may be used in the construction of digital certificate are: passport number, driving license number, etc. FOAF data may add authenticity to the information by virtue of relationship between the information originator and the foaf:knows people or other FOAF properties.</p>

References:<br />
<a name="foaf_a_fact_ref1">[1]</a> <cite>Miller, Libby. & Brickley, Dan. (2007). <a href="http://xmlns.com/foaf/spec/">FOAF Vocabulary Specification 0.91</a>.</cite>



]]>
        
    </content>
</entry>

<entry>
    <title>Is Calais an alternative to RDFa annotation?</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2008/02/is_calais_an_alternative_to_rd.php" />
    <id>tag:itgumbo.com,2008:/itcopywrite//26.1497</id>

    <published>2008-02-08T08:47:36Z</published>
    <updated>2008-02-08T09:07:30Z</updated>

    <summary>Reuters have launched a web service for annotation of content on web, they call it Calais. We want to make all the world&apos;s content more accessible, interoperable and valuable. Some call it Web 2.0, Web 3.0, the semantic web or...</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>Reuters have launched a web service for annotation of content on web, they call it <a href="http://opencalais.com/">Calais</a>.</p>

<p style="padding-left:10ex;padding-right:10ex;"><em>We want to make all the world's content more accessible, interoperable and valuable. Some call it Web 2.0, Web 3.0, the semantic web or the Giant Global Graph - we call our piece of it Calais</em> <a href="#one">[1]</a>.</p>

<h4 title="What Calais does">What Calais web service does?</h4>
<p>This web service will accept the text content and scan it to find data for semantic annotation. The service shall find the appropriate metadata for semantic annotation. There is a provision that the content provider can suggest a vocabulary to be used for semantic annotation. The service shall store the RDF triples generated from the semantic annotation in a central repository and provide a Globally Unique Identifier (GUID) to the content provider. The service shall also provide these RDF triples to the content provider.</p>

<h4 title="What Calais user must do">What should Calais web service user do?</h4>
<p>The content provider who will use Calais web service must proivde the returned Calais <acronym title="Globally Unique Identifier">GUID</acronym> to those who need RDF triples corresponding to the published content. Any web application that needs RDF triples for the published content will find these triples in Calais central repository by providing the GUID.</p>
<p style="padding-left:10ex;padding-right:10ex;"><em>Using the Calais GUID, any downstream consumer is able to retrieve this metadata via a simple call to Calais</em> <a href="#one">[1]</a>.</p>
<p>The content provider may also use RDFa to include the metadata terms from the RDF triples returned by the Calais web service for semantic annotation of the content.</p>

<h4 title="What are the advantages of Calais">What are the advantages of Calais web service?</h4>
<p>The Calais web service has following advantages:</p>
<ol>
<li>It shall find appropriate metadata for semantic annotation of the input content. The content provider does not have to search for appropriate vocabularies and metadata for semantic annotation of the content.</li>
<li>It shall generate and store RDF triples for the content in a central repository. The content provider does not need to include GRDDL transformations for RDF generation.</li>
</ol>

<p><strong>Conclusion:</strong> The Calais web service provides RDF triples for annotable data in the input content. The Calais web service does not use RDFa for semantic annotation of the content. The web content provider must hand annotate the web content by using RDFa and metadata from RDF triples returned by the Calais web service. An unanswered question is whether Calais web service will use normative metadata from a standard ontology. Use of RDFa and normative metadata are key to building semantic web/<a href="http://dig.csail.mit.edu/breadcrumbs/node/215">Giant Global Graph</a>. The use of <acronym title="Globally Unique Identifier">GUID</acronym> may not be considered as a replacement to GRDDL transformation; the former will require that every time the content is updated the RDF triples stored in Calais repository must be updated and if GUID is changed then all users of this GUID must be notified. By using GRDDL transformation instead of GUID the users of RDF triples need not be notified about any changes to the content. The web server cache may store the latest updates to RDF triples.</p>

References: <br />
[<a name="one">1</a>] <cite>Overview. 2008. Calais. &lt;http://opencalais.mashery.com/Overview&gt; </cite><br />
]]>
        
    </content>
</entry>

<entry>
    <title>Analysis of consumer sentiments for context data</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2008/01/analysis_of_consumer_sentiment.php" />
    <id>tag:itgumbo.ebizq.net,2008:/itcopywrite//26.1472</id>

    <published>2008-01-11T18:12:23Z</published>
    <updated>2008-01-21T15:01:44Z</updated>

    <summary>It is noticed that weblogs more commonly known as &apos;blogs&apos; are not only used to advertise about a product or service features but also to share opinion, give feedback and to review a product or service. Very often blogs are...</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>It is noticed that weblogs more commonly known as 'blogs' are not only used to advertise about a product or service features but also to share opinion, give feedback and to review a product or service. Very often blogs are also used to compare products/services or to give an opinion on an opinion. This chain of comments and reviews not only creates an online social network of web users interested in a product/service but also contains a wealth of context data. Consumer sentiments and context data are the two crucial information elements that are used for business advancement.</p>]]>
        <![CDATA[<h4 title="consumer sentiments">Consumer Sentiments</h4>

<h5>New Ontology</h5> 
<p>The new properties that may be required for expressing consumer sentiments in order to collect context data are shown in sample RDF triples:</p>
<pre style="padding-left:5px;color:navy;font-size:9pt;">
&lt;product IRI&gt; std:affect "liked/disliked/good/bad/recommend/reject"
&lt;product IRI&gt; std:review "short comment"
</pre>
<h5>Example</h5>
<table border="1" bordercolor="navy" style="border-collapse:collapse;" width="80%">
<tr style="background-color:yellow;color:navy;"><th>XHTML+RDFa</th><th>User Agent</th></tr>
<tr>
<td>
xmlns:std="http://www.somestandards.org/<br />
xmlns:dc="http://purl.org/dc/elements/1.1"<br />
<br />
&lt;p about="pearl or pearl store IRI"&gt;... You can find an exclusive &lt;span property="std:size"&gt;small&lt;/span&gt; size &lt;span property="std:color"&gt;natural color&lt;/span&gt; pearl necklace from this store, this shall be a &lt;span property="std:affect"&gt;perfect&lt;/span&gt; match with this &lt;a rel="dc:relation" href="hand bag IRI"&gt;hand bag&lt;/a&gt;. ...&lt;/p&gt;<br />
</td>
<td>
<p about="pearl or pearl store IRI">... You can find an exclusive <span property="std:size">small</span> size <span property="std:color">natural color</span> pearl necklace from this store, this shall be a <span property="std:affect">perfect</span> match with this <font color="blue"><u>hand bag</u></font>. ...
</p>
</td>
</tr>
</table>
<p>The blogger may use other terms such as recommend, suggest, etc. The <font color="blue">object</font> value for property <b>std:affect</b> may be any combination of verb, adverb and adjective. If the blog poster uses RDFa attribute <em>about</em> to identify the 'thing' discussed or referred in the post and attribute <em>property</em> to describe the 'thing' characteristics within the blog post then the data for consumer sentiments and context information can be gleaned from the blog post. Product IRI that is the <font color="blue">subject</font> of the RDF tripple shall be same in all blog posts, the <font color="blue">predicate</font> of this RDF tripple is <b>std:affect</b>. </p>

<h4 title="product context data">Product Context Data</h4>
<img style="margin-left:50px;" src="http://www.itgumbo.com/itcopywrite/images/jan11.png" width="301" height="201" alt="product context data" />
<p>Reference of other products/services by bloggers in a blog post for a particular product/service can provide intelligent context data. E.g. mention of sunshine pearls in a blog post on hand bags provides context information on what size and color pearls match with a hand bag. A business opportunity for both handbags and pearls can be found by the analysis of this context data. The two products are bound to each other with the property <b>dc:relation</b>. The binding may be written in a reverse order as well, it depends on how the sentence is framed. The requirement is to provide semantic annotation of data and to provide marketing tools that can use this context data to find new business opportunity. The tool must be intelligent to resolve <b>std:affect</b> synonym values in order to produce context histograms.</p>
<h5>SPARQL query</h5>
<pre style="padding-left:5px;color:green;font-size:9pt;">
PREFIX std: &lt;http://www.somestandards.org/&gt;
PREFIX dc: &lt;http://purl.org/dc/elements/1.1&gt;
SELECT ?s ?o ?v
WHERE { 
	{ 
	  ?s dc:relation &lt;pearl IRI&gt; .
	} UNION
	{
	  &lt;pearl IRI&gt; dc:relation ?o .
	}
	&lt;pearl IRI&gt; ?p1 "small"@en .
	&lt;pearl IRI&gt; ?p2 "natural"@en .
	FILTER(?p1 = std:size && ?p2 = std:color)
	&lt;pearl IRI&gt; std:affect ?v .
      }
</pre>
<p>This SPARQL query shall select all web resources that are related to "pearl IRI" and have included the given properties for "pearl IRI". The SPARQL query solution shall also include the value of property <b>std:affect</b> used in the web resource (e.g. blog post). Blog category and tags also provide data that may be used to collect relevant information from this free source of online information. The properties that may be used for these are <b>dcmitype:collection</b> and <b>dc:subject</b> respectively.</p>

<p>Microformat <a href="http://microformats.org/wiki/vote-links">vote-links</a> and <a href="http://microformats.org/wiki/hreview">hreview</a> may be embedded for rating. Vote-links microformats for similar purpose may be included as value of <em>rev</em> attribute. The hreview microformat 'rating' may also be used to provide numerical rating. </p>

<p><b>Conclusion:</b> The advantage of subjective opinion on a product through <b>std:affect</b> property is that it has enhanced expressive power as compared to numerical value. The advantage of using RDFa over microformat are obviously the emerging semantic web applications. The purpose of the new properties is to enable annotation of language qualifiers within a sentence, the alternative may be to provide the value of the new properties as radio buttons. But the objective here is to capture the sentiments from within the text of the blog post. It is not possible to provide radio buttons for all products that may be referred to in blog posts. Without this semantic annotation of products and sentiments this useful context data may be lost.</p>

<p>Reference:<br />
<a href="http://dublincore.org/documents/dcmi-terms/">DCMI Metadata Terms</a>, <a href="http://www.w3.org/TR/rdf-concepts/">RDF</a>, <a href="http://www.w3.org/TR/rdfa-syntax/">RDFa</a>, <a href="http://www.w3.org/TR/rdf-sparql-query/">SPARQL</a>.<br />
<a href="http://www.buzzlogic.com/blog/2007/12/sentiment_scoring_partnership.html">Leon, de. Ponce, Sandra. (2007) Sentiment Scoring Partnership With KD Paine. Buzzlogic</a>. </p>]]>
    </content>
</entry>

<entry>
    <title>Consumer requirement for &quot;open market&quot;</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2008/01/consumer_requirement_for_open.php" />
    <id>tag:itgumbo.ebizq.net,2008:/itcopywrite//26.1471</id>

    <published>2008-01-05T14:52:22Z</published>
    <updated>2008-01-21T15:01:44Z</updated>

    <summary>A consumer has asked for help in finding a suitable credit card for credit score 550. How can semantic technology answer the question? The credit report information is available to the Credit Reporting Agency (CRA), the credit card companies who...</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>A <a href="http://www.itgumbo.com/itcopywrite/2007/05/blogs_in_web_30.php">consumer has asked for help</a> in finding a suitable credit card for credit score 550. How can semantic technology answer the question?</p>
<p>The credit report information is available to the Credit Reporting Agency (CRA), the credit card companies who want to offer pre-approved credit cards can search the protected <acronym title="Credit Reporting Agency">CRA</acronym> database for potential customers. The information about credit data available and accessible may be published and advertised through a Web Services Description (WSD) document. Access to the protected database may be controlled with user (the user may be an individual who has a credit record or credit company that can benefit from access to individual data) authentication and authorization. Different services may be made available to different type of users thus the database fields accessible to a user can be controlled. Federal laws for fair credit may be referred for guidelines. Credit agencies may use this data to calculate the credit score according to a proprietary formula or may use the credit score provided by <acronym title="Credit Reporting Agency">CRA</acronym>.</p>]]>
        <![CDATA[<p>Some vocabularies may be defined to enable machine processing of natural language queries such as "<b>Find best credit card for credit score 550</b>" for intelligent answers. The basic vocabularies required may be described as:</p>
<ul>
<li><em>credit report vocabulary</em> - http://www.somestandards.org/cr</li>
<li><em>credit vocabulary</em> - http://www.somestandards.org/credit</li>
</ul>

<p>Some normative metadata terms that may be used to answer this query are:<br />
"xmlns:credit=http://www.somestandards.org/credit#"<br />
"xmlns:std=http://www.somestandards.org/ns#"<br />
"xmlns:dcmitype="http://purl.org/dc/dcmitype/"</p>

<table border="1" bordercolor="blue" style="border-collapse:collapse;width:80%;">
<tr style="background-color:navy;color:white"><th>normative metadata term(<font color="blue">predicate</font>)</th><th>possible values(<font color="blue">object</font>)</th><th>description</th></tr>
<tr><td>credit:csformula</td><td>beacon, empirica, fico</td><td>credit score formula</td></tr>
<tr><td>credit:csvalue</td><td>blank node label</td><td>credit score range and credit rate</td></tr>
<tr><td>credit:scoreMin</td><td>300-900</td><td>Minimum value of credit score</td></tr>
<tr><td>credit:scoreMax</td><td>300-900</td><td>Maximum value of credit score</td></tr>
<tr><td>std:rate</td><td>0-100%</td><td>interest or premium rate</td></tr>
<tr><td>dcmitype:dataset</td><td>Equifax, Experian, TransUnion, etc.</td><td>Name of credit report database</td></tr>
</table>

<p>Credit Reporting Agency (CRA) may describe its service as (RDF triples):</p>
<pre style="color:navy;font-size:9pt;">
&lt;CRA IRI&gt; dcmitype:service "Credit Reporting Agency"@en
&lt;CRA IRI&gt; dcmitype:dataset "Equifax/Experian/TransUnion"
&lt;CRA IRI&gt; credit:csformula "Beacon/Empirica/FICO"
</pre>

<p>Credit Providing Agency (CPA) such as bank, credit card, insurance etc, may describe the service provided as (RDF triples):</p>
<pre style="color:navy;font-size:9pt;">
&lt;CPA IRI&gt; dcmitype:service "Credit Card"@en
&lt;CPA IRI&gt; dcmitype:dataset "Equifax/Experian/TransUnion"
&lt;CPA IRI&gt; credit:csformula "Beacon/Empirica/FICO"
&lt;CPA IRI&gt; credit:csvalue   _:range1
_:range1  credit:scoreMin  "300"^^xsd:integer
_:range1  credit:scoreMax  "400"^^xsd:integer
_:range1  std:rate         "interest/premium"
</pre>

<p>A search engine may translate this user query into a SPARQL query as follows:</p>
<pre style="color:green;font-size:9pt;">
SELECT ?s ?minval ?maxval ?rateval
WHERE {?s  dcmitype:service "^Credit Card"@en .
	?s  credit:csvalue   ?bn .
	?bn credit:scoreMin  ?minval .
	?bn credit:scoreMax  ?maxval .
	?bn std:rate         ?rateval .
	FILTER  (?minval <= 550 &&  ?maxval >= 550)
	}
ORDER BY ?s ASC(?rateval)
</pre>

<p>However the suggested semantic technology method may succeed only if <acronym title="Credit Providing Agency">CPAs</acronym> publish the guidelines for interest/premium rate and credit score relationship. There may be more factors involved other than credit score that shall be used in determining the approval of credit card and interest/premium rates. The human and business factors may overrule technology. The other suggested method to find an answer to this query is to post this question to an appropriate forum. A popular forum for credit card related issues can be found by a simple query:
"<b>popular credit card forum</b>". If the forum has included semantic annotation (RDFa) then it shall be found by the semantic Search Engine. The forum may use the following terms for semantic annotation:</p>
<pre style="color:navy;font-size:9pt;">
&lt;forum IRI&gt; dcmitype:service  "blog/forum"
&lt;forum IRI&gt; dcmitype:collection "Credit Card"@en
&lt;forum IRI&gt; std:popularity "IntegerValue"^^xsd:integer 
</pre>
<p><b>Conclusion:</b> Definition of Credit Report vocabulary and a data security mechanism that may control access to information shall provide enhanced logical solutions for such problems. The ability to compare how different credit card companies consider credit report and credit score shall be the answer to the consumer question. More on std:popularity property and access mechanism later.</p>
<p>References:<br />
<a href="http://www.w3.org/TR/rdf-sparql-query/">SPARQL</a> 
<a href="http://money.howstuffworks.com/credit-report.htm">Credit Report</a> 
<a href="http://money.howstuffworks.com/credit-score.htm">Credit Score</a>
</p>]]>
    </content>
</entry>

<entry>
    <title>Semantic Annotation of Blogs in Web 3.0</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2007/12/semantic_annotation_of_blogs_i.php" />
    <id>tag:itgumbo.ebizq.net,2007:/itcopywrite//26.1470</id>

    <published>2007-12-23T11:51:47Z</published>
    <updated>2008-01-21T15:01:44Z</updated>

    <summary>In order to incorporate the semantic annotation in the blog post, the standard ontology shall have to be used by blogging tools like MovableType, WordPress, etc. These tools use proprietary values for HTML element ‘div’ attributes id, class, etc. to...</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>In order to incorporate the semantic annotation in the blog post, the standard ontology shall have to be used by blogging tools like MovableType, WordPress, etc. These tools use proprietary values for <acronym title="HyperText Markup Language">HTML</acronym> element ‘div’ attributes <em>id</em>, <em>class</em>, etc. to annotate the published content stored on the web server.  The limitation of the present practice of proprietary annotation is that the blogger has to register the blog in various forums by submitting the blog URI and Search Engine indexing is based on keywords defined in the META element and other HTML element attributes like <em>id</em>, etc. As described in the previous <a href="http://www.itgumbo.com/itcopywrite/2007/05/blogs_in_web_30.php">blog post</a> a method is required to automate the process of blog discovery. I.e., classification of blog post is required for categorization. </p>]]>
        <![CDATA[<h4>Common Vocabularies for semantic annotation</h4>
<p>The <a href="http://www.w3.org/Submission/2007/SUBM-sioc-spec-20070612/">Semantically-Interlinked Online Communities</a> (SIOC) core ontology a W3C member submission by <a href="http://www.deri.ie/">DERI Galway</a>, <a href="http://dublincore.org/documents/dcmi-terms/">Dublin Core Metadata</a>, <a href="http://www.w3.org/TR/2005/WD-swbp-skos-core-spec-20051102/">SKOS</a> and <a href="http://xmlns.com/foaf/spec/">FOAF</a> are some of the vocabularies that may be used for semantic annotation of the blog posts. A simplified version of semantically annotated blog post may be achieved with Dublin Core Metadata and FOAF vocabulary. </p>
<div style="font-size:7pt">
xmlns:bg="http://purl.org/dc/elements/1.1/" <br />
xmlns:bgterm="http://purl.org/dc/terms/" <br />
xmlns:bgtype="http://purl.org/dc/dcmitype/" <br /><br />
</div>
<table border="1" bordercolor="navy" style="font-size:7pt;border-collapse:collapse;" width="75%">
<tr style="background-color:yellow;color:navy;"><th>normative metadata</th><th>description</th></tr>
<tr><td> bg:contributor</td><td> comment posters </td></tr>
<tr><td> bg:creator </td><td> author of the post </td></tr>
<tr><td> bg:date </td><td> date of publication </td></tr>
<tr><td> bg:description </td><td> content of the post </td></tr>
<tr><td> bg:identifier </td><td> identifier for the post </td></tr>
<tr><td> bg:publisher </td><td> e.g. IT Gumbo </td></tr>
<tr><td> bg:relation </td><td> URI of related blog posts </td></tr>
<tr><td> bg:subject </td><td> keywords that shall classify the post </td></tr>
<tr><td> bg:title </td><td> title of the post </td></tr>
<tr><td> bgterm:abstract </td><td> either an abstract or selected lines of the post </td></tr>
<tr><td> bgterm:bibliographicCitation </td><td> for references </td></tr>
<tr><td> bgtype:image </td><td> for picture of the poster </td></tr>
</table>
<p>Some other terms from Dublin Core Metadata, <acronym title="Friend Of A Friend">FOAF</acronym> and <acronym title="Simple Knowledge Organization System">SKOS</acronym> may be used. The <a href="http://dmoz.org">DMOZ</a> category can be added for post classification by adding <em>bg:service</em> property. </p>
<h4>User interface for semantic annotated blog post</h4>
<p>The blog posters may not use SIOC vocabulary; instead this vocabulary may be used in the blog tools.
The blog tools may not necessarily use the <a href="http://sparql.captsolo.net/browser/browser.py?url=http%3A//johnbreslin.com/blog/index.php%3Fsioc_type%3Dpost%26sioc_id%3D517">SIOC RDF browser</a> to publish blog content but instead use <a href="http://www.w3.org/TR/rdfa-syntax/">RDFa</a> or <a href="http://microformats.org/">microformats</a> for semantic annotation and may include a GRDDL transformation. GRDDL transformation shall be used by semantic web applications to generate RDF/XML for the blog post. Refer table <a href="#table2">[2]</a>, column 2. The blogging tool shall use XHTML+RDFa to publish blog post, <u>it is not necessary to visually render <acronym title="Resource Description Framework">RDF</acronym>/<acronym title="Web Ontology Language">OWL</acronym> class & properties</u> these are embedded in the published content in order to annotate the content for semantic web applications. Refer table <a href="#table1">[1]</a>, column 2. The published semantic annotated blog post shall reside on the web server. There may be no visible change in the user interface for posting a blog or in the published post.</p>
<p>The blogging tool may continue to use its proprietary <a href="http://www.w3.org/TR/xforms11/">XForms</a> or <a href="http://www.w3.org/TR/html401/interact/forms.html">HTML forms</a> for user input interface. Tool may either include <a href="http://www.w3.org/TR/REC-xml-names#NT-QName">Qnames</a> for normative metadata in XForms model to generate table <a href="#table2">[2]</a>, column 1 submission or if proprietary tags are used in XForms model then a mapping to normative metadata terms shall be required in order to publish <acronym title="Extensible HyperText Markup Language">XHTML</acronym>+<acronym title="RDF in HTML">RDFa</acronym>.</p>

<p>The tools that use XForms or HTML forms for user input and standard ontology for semantic application <u>must be able to publish web content with RDFa semantic annotation</u> in order to make the content useful for semantic web applications.</p>
<h4>Elimination of redundant metadata</h4>
<p>If there are no new characteristics added to a Class or Property, it is better idea to use the base property rather than defining a new property as sub-property-of another property. Example: <em>sioc:subject</em> is defined as sub-property-of <em>dc:subject</em>. The issue here is that sioc prefix expands to <span style="font-size:7pt;">http://rdfs.org/sioc/ns#</span> namespace and dc prefix expands to <span style="font-size:7pt;">http://purl.org/dc/elements/1.1/</span> namespace. Another new vocabulary <span style="font-size:7pt;">http://some.org/newvocabulary/</span> may define a new property <em>xyz:differentLocalPart</em> as a sub-property-of <em>dc:subject</em>. In order to answer a user query “applications of SIOC" the Search Engine SPARQL agent shall have to select all web resources that use any of these normative metadata terms with value 'SIOC'.  This may not be an easy task for <acronym title="SPARQL Protocol and Query Language">SPARQL</acronym> agent because the query must then find and include all evolving similar normative metadata terms such as <em>sioc:subject</em>, <em>dc:subject</em>, <em>xyz:differentLocalPart</em> and so on.</p>
<p style="font-size:7pt">Note: Web users may use different prefix for the same <acronym title="Extensible Markup Language">XML</acronym> Namespace, here in this blog post prefix ‘bg’ has been used to suggest that a standard ontology is required for blog annotation (to continue discussion with previous <a href="http://www.itgumbo.com/itcopywrite/2007/05/blogs_in_web_30.php">post</a> on this topic). This standard ontology may be one of the existing common vocabularies (e.g. Dublin Core Metadata).</p>                                   
<h4>Advantages of standard ontology</h4>
<p>The advantages of defining and using standard ontology are:<br />
<ul>
<li>Redundant normative metadata is eliminated,</li>
<li>Ambiguity due to similar vocabularies is resolved,</li>
<li><acronym title="World Wide Web">WWW</acronym> user shall not be inhibited of writing content for ‘semantic web’; some web users may not use blog tools and may hand-type semantic annotation in the published content. Single source of information (i.e. one common vocabulary) with one standard ontology to annotate a ‘thing’ shall address the first two points.</li>
<li>If normative metadata from same vocabulary is used to annotate blog posts and non-blog content then more data can be found for a keyword or category. Example: refer example in the table <a href="#table1">[1]</a>, <font color="blue">predicate</font> is <em>bg:subject</em>, <font color="blue">object</font> is SIOC and <font color="blue">subject</font> is the <acronym title="Internationalized Resource Identifier">IRI</acronym> of the blog post. Non-blog articles may also embed this property for keyword/tag ‘SIOC’. There shall be more number of IRIs in the <a href="http://www.itgumbo.com/itcopywrite/2007/11/web_resource_classification_fo.php">IRI database</a> for <em>bg:subject</em>='SIOC'.</li>
</ul>

<p><font color="brown"><strong>Conclusion</strong></font>: It is not necessary that browser must render the ontology in XForms, HTML forms or published content. <strong>The blogging tool must be able to validate and process the <acronym title="Extensible Markup Language">XML</acronym> data received from the XForms or HTML forms user interface and use RDFa to embed standard ontology in published content.</strong> Same normative metadata must be used for annotation of blog posts or other published web content in order to provide more data for semantic web applications.</p>

<table border="1" bordercolor="navy" style="font-size:7pt;border-collapse:collapse;" width="100%" id="table1">
<caption>Table 1: Published Blog Post</caption>
<tr style="background-color:yellow;color:navy;">
<th> Web 2.0 annotation of blog post </th>
<th>Semantic annotation of blog post</th>
</tr>
<tr>
<td>
&lt;body&gt;<br />
&lt;h2 id="archive-title"&gt;Blogs in Web 3.0&lt;/h2&gt;<br />
&lt;div class="entry"&gt;<br />
&lt;p&gt; Content of the post. &lt;/p&gt;<br />
&lt;div class="post_author_info"&gt;<br />
Written by &lt;br /&gt;<br />
&lt;span&gt;Ila Nivas&lt;/span&gt;<br />
&lt;/div&gt;<br />
&lt;/div&gt;<br />
<br />
Tags: <br />
&lt;a href="Link to SIOC articles" rel="tag"&gt;SIOC&lt;/a&gt;<br />
&lt;a href="Link to RDFa articles" rel="tag"&gt;RDFa&lt;/a&gt;<br />
&lt;a href="Link to Blogs articles" rel="tag"&gt;Blogs&lt;/a&gt;<br />
<br />
&lt;/body&gt;<br />
</td>
<td>
&lt;body&gt;<br />
&lt;div about=""&gt;<br />
&lt;h2 property="bg:title"&gt;Blogs in Web 3.0&lt;/h2&gt;<br />
&lt;div property="bg:description"&gt;<br />
 Content of the post <br />
&lt;/div&gt;<br />
&lt;p&gt;<br />
Written by &lt;br /&gt;<br />
&lt;span property="bg:creator foaf:person"&gt;Ila Nivas&lt;/span&gt;<br />
&lt;/p&gt;<br />
&lt;!-- List of keywords --&gt;<br />
&lt;span property="bg:subject"&gt;SIOC&lt;/span&gt;<br />
&lt;span property="bg:subject"&gt;RDFa&lt;/span&gt;<br />
&lt;span property="bg:subject"&gt;Blogs&lt;/span&gt;<br />
&lt;!-- Category --&gt;<br />
&lt;span property="bg:service"&gt;Weblog&lt;/span&gt;<br />
&lt;span property="bg:service"&gt;Information Technology&lt;/span&gt;<br />
&lt;!-- Publisher --&gt;<br />
&lt;span property="bg:publisher"&gt;IT Gumbo&lt;/span&gt;<br />
&lt;/div&gt;<br />
&lt;/body&gt;<br />
</td>
</tr>
</table>
<br />
<table border="1" bordercolor="navy" style="font-size:7pt;border-collapse:collapse;" width="100%" id="table2">
<caption>Table 2: Structure of Blog Data</caption>
<tr style="background-color:yellow;color:navy;">
<th> XML data submitted to XML processor</th>
<th> RDF/XML data extracted from published content</th>
</tr>
<tr>
<td>
&lt;blogpost xmlns:bg="http://purl.org/dc/elements/1.1/"<br />
<span style="padding-left:8ex">xmlns:foaf="http://xmlns.com/foaf/0.1/"&gt;<br />
<div style="padding-left:2ex">
&lt;bg:title&gt;Blogs in Web 3.0&lt;/bg:title&gt;<br />
&lt;bg:description&gt;Content of the post&lt;/bg:description&gt;<br />
&lt;bg:creator&gt;Ila Nivas&lt;/bg:creator&gt;<br />
&lt;foaf:person&gt;Ila Nivas&lt;/foaf:person&gt;<br />
&lt;bg:subject&gt;SIOC&lt;/bg:subject&gt;<br />
&lt;bg:subject&gt;RDFa&lt;/bg:subject&gt;<br />
&lt;bg:subject&gt;Blogs&lt;/bg:subject&gt;<br />
&lt;bg:service&gt;Weblog&lt;/bg:service&gt;<br />
&lt;bg:service&gt;Information Technology&lt;/bg:service&gt;<br />
&lt;bg:publisher&gt;IT Gumbo&lt;/bg:publisher&gt;<br />
</div>
&lt;/blogpost&gt;<br />
</td>
<td>
&lt;?xml version="1.0"?&gt;<br />
&lt;rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"<br />
<span style="padding-left:8ex">xmlns:bg="http://purl.org/dc/elements/1.1/" </span><br />
<span style="padding-left:8ex">xmlns:foaf="http://xmlns.com/foaf/0.1/"&gt;</span><br />
&lt;rdf:Description rdf:about="Blog IRI"&gt;<br />
<div style="padding-left:2ex">
&lt;bg:title&gt;Blogs in Web 3.0&lt;/bg:title&gt;<br />
&lt;bg:description&gt;Content of the post&lt;/bg:description&gt;<br />
&lt;bg:creator&gt;Ila Nivas&lt;/bg:creator&gt;<br />
&lt;foaf:person&gt;Ila Nivas&lt;/foaf:person&gt;<br />
&lt;bg:subject&gt;SIOC&lt;/bg:subject&gt;<br />
&lt;bg:subject&gt;RDFa&lt;/bg:subject&gt;<br />
&lt;bg:subject&gt;Blogs&lt;/bg:subject&gt;<br />
&lt;bg:service&gt;Weblog&lt;/bg:service&gt;<br />
&lt;bg:service&gt;Information Technology&lt;/bg:service&gt;<br />
&lt;bg:publisher&gt;IT Gumbo&lt;/bg:publisher&gt;<br />
</div>
&lt;/rdf:Description&gt;<br />
&lt;/rdf:RDF&gt;<br />
</td>
</tr>
</table>]]>
    </content>
</entry>

<entry>
    <title>Web Resource Classification for Search Engine Taxonomy</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2007/11/web_resource_classification_fo.php" />
    <id>tag:itgumbo.ebizq.net,2007:/itcopywrite//26.1469</id>

    <published>2007-11-12T06:26:14Z</published>
    <updated>2008-01-21T15:01:44Z</updated>

    <summary>As more data on WWW is made available with semantic annotation of web resources the categorization of web resources based on classification with characteristics identifiable by normative metadata shall be the key to development of semantic web applications. The inclusion...</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>As more data on <acronym title="World Wide Web">WWW</acronym> is made available with semantic annotation of web resources the categorization of web resources based on classification with characteristics identifiable by normative metadata shall be the key to development of semantic web applications. The inclusion of normative metadata from standard ontology as web resource descriptors in POWDER <acronym title="Description Resource">DR</acronym> or embedded in the web content with RDFa shall provide data for search engine indexing to build <acronym title="Search Engine">SE</acronym> taxonomy.</p>
<ul>
<li>Characteristics are properties of a 'thing'.</li>
<li>Classification is the identification of characteristics for categorization.</li>
<li>Categorization is grouping of resources with same characteristics.</li>
<li>Taxonomy is the process of classification.</li>
</ul>
<p>The value of HTML language elements and attributes such as <em>content</em> attribute in META element with attribute <em>name</em>='keywords', heading elements, <em>alt</em>, <em>name</em>, <em>title</em>, <em>hreflang</em> and <em>media</em>, etc. are used by search engines to classify web content. While these attributes may still be used for web resource classification, the normative metadata shall extend the vocabulary that is used for web resource classification. The difference is that HTML element and attribute tag <u>value</u> and not tag name are used for web resource classification, the RDF/OWL class <u>property</u> (<span style="color:blue;">predicate</span>) as well as <u>values</u> (<span style="color:blue;">object</span>) shall be used for classification of semantic web resources (<span style="color:blue;">subject</span>). It is important to note that HTML tag names do not mean <em>name</em> attribute value but tag names such as META, TH, A, H1-H6, etc. </p>]]>
        <![CDATA[<h4>Search Engine Indexing</h4>
<p>Search engines may use the web resource classification to build taxonomy for efficient search. Since the web resources (<span style="color:blue;">subjects</span>) with same classification, i.e. &lt;<span style="color:blue;">predicate</span>, <span style="color:blue;">object</span>&gt; tuple shall be categorized into one category, the hierarchical taxonomy may be built with normative metadata from standard ontology.</p> 

<p><img src="http://www.itgumbo.com/itcopywrite/images/nov6.jpg" alt="Semantic Web Taxonomy" align="center" width="401" height="271" /></p>

<p>In semantic web all data is represented in the form of &lt;<span style="color:blue;">subject</span>, <span style="color:blue;">predicate</span>, <span style="color:blue;">object</span>&gt; triples. The <span style="color:blue;">object</span> is a value of the <span style="color:blue;">predicate</span>, i.e. it is the value of a <span style="color:blue;">subject</span> characteristic that is identified by the <span style="color:blue;">predicate</span>. The <span style="color:blue;">object</span> may be another web resource or a literal value. Hence all web resources with the characteristic identified by <span style="color:blue;">predicate</span> shall have one of the well-known values for <span style="color:blue;">object</span>. If the <span style="color:blue;">object</span> value is an integer typed literal, <acronym title="Search Engine">SE</acronym> may use <span style="color:blue;">predicate</span> only for classification. This web resource classification is used to build <acronym title="Search Engine">SE</acronym> taxonomy for web resources where <acronym title="Internationalized Resource Identifier">IRI</acronym> database is build by characteristics <span style="color:blue;">predicate</span> and/or <span style="color:blue;">object</span>. The figure illustrates the SE hierarchical taxonomy derived from the web resources RDF graph for &lt;<span style="color:blue;">subject</span>, <span style="color:blue;">predicate</span>, <span style="color:blue;">object</span>&gt; triple.</p>
<p>In the figure the web resources in the <acronym title="Internationalized Resource Identifier">IRI</acronym> database include either of the following &lt;<span style="color:blue;">predicate</span>, <span style="color:blue;">object</span>&gt; tuple in the RDFa annotation of the web content or in POWDER <acronym title="Description Resource">DR</acronym>. </p>
<div style="color:green;font-size:9pt;padding-left:5ex; padding-top:1ex; padding-bottom:1ex;">
dcmitype:service "ISP" <br />
dcmitype:service "Internet Service Provider" <br />
dcmitype:service "Electric Service Provider" <br />
dcmitype:service &lt;http://www.somestandards.com/ISP&gt; <br />
dcmitype:service &lt;http://www.somestandards.com/ESP&gt; <br />
</div>
<p>The web content providers have classified the web resources ('<b>thing</b>'); <acronym title="Search Engine">SE</acronym> shall use these classification characteristics to build taxonomy for WWW <acronym title="Internationalized Resource Identifier">IRIs</acronym> by grouping all web resources with same characteristics. All web resource with dcmitype:service property are further classified by the value of this property. The natural language query "ISP" submitted to the search engine by the user is transformed into the following SPARQL query by the search engine.</p>
<div style="color:green;font-size:9pt;padding-left:5ex;padding-top:1ex; padding-bottom:1ex;">
SELECT ?s WHERE <br />
{ <br />
 <span style="padding-left:2ex;">?s dcmitype:service "ISP", "Internet Service Provider" .</span> <br />
 <span style="padding-left:2ex;">?s dcmitype:service &lt;http://www.somestandards.com/ISP&gt; .</span> <br />
} <br />
</div>
<p><span style="font-size:9pt;">Note: Search engine may add other expansions of "ISP" abbreviation to the query. User must enter desired expansion for efficient query.</span><br />
</p>
<p><span style="color:brown;font-weight:bold;">Conclusion:</span> A natural language query submitted by the web user that is converted to SPARQL query with relevant normative metadata corresponding to the user query terms can be answered by traversing the <acronym title="Search Engine">SE</acronym> hierarchical taxonomy to retrieve web resources from the <acronym title="Internationalized Resource Identifier">IRI</acronym> database.</p>
<p>References: 
<a href="http://www.w3.org/2007/powder/">POWDER</a>, 
<a href="http://www.w3.org/RDF/">RDF</a>, 
<a href="http://www.w3.org/TR/2007/WD-rdfa-syntax-20071018/">RDFa</a>, 
<a href="http://www.w3.org/2001/sw/DataAccess/">SPARQL</a>.
</p>]]>
    </content>
</entry>

<entry>
    <title>Why use POWDER for web resource description?</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2007/10/why_use_powder_for_web_resourc.php" />
    <id>tag:itgumbo.ebizq.net,2007:/itcopywrite//26.1468</id>

    <published>2007-10-11T13:08:04Z</published>
    <updated>2008-01-21T15:01:44Z</updated>

    <summary>POWDER specifications provide mechanisms for defining and identifying trust worthy web resource descriptions. The web resource description is the description of web resource characteristics described with normative metadata. The use of normative metadata is the basis of proof and trust...</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>POWDER <a href="http://www.w3.org/2007/powder/">specifications</a> provide mechanisms for defining and identifying trust worthy web resource descriptions. The web resource description is the description of web resource characteristics described with normative metadata. The use of normative metadata is the basis of <b>proof</b> and <b>trust</b> requirements of semantic web. The metadata terms defined in the POWDER specifications provide a structure to semantic annotation of a web resource.</p>]]>
        <![CDATA[<p>We <a href="http://www.itgumbo.com/itcopywrite/2007/04/transition_to_semantic_web_par_1.php">identified three types of metadata</a> that can be used to describe a web resource. RDFa can be used to embed this metadata in the web content. The POWDER Description Resources can be used to separate the normative metadata that describes the web resource characteristics from the web content. The characteristic properties identified with normative metadata can be included as DR descriptors. <a href="http://www.w3.org/TR/2007/WD-powder-dr-20070925/#structure">POWDER specifications</a> suggest that descriptors "<i> DO NOT describe the DR, rather, they describe the resources in the Resource Set that is the scope of the DR</i>".</p>

<table style="background-color:lightcyan">
<tr><td>
<b><u>Example</u>:</b>
<table border="1" style="border-color:navy">
<tr style="background-color:navy;color:white;"><th>RDFa</th><th>POWDER</th></tr>
<tr><td>
&lt;p&gt;The xyz.com is an <br />
&lt;span property="dcmitype:Service" &gt;<br />
<span style="padding-left:3ex">ISP<br />
&lt;/span&gt;.<br />
&lt;/p&gt;
</td>
<td>
&lt;wdr:hasDescriptors&gt;<br />
<span style="padding-left:3ex">&lt;wdr:Descriptors&gt;</span><br />
<span style="padding-left:6ex">&lt;dcmitype:Service&gt;</span><br />
<span style="padding-left:8ex">ISP</span><br />
<span style="padding-left:6ex">&lt;/dcmitype:Service&gt;</span><br />
<span style="padding-left:3ex">&lt;/wdr:Descriptors&gt;</span><br />
&lt;/wdr:hasDescriptors&gt;
</td>
</tr>
</table>
</td></tr>
</table>

<p> All web pages on xyz.com may link to the DR that includes the given descriptor, whereas if this characteristic <i>dcmitype:Service</i> is embedded with RDFa then all web pages shall include RDFa statement. However the link to POWDER DR does not preclude the use of RDFa.</p>
<p>The metadata terms defined in POWDER can be used to:</p>
<ul style="list-style:square none outside">
<li>to authenticate normative metadata used (e.g. DR validity and trust properties)</li>
<li>structure the semantic annotation of the web resource (e.g. web resource classification and resource grouping)</li>
<li>avoid repetition of same normative metadata in multiple web resource content (e.g. include link to relevant Description Resource)</li>
<li>enable fast parsing of web content for SPARQL query solutions required for "<b><font color="teal" face="comic sans ms">open market</font></b>" and other semantic web applications (e.g. Resource Set definitions)</li>
</ul>]]>
    </content>
</entry>

<entry>
    <title>Consumer personal and context data security concerns</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2007/09/consumer_personal_and_context.php" />
    <id>tag:itgumbo.ebizq.net,2007:/itcopywrite//26.1467</id>

    <published>2007-09-26T12:24:47Z</published>
    <updated>2008-01-21T15:01:44Z</updated>

    <summary>Ubiquitous internet access and globalization of human civilization has encouraged both sellers and buyers to approach each other beyond national borders. Irrespective of language, law and culture limitations people want to know, learn and practice new ways of life....</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Security" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>Ubiquitous internet access and globalization of human civilization has encouraged both sellers and buyers to approach each other beyond national borders. Irrespective of language, law and culture limitations people want to know, learn and practice new ways of life.</p>]]>
        <![CDATA[<p>This intermingling of people has encouraged web service providers to cater to people from different parts of society, honor laws and language of different nations. Web service providers collect visitor personal and context data in order to know about :</p>
<ul>
<li>the popularity and reach ability of their service</li>
<li>the consumer interest to provide better service</li>
</ul>
<p>The collected data is used internally for marketing research and much useful information is extracted. This information is used by the service provider to advance their business by providing better online service specially implemented for different parts of the world, individualized services for the consumer and may also be released to research organizations for global research. This silent and hidden data collection inflicts fear in consumers that result in either apprehension in use or prohibition of web service. <b><i>The competition between service providers inhibit sharing of collected data specification</i></b>. As a result different data may be collected by web service providers and the consumer is unaware about the personal data that is collected by the website. The internet user has many questions:</p>
<ul>
<li>What user data is collected by the website?</li>
<li>What will web service provider do with this data? </li>
<li>What are the risks in sharing data?</li>
<li>What are the advantages of sharing data?</li>
<li>How can provider inform consumer about the data that is collected?</li>
<li>How can consumer control collection of data? </li>
<li>How does provider secure collected data?</li>
</ul>
<h4>What user data is collected by the website?</h4>
<p>Some samples of data that may be collected by the website are listed here, this is not an exhaustive list of data collected by the website.</p>
<ul>
<li><p><b>Personal data:</b> Website may silently collect data for traffic statistics, this data is also called <a href="http://www.w3.org/TR/P3P/#Internet_Addresses">log data</a> and include visitor IP address, type of browser used, IRI of web pages visited, time spent per page etc. Website may ask user to input data in online forms, this may be contact information, credit card information or individual interest information.</p></li>
<li><p><b>Context data:</b> Website may also collect context data based on user behavior such as inlink IRI to know the point of reference to the website, outlink IRI to know how and when the user exits, whether he/she returns back to the website after visiting the external reference. What search terms are entered by the user in the "Google Search" box provided on the website, whether the search is conducted on the WWW or on the website? What external search terms directed the user to this website?</p></li>
</ul>

<h4>What will web service provider do with this data?</h4>
<p>Many business and marketing decisions are driven based on the personal and context data collected by the websites. This data is used in a similar manner as free/paid surveys. If some 'n' number of visits from a region show an interest in a product this reflects the popularity of the product in that area. This may lead to marketing research by the organization to find business opportunity.</p>
<ul>
<li><p><b>Personal data:</b> Website shall use personal data to provide secure access to information and allow users to save private context data on web servers. Most e-business will provide login/password in order to protect their online data, to collect personal/context data, to allow user to store e-business context information such as current orders, order status, order history etc. Service provider can send new sale intimation to the user e-mail address or home/office address, etc.</p></li>
<li><p><b>Context data:</b> Website shall use context data to advance their business and provide better service to the consumer. An interest of multiple users in a product that is not available on e-business website may encourage the provider to explore the business opportunity. If a user is not able to find the relevant data on the website or if there is associated information to the user search terms, personalized e-mail may be sent to the user e-mail address.</p></li>
</ul>

<h4>What are the risks in sharing data?</h4>
<p>Spam, spoofing and phishing are some of the risks of online data transactions; consumer apprehension and prohibition are the dangers of data collection. Denial of service by the provider to the consumer due to latter's refusal to share data can be former's policy. Stolen identity may disturb finances and peace.</p>
<ul>
<li><p><b>User to provider:</b> Log data is the visitor foot print on the internet. In addition to the website data collection, at home or in office the web user activity can be watched. An individual may be targeted by international terrorists or internet culprits by tracking log data. In case of solicited data the consumer consciously provides data by agreeing to the published privacy policy.</p></li>
<li><p><b>Service provider to 3<sup>rd</sup> party:</b> The contract between the two parties governs the data ownership and security policies. The web user who provided the data may not be aware about the identity of the 3<sup>rd</sup> party who has access to data. The security breach due to 3<sup>rd</sup> party negligence may result in consumer annoyance. The service provider is at the risk of loosing the consumer unless it is ensured that 3<sup>rd</sup> party will also honor the consumer interest.</p></li>
</ul>

<h4>What are the advantages of sharing data?</h4>
<p>Consumers can receive personalized service. <b><i>Data shared is used for research, business advancement and new inventions</i></b>. If you like thing X with color Y and you search for it on the internet it is very likely that the search term is logged. This search may lead to a new product X with color Y in your vicinity soon. The log data is also used to catch international terrorists and internet culprits such as hackers and spammers.</p>

<h4>How can provider inform consumer about the data that is collected?</h4>
<p>Most websites will include a "Privacy Policy" web page to inform the user about the data collected. This web page will include human readable text description of the policy and is written in the regional language. The human readable text is to be trusted by the consumer, the service provider may display the regulatory compliance and audit seals and certificates on the website. Web technologies have defined <a href="http://www.w3.org/TR/P3P">P3P</a> specifications for machine-readable policies. Provider can publish and configure different P3P policies for sections of website or have a single policy for the complete website. If website uses cookies to collect data, the P3P policy must include the policy for the data collected by the cookie. The compact P3P policy is published to control the cookie data collection and is carried in the HTTP response header, the browser can decode this policy and prompt the user according to browser privacy settings. The advantage of providing website P3P policy is that it provides better control to the visitor on the data that is collected by the website. A website that has deployed P3P policy will not collect personal identifiable data with cookies unless the P3P policy has been fetched by the browser and the fetched policy complies with browser privacy setting configured by the user.</p>
<p>P3P specifications provide standard structures for the commonly collected log data and a mechanism by means of EXTENSION element to include proprietary log data structure. The log data structure in P3P policy along with the text description in human readable policy will eliminate the user fear of unknown data collection policy and also educate the web user about the significance of the collected data. </p>

<h4>How can consumer control collection of data?</h4>
Consumer can control the collection of personal/context data by not visiting the websites that have published data collection policy not amiable to the consumer. Web technologies provide the following mechanisms for consumers to control data collection:
<ul>
<li><p><b>Set policy preference in browser:</b> The privacy option in internet settings has options to configure the 1<sup>st</sup> party and 3<sup>rd</sup> party cookie acceptance. This setting will allow/block cookies irrespective of web page P3P policy.</p></li>
<li><p><b>Access mechanism in P3P policy:</b> The access mechanism in P3P has three options: "always", "opt-in", "opt-out". The data that is flagged as "always" does not provide any option to the website visitor and is always collected. The website visitor can choose to provide or reject request for data that is flagged "opt-in" or "opt-out" respectively, the mechanism to "opt-in" or "opt-out" is specified either in the human readable privacy policy or may be provided dynamically as forms or pop-ups on the website.</p></li>
<li><p><b>Interactive user interface on website:</b> Websites that do not publish P3P policy support forms & pop-ups to seek user consent for data collection.</p></li>
</ul>

<h4>How does provider secure collected data?</h4>
The collected data is secured by the web service provider:
<ul>
<li><p><b>During data collection transaction:</b> Data is collected over Secure Socket Layer (SSL) connections. SSL connections encrypt the data and provide checksum, digital certificate protection against data corruption and spoofing.</p></li>
<li><p><b>In storage during retention period</b>: Data is stored on servers behind firewall to protect against unauthorized access. Strict information security policies are practiced within the organization to avoid security breach due to computer/network system failure or employee mishandling.</p></li>
</ul>
After the published data retention period is over the web service provider must destroy the collected data and ensure that destroyed data is not accessible. The compliance of the organization with the necessary security standards and regulatory laws is displayed on the websites in the form of seals and certificates.

<p><b><font color="brown">Conclusion:</font></b> Build a website that can infuse confidence in visitors by informing what data is collected, how this data is secured and how this data is used. Unless necessary provide an open access to information on the website and collect minimal data. Many free content websites require user login/e-mail and password to collect user behavior context data, this may inhibit many interested readers. The syndication methods like RSS/ATOM feeds may be used to communicate with visitors. <i>Add machine-readable P3P and human readable text policies to the website to provide better consumer control on data</i>. <a href="http://www.w3.org/TR/2002/NOTE-p3pdeployment-20020211
">Web technologies</a> must be used to build simple and secure website. The integration of P3P policies stored on web server with user interface i.e. browser and data storage systems behind the firewall will <b>automate</b> the <u>data collection process</u> and <u>security audits</u> thus infuse trust in the web user. Data rich applications such as "<b><font color="teal" face="comic sans ms">open market</font></b>" for semantic web and smart stores will provide better data collection policy transparency with integrated P3P policies.</p>]]>
    </content>
</entry>

<entry>
    <title>Data Security in Semantic Web</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2007/08/data_security_in_semantic_web.php" />
    <id>tag:itgumbo.ebizq.net,2007:/itcopywrite//26.1466</id>

    <published>2007-08-31T15:07:18Z</published>
    <updated>2008-01-21T15:01:44Z</updated>

    <summary>Data is the building block of the semantic web; therefore security of all data in the semantic web is the essential requirement for the success of semantic web. The web service provider and web service requester data must be secured....</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>Data is the building block of the semantic web; therefore security of all data in the semantic web is the essential requirement for the success of semantic web. The web service provider and web service requester data must be secured. Web services specifications and semantic technologies provide mechanisms to define and execute <a href="http://www.nivasgroup.com/wl/NGWeeklyLetter17Jun07.pdf">security policies</a> for different types of data. Data that must be secured is:</p>

<ul>
<li><p><font color="navy"><b>Published data -</b></font> Data that has been published by the web service provider as HTML web content or is accessible according to Message Exchange Pattern (MEP) described in Web Service Definition (WSD) document.</p></li>
<li><p><font color="navy"><b>Concealed data -</b></font> Data that is stored in the internal storage media of the web service provider and is not available for public access. Data that is stored in the personal computer storage media of the web user who connects to the semantic web.</p></li>
<li><p><font color="navy"><b>Personal Identifiable Information -</b></font> Data that is provided by the web service requester and is collected by the web service provider.</p></li>
</ul>]]>
        <![CDATA[<p>In addition to data security it is important to ensure web service requester and web service provider integrity is secure. To ensure integrity security the providers must ensure that information provided does not break the information exchange contract between the provider and the requester, i.e. the wire messages are constructed in accordance with WSD and are processed according to the published policies. In semantic web where the automated web service integration is promoted the WSD documents and policy documents must be carefully elaborated to ensure that no unwarranted access to information is allowed. The web service requester & provider must provide the message IOPE with ontology references (<i>modelReference</i> and <i>schemaMapping</i>) and policy definitions to find the best match web service from the semantic web service registry. </p>

<p><br />
<b><font color="teal" size="+1">Example:</font></b></p>

<p>There is a video library that maintains an online database of all disks. Assume that this library wants to provide a special service for non-adult members. The non-adult members can host access to the library database on their web site in order to post reviews, blog and create discussion forums. The views of one community created on a web site can be shared with another community through library database and thus cultural information in different localities is shared. For the benefit and interest of its non-adult members the library hosts the community profile and views on the library web server. The library web service provides policy alternatives to control access to the database for different rating (TV-14, PG, R, etc.) movies. This suggests that the member web sites must select the movie rating for membership and include rating in all messages. </p>

<table border="1">
<tr bgcolor="yellow">
<th>Title</th><th>Rating</th>
</tr>
<tr><td>Heroes</td><td>TV-14</td></tr>
<tr><td>Death Sentence</td><td>R</td></tr>
</table>

<p>If the metadata terms that describe the audio/video record in the library database are proprietary tags of this library then the web requesters (non-adult members) who want to integrate with several libraries around the globe will have to manually read the MEP and policies of library web services from XML files to find appropriate security for content that will be published on their web sites. Use of normative metadata (title, rating) from a standard ontology (library movie record) that describes library records will enable automated discovery and integration with library web services. The web requester can find library web services with a particular rating movie database from the semantic web registry.  </p>

<table border="1">
<tr bgcolor="yellow">
<th>Library WSD</th><th>Library member WSD</th>
</tr>
<tr>
<td><font size="-2">
&lt;owl:DatatypeProperty rdf:ID="hasRating"&gt;<br />
<span style="padding-left:10px">
	&lt;rdf:domain rdf:resource="#libMovieRec" /&gt;<br />
</span>
<span style="padding-left:10px">
	&lt;rdf:range rdf:resource="#listMovieRating" /&gt;<br />
</span>
&lt;/owl:DatatypeProperty&gt;<br />
<br />
&lt;owl:Class rdf:ID="libIntPreCond"&gt;<br />
<span style="padding-left:10px">
	&lt;rdfs:subClassOf rdf"resource="#libMovieRec" /&gt;<br />
</span>
<span style="padding-left:10px">
	&lt;rdfs:subClassOf&gt;<br />
</span>
<span style="padding-left:10px">
		&lt;owl:Restriction&gt;<br />
</span>
<span style="padding-left:25px">
			&lt;owl:onProperty rdf:resource="#hasRating" /&gt;<br />
</span>
<span style="padding-left:25px">
			&lt;owl:allValuesFrom rdf:resource="#listMovieRatingNA" /&gt;<br />
</span>
<span style="padding-left:10px">
		&lt;/owl:Restriction&gt;<br />
</span>
<span style="padding-left:10px">
	&lt;/rdfs:subClassOf&gt;<br />
</span>
&lt;/owl:Class&gt;<br />
</font></td>
<td><font size="-2">
&lt;owl:Class rdf:ID="memIntPreCond"&gt;<br />
<span style="padding-left:10px">
	&lt;rdfs:subClassOf rdf"resource="#libMovieRec" /&gt;<br />
</span>
<span style="padding-left:10px">
	&lt;rdfs:subClassOf&gt;<br />
</span>
<span style="padding-left:10px">
		&lt;owl:Restriction&gt;<br />
</span>
<span style="padding-left:25px">
</span>
			&lt;owl:onProperty rdf:resource="#hasRating" /&gt;<br />
</span>
<span style="padding-left:25px">
			&lt;owl:hasValue rdf:resource="TY-14" /&gt;<br />
</span>
<span style="padding-left:10px">
		&lt;/owl:Restriction&gt;<br />
</span>
<span style="padding-left:10px">
	&lt;/rdfs:subClassOf&gt;<br />
&lt;/owl:Class&gt;<br />
</font></td>
</tr>
</table>

<p>In this example, <b>libMovieRec</b>, <b>listMovieRating</b> are objects from a standard ontology. The object <b>listMovieRatingNA</b> is the list of movie rating values for non-adults that is supported by this library. The library member supports access to TY-14 rating movies only on the member web site. Therefore the library member WSD document has precondition for message output parameter element to have value TY-14 for rating. This member web service will integrate with those library web services that provide access to TY-14 movie database. The library has provided access to all non-adult movies on this interface and adult movies may be accessed on another interface . The <i>precondition</i> child element in the <i>operation</i> element of an interface IOPE is used to suggest the rating value accepted on this interface. The rating value is an input parameter element for the library web service. Another member who wants access to R movies will not be able to integrate with this service of library.</p>

<p><br />
<p><font color="brown" size="+1"><b>Conclusion:</b></font> The semantic annotation (reference of standard ontology) in web service WSD document will provide secure interface for automated integration of web services.</p></p>]]>
    </content>
</entry>

<entry>
    <title>Encoding normative metadata context</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2007/07/encoding_normative_metadata_co.php" />
    <id>tag:itgumbo.ebizq.net,2007:/itcopywrite//26.1465</id>

    <published>2007-07-09T14:20:29Z</published>
    <updated>2008-01-21T15:01:44Z</updated>

    <summary>Here it is suggested that if normative metadata terms can be represented by alphanumeric codes defined by respective standards organization associated with the product, then this will simplify the web services development and make search more efficient. The Web Service...</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>Here it is suggested that if normative metadata terms can be represented by <a href="http://www.itgumbo.com/itcopywrite/2007/06/conference_to_address_metadata.php">alphanumeric codes defined by respective standards organization</a> associated with the product, then this <i>will simplify the web services development and make search more efficient</i>. The Web Service Definition (WSD) and Functional Definition (FD) documents will then use this normative metadata code only. The translation and conversion of the natural language query to normative metadata code will be done by the thesaurus based context mediator.</p>

<table bgcolor="lightcyan">
<tr><td>
<b><u>Example</u>:</b><br />
Normative metadata term code - xxxyyyzzz<br />
Standards organization identification - xxx<br />
Product vocabulary identification - yyy<br />
Metadata term identification – zzz<br />
</tr></td>
</table>]]>
        <![CDATA[<p>By assigning alphanumeric codes to the metadata terms the ambiguity due to use of synonyms in the vocabulary can be eliminated. Synonyms can be assigned the same code. The identification of a metadata term in the user query and translation of this term to code may be done by the context mediator. This <u><b>alphanumeric code will encode the metadata context within the code</b></u>, thus connecting the language with the data. This method is analogous to the EPCglobal tag that is assigned to physical "<b>things</b>". Since all web resources <font color="blue"><b>subjects</b></font>, <font color="blue"><b>predicate</b></font> and <font color="blue"><b>objects</b></font> are considered as "<b>things</b>", this alphanumeric code will assign a unique identity to every "<b>thing</b>" on the World Wide Web. Within a web document the metadata term code will still be used as text term, i.e. by including vocabulary namespace and reference with Qname.
</p>

<p>The vocabulary that defines an alphanumeric code for a metadata term can be identified from the code. Introduction of alphanumeric codes for normative metadata terms does not add any processing overhead for context mediators. The context mediators will still process the natural language query to understand the query context and find corresponding vocabulary. The use of alphanumeric codes for metadata terms will enable introduction of new features in database applications; <b><u>intelligent context search algorithms can be built based on these codes</u></b>. The textual terms require an exact match of vocabulary and metadata, whereas alphanumeric code will enable search on sub-code such as "xxxyyy". Thus a two-pass search where first a web document that uses a given vocabulary is found and then the metadata term is searched within the document can be replaced with single-pass search algorithms based on metadata term code.</p>

<p>Consider RDF triples:</p>
<table>
<tr><td>DSLBroadbandService</td><td>BitRate</td><td>_:2Mbps</td></tr>
<tr><td>_:2Mbps</td><td>rateValue</td><td>"2"</td></tr>
<tr><td>_:2Mbps</td><td>unitValue</td><td>"Mbps"</td></tr>
<tr><td></td></tr>
</table>

<table border="1" bordercolor="black">
<tr bgcolor="navy"><th><font color="white">Service_code</font></th><th><font color="white">BitRate_code</font></th></tr>
<tr><td></td><td bgcolor="lightcyan">rateValue_code</td><td bgcolor="lightcyan">unitValue_code</td></tr>
<tr><td>DSLBroadbandService_code</td><td>2</td><td>Mbps</td></tr>
<tr><td>DSLBroadbandService_code</td><td>10</td><td>Mbps</td></tr>
<tr><td>SatelliteBroadbandService_code</td><td>200</td><td>Mbps</td></tr>
</table>

<p>The table has <a href="http://www.nivasgroup.com/wl/NGWeeklyLetter08Jul07.pdf">BitRate</a> column with two sub-properties rateValue and unitValue. These sub-properties are assigned unique code "aaabbb001" and "aaabbb002" respectively, where  "aaa" is the standards organization (e.g. IEEE) that defines the vocabulary for broadband service, "bbb" is the code that uniquely identifies broadband vocabulary amongst other vocabularies defined by this organization and the values "001" and "002" identify the property rateValue and unitValue respectively. The table  stores the actual values of the elements of BitRate class for all instances of broadband service record. A search for "Mbps broadband service" can be limited to search for records with BitRate sub-property "aaabbb002" value "Mbps". A search for vocabulary "aaabbb" is eliminated and direct search for metadata term "aaabbb002" is executed.</p>

<p>The BitRate class may be written as (rateValue_code = aaabbb001):</p>
<table>
<tr><td>
&lt;owl:Class rdf:ID="BitRate_code" /&gt;<br />
&lt;owl:DatatypeProperty rdf:ID="rateValue_code&gt;<br />
	&lt;rdfs:domain rdf:resource="#BitRate_code" /&gt;<br />
	&lt;rdfs:range rdf:resource="xs:integer_code" /&gt;<br />
&lt;/owl:DatatypeProperty&gt;
</td></tr>
</table>
<br />

<p>The disadvantage of these alphanumeric codes for normative metadata terms is that these codes will have to be embedded in the HTML schema of a web page to enable assimilation of information in the semantic web framework. Whereas textual metadata terms would have enabled assimilation of information in the semantic web framework even when metadata terms are not added to HTML schema, this text may be found by pattern matching.</p>

<p><font color="brown" size="+1"><b>Conclusion:</b></font> The advantages of alphanumeric code for normative metadata terms are:<br />
<ul><br />
<li>context is encoded within metadata term code,</li><br />
<li>context search features can be added to database applications,</li><br />
<li>more efficient search applications can be built.</li></p>]]>
    </content>
</entry>

<entry>
    <title>Normative metadata for services</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2007/07/normative_metadata_for_service.php" />
    <id>tag:itgumbo.ebizq.net,2007:/itcopywrite//26.1464</id>

    <published>2007-07-07T07:46:49Z</published>
    <updated>2008-01-21T15:01:44Z</updated>

    <summary>Web users have diverse requirements and seek different information. In order to do this they form different phrases and try to get the best result from a search application. The search query that is composed with the same meaning as...</summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>Web users have diverse requirements and seek different information. In order to do this they form different phrases and try to get the best result from a search application. The search query that is composed with the same meaning as the question generally seeks the best result. To support these diverse requirements most text search engines have been built with the basic algorithm as: best match for the query pattern in the web content. <a href="http://www.itgumbo.com/itcopywrite/2007/06/semantic_technology_skepticism.php">Semantic technologies</a> aim to provide accurate and fast objective results that will not only enhance customer satisfaction but also provide better business opportunities. A customer has to find an appropriate search application that can lead him/her to the best business solution available for his/her requirement. Once the business solution has been found then the online customer interface of the business must be able to answer the customer queries. </p>]]>
        <![CDATA[<h3>What are customer requirements?</h3>

<p>The normative metadata is the solution to standardize these requirements so that comparative search can be conducted in a competitive "<font color="teal" face="comic sans ms"><b>open market</b></font>". While normative metadata definition does not stop web service providers from introducing proprietary metadata, the former is a method to participate and play in the larger framework supported by semantic technologies. Normative metadata for different businesses may be formed based on the product/service characteristics and customer requirements. This normative metadata may be used directly or mapped to internal database structure of the business to maintain service records and business information.</p>

<table bgcolor="lightcyan">
<tr><td>
<u><b>Example</b>:</u><br />
<p>A traveler has many <a href="http://searchenginewatch.com/showPage.html?page=3626338
">questions that must be answered</a> before a travel agency is chosen. The apprehension is about spending a day reading articles to find the most appropriate information source and then the waiting period to get answers from the customer representatives. There is a hidden wealth of metadata terms related to travel business that can be found in this article, such as: family vacation, adventure, cheap hotel, discount ticket, couple only resort, travel directions, hotel check-in and check-out, etc.</p>
</td></tr>
</table>

<p><font color="brown" size="+1"><b>Conclusion</b></font>: Normative metadata terms for service businesses can be collected from the customer requirements and online customer interface. Service records, customer feedback and popular search terms are some of the possible information sources for normative metdata definition.]]>
    </content>
</entry>

<entry>
    <title>Extensibility of normative metadata ontology</title>
    <link rel="alternate" type="text/html" href="http://itgumbo.com/itcopywrite/2007/07/extensibility_of_normative_met.php" />
    <id>tag:itgumbo.ebizq.net,2007:/itcopywrite//26.1463</id>

    <published>2007-07-04T16:56:02Z</published>
    <updated>2008-01-21T15:01:44Z</updated>

    <summary><![CDATA[Consider the following OWL ontology with normative metadata (assumed) provided by a standards organization, this ontology describes the characteristics of a product "Laptop": &lt;owl:Class rdf:ID="Laptop" /&gt; &lt;owl:Class rdf:ID="Speed" /&gt; &lt;owl:DatatypeProperty rdf:ID="LaptopProcessorNumber"&gt; &lt;rdfs:domain rdf:resource="#Laptop" /&gt; &lt;rdfs:range rdf:datatype="xsd:integer" /&gt; &lt;/owl:DatatypeProperty&gt; &lt;owl:ObjectProperty rdf:ID="LaptopProcessorSpeed"&gt;...]]></summary>
    <author>
        <name>Ila Nivas</name>
        <uri>http://itgumbo.com/itcopywrite/</uri>
    </author>
    
        <category term="Web 3.0" scheme="http://www.sixapart.com/ns/types#category" />
    
    
    <content type="html" xml:lang="en" xml:base="http://itgumbo.com/itcopywrite/">
        <![CDATA[<p>Consider the following OWL ontology with normative metadata (assumed) provided by a standards organization,  this ontology describes the characteristics of a product "Laptop":</p>

<table>
<tr><td><font color="navy">
&lt;owl:Class rdf:ID="Laptop" /&gt;<br />
&lt;owl:Class rdf:ID="Speed" /&gt;<br /><br />
&lt;owl:DatatypeProperty rdf:ID="LaptopProcessorNumber"&gt;<br />
	&lt;rdfs:domain rdf:resource="#Laptop" /&gt;<br />
	&lt;rdfs:range rdf:datatype="xsd:integer" /&gt;<br />
&lt;/owl:DatatypeProperty&gt;<br /><br />
&lt;owl:ObjectProperty rdf:ID="LaptopProcessorSpeed"&gt;<br />
	&lt;rdfs:domain rdf:resource="#Laptop" /&gt;<br />
	&lt;rdfs:range rdf:resource="#Speed" /&gt;<br />
&lt;/owl:ObjectProperty&gt;<br />
</font></td></tr>
</table>]]>
        <![CDATA[<p>This ontology is extended by web 3.0 search application provider as follows:</p>

<table>
<tr><td><font color="seablue">
&lt;!DOCTYPE rdf:RDF [ <br />
	&lt;!ENTITY lapt "http://www.example.com/test/laptop#"&gt; ]&gt;<br /><br />
&lt;owl:DatatypeProperty rdf:about="lapt:LaptopProcessorNumber"&gt;<br />
	&lt;rdf:type rdf:resource="owl:InverseFunctionalProperty" /&gt;<br />
&lt;/owl:DatatypeProperty&gt;<br /><br />
&lt;owl:ObjectProperty rdf:about="lapt:LaptopProcessorSpeed"&gt;<br />
	&lt;rdf:type rdf:resource="owl:InverseFunctionalProperty" /&gt;<br />
&lt;/owl:ObjectProperty&gt;<br />
</font></td></tr>
</table>

<p>Different vendors of laptop may define the product laptop as given here and use the normative metadata terms associated with this class in WSD document or web content to assimilate the information into semantic web framework. The vendors may associate names with product range for a given processor. </p>

<table>
<tr><td><font color="blue">
&lt;owl:Class rdf:ID="Vendor1ProductXYZ"&gt;<br />
	&lt;rdfs:subClassOf rdf:resource="lapt:Laptop" /&gt;<br />
&lt;/owl:Class&gt;<br /><br />
&lt;owl:Class rdf:ID="Vendor2ProductXYZ"&gt;<br />
	&lt;rdfs:subClassOf rdf:resource="lapt:Laptop" /&gt;<br />
&lt;/owl:Class&gt;<br />
</font></td></tr>
</table>

<p>The OWL Inverse Functional Property implies that if property P is applied to a &lt;<font color="blue"><b>subject</b></font>, <font color="blue"><b>object</b></font>&gt; tuple such that P&lt;y,x&gt; and P&lt;z,x&gt; then y=z. Here in this example a user query such as: "<font face="comic sans ms">laptop with 2.0 GHz speed</font>" may be converted to a search for the Laptop vendors with support for processor speed or corresponding processor number. The search for normative metadata terms LaptopProcessorNumber and LaptopProcessorSpeed will be converted to application of these Inverse Functional Properties on tuples &lt;vendors, "Intel Core Duo Processor T2500" &gt; and &lt;vendors, "2.0 GHz"&gt;. Thus all web pages and web services found on the WWW for these properties (normative metadata terms) and the given values can be considered to describe "Laptop" product and may be returned in the query result. Example: <font color="teal" face="comic sans ms">all vendors found for LaptopProcessorNumber = T2500 will be considered as laptop vendors</font>.</p>

<p>The product vendors may use different subset of normative metadata terms to describe the products. The web content provider can find the necessary normative metadata terms from the basic ontology provided by the standards organization. These ontologies may also be used by other <a href="http://www.hawaii.edu/coe/departments/ite/documents/LaptopStandard-2007_000.pdf
">agencies</a> to define standards for products to be used within an organization. The web 3.0 search applications may have different search algorithms and hence define different ontologies for the normative metadata terms. The challenge for web 3.0 search application developers will be to define the most optimized and efficient algorithm and ontologies. </p>

<p><font color="brown"><b>Conclusion</b></font>: The provision of normative metadata terms by standards organization will provide the <u>flexibility to define different ontologies with these normative metadata terms</u>. In order to be able to do so the ontologies provided by the standards organization must provide the basic ontology for normative metadata that is extensible by the users of the ontology. The ontology can be extended by using "rdf:about attribute".<p>

<p>Note: Speed class may have properties as value and clock speed.</p>]]>
    </content>
</entry>

</feed>
