Consider ISP has following services:
Dial-up, DSL, ISDN, Satellite, and WLAN. ISP defines the following characteristics of every service: Connection Type (wired or wireless), Connection Speed ( 56Kbps, 128Kbps, 256Kbps, 512Kbps, 1Mbps , etc), Installation Cost (Fixed Charges, Modem Cost), Subscription Cost ($25/Month, $5/4GB, etc)
| HTML | XML | RDF |
<table>
<tr>
<th>Speed</th><th>Unit</th>
<th>SCost</th><th>SType</th>
</tr>
<tr>
<td>256</td><td>Kbps</td>
<td>$5</td><td>monthly</td>
</tr>
<tr>
<td>2</td><td>Mbps</td>
<td>$60</td><td>monthly</td>
</tr>
</table>
|
<ServicePackages>
<Package>
<Speed unit="Kbps">256</Speed>
<SubsCost unit="Monthly">5</SubsCost>
</Package>
<Package>
...
</Package>
</ServicePackages>
|
<rdf:Description rdf:nodeID="PkgWebRes">
<cdef:Speed rdf:nodeID="SpdWebRes">
<cdef:SubsCost rdf:nodeID="ScostWebRes">
</rdf:Description>
<rdf:Description rdfnodeID="SpdWebRes">
<cdef:BitRate rdf:nodeID="BRWebRes">
</rdf:Description>
|
| formatting for display |
storage layout and logical structure |
web resources metadata |
HTML
provides formatting to the information that is displayed on the web page.
XML
provides a method to structure this information into entities that have attributes: e.g. Characteristics of a service. Each entity is described in the XML document as an element, attribute specifications and child elements. Document Type Definition (DTD), HTML and Style Sheet provide the formatting for the XML structured information. The advantage of XML structured information is that each element can be accessed and processed individually like a database record. The element information can be exchanged in protocol messages and the element information can be used to operate relational database. Before XML, the web page content existed in HTML format that was not easily accessible for database operations.
The World Wide Web is a web of such HTML and XML documents. The XML entities are accessible within a limited scope, e.g. the credit card information is exchanged online with the finance service (bank), and the secure protocol connections between the consumer, merchant and the bank ensure information security. The e-shop product information and the purchase order are communicated between the consumer and the merchant. XML has made it possible to exchange information within this limited scope.
The unresolved issue is the time spent by the consumer in locating the best product.
An e-shop may provide a sort mechanism or a search tool. However the scope is limited to e-shop.
The DSL service packages that were listed in tabular form with HTML are entered as database records by XML definition; the information is present for the Search Engine as a text document. If every element identified in the XML document can be considered as a web resource with an unambiguous URI then a metadata can be defined for this web resource. This metadata will empower the Search Engines to make an intelligent search. Consider the service provided by the ISP as a web resource and the characteristics of this service as a metadata of this web resource. With speed and cost as data values in the metadata a Search Engine can locate the best service.
RDF will define the DSL service package as a web resource with associated metadata. Mention of a phrase “256Kbps” in the search query will tell Search Engine to query all metadata with '256Kbps' value in the BitRate field. A more advanced Search Engine can then compare the SubscriptionCost in the results. The semantic web provides meaning to the phrase '256Kbps'; rather than listing all web pages with the text phrase '256Kbps', the Web 3.0 Search Engine will list only those pages that have this phrase in the BitRate field of metadata.
Resource Description Framework (RDF) is a language to represent the entities as web resources with the associated metadata. The challenges are:
- Identification of web resources
- Standard definition of metadata
- Definition of common namespace
- ...
Swoogle is a search engine that indexes all URI with a given term in the URI, all documents that contain this term in the metadata. E.g. search term “CDMA”, Swoogle will index all http://*.CDMA.*, http://*.*.*/*/CDMA/* type of URI and all documents with “CDMA” in metadata. SHOE is another Semantic Search project that works on metadata and does not depend on keyword density only for indexing web pages.
Please refer W3C specifications for correct language syntax.