May 02, 2008   Sign In |  About ebizQ |  Contact Us |  Join ebizQ Gold Club

ITGumbo: spicing IT up

IT Copywrite

Technology and application of technology.

ebizQ presents ITGumbo: a spicy blog network where vendors and IT professionals share ideas about creating Business Agility.

Simple steps to add RDFa to web content

In many web content publishing tools it is not possible to modify the contents of <html> and <head> element. Therefore it may not be possible to mention correct Document Type (DOCTYPE) and Content Type (@http-equiv) values. However it is still possible to add RDF normative metadata to your web content. Web content annotated with RDFa is scanned for RDF triples by major Search Engines such as Google and Yahoo!.

» Continue reading Simple steps to add RDFa to web content.

Search strategies to explore the invisible web

Google has announced a new search strategy to explore the invisible web. Invisible web comprises web documents that have not been indexed yet. Google's new search strategy is to retrieve invisible information from the web sites that use HTML forms by providing input data to HTML input controls. As mentioned in the post that announces this search strategy they shall honor the privacy policy of a web site by processing robots.txt file and META tag 'ROBOTS' and avoid HTTP requests that require user information processing.

» Continue reading Search strategies to explore the invisible web.

Linked Data for Search Engine Optimization

Search Engine Optimization (SEO) is a technique applied to increase the rank of a page in Search Engine Result Page (SERP). SERP is the list of URIs found for the user query and rendered in descending PageRank order on a single web page. Since most web users shall traverse the URIs on the first SERP or may be next few, most web content providers would want that their web page URI is included in the first few SERPs.

» Continue reading Linked Data for Search Engine Optimization.

Advertisement

FOAF a factor for strong authentication

A FOAF file is a mechanism to publish one's social and professional profile. It is also used to publicize one's social and professional network, the people one knows besides work duties and the people one collaborates with on project or in an organization. FOAF vocabulary properties foaf:knows and foaf:person are used to document these relationships on the web.

"The foaf:knows property relates a foaf:Person to another foaf:Person that he or she knows" [1].

If one suggests in his/her FOAF file that he/she knows this person (linked with foaf:knows property), the other person is under no obligation to accept that he/she is socially or professionally associated with the FOAF file owner (value of property foaf:name in FOAF file).

"If someone foaf:knows a person, it would be usual for the relation to be reciprocated. However this doesn't mean that there is any obligation for either party to publish FOAF describing this relationship" [1].

Advantages & risks of FOAF file

"You probably know hundreds of people, yet might only list a few in your public FOAF file. That's OK. Or you might list them all. It is perfectly fine to have a FOAF file and not list anyone else in it at all" [1].

Is it okay to create a long list of acquaintances in the FOAF file by adding foaf:knows? It may be okay to do so until no other web application is developed that may use foaf:knows links for purposes other than being informed about one's social & professional standing. As soon as FOAF becomes popular amongst masses it may be used to establish credibility of a person, to establish trust and to authenticate a person. The name FOAF is expanded to "Friend-of-a-Friend", FOAF specification suggests:

"The name was chosen to reflect our concern with social networks and the Web, urban myths, trust and connections" [1].

A hyperlink to one's FOAF file may be embedded by anybody in the web content in order to add a reference to the person. The properties such as foaf:Project and foaf:Organization may be used to find a list of collaborators for a project or an organization. It is not necessary to include these people in the FOAF file with foaf:knows property, a SPARQL query on foaf:Project may be used to find people working on the same project. Many tools can be built to take advantage of this personal information in order to build and find trustworthy connections.

Strong authentication with FOAF

If two people trust each other they can project this information in their respective FOAF file by including each other's FOAF file IRI with foaf:knows property. This reciprocation is a way of establishing trust in each other and this trustworthy connection may be used for authentication. It may be used for responsibility delegation, endorsement and collaboration, etc.

A digital certificate that is used for strong authentication constitutes more than one factor. FOAF Identity (i.e. FOAF file IRI) may be used as one of the factors in the construction of a digital certificate. If 'A' and 'B' trust each other and agree to endorse each other's communications on a certain matter they may get a digital certificate with 'A' and 'B' FOAF Identify as one of the factor for strong authentication. The certification authority shall validate the authenticity of the suggested relationship between the two with the presence of reciprocated FOAF Id with foaf:knows property in 'A' and 'B' FOAF files respectively. The suggested relationship shall be verified by the certificate authority every time it receives a request to verify and decrypt the message encrypted with the issued private key. This verification shall ensure that the suggested trust relationship between 'A' and 'B' is not broken, i.e. digital certificate is valid only until this suggested trust relationship is present in both FOAF files. Example: If either 'A' or 'B' removes foaf:knows entry for the other then the 'knows' relationship is broken; if either 'A' or 'B' changes foaf:Project value then they are not working on the same project anymore.

The relationship may be suggested by specifying other FOAF properties as well, such as: foaf:Group, foaf:Project, foaf:member, etc. The relationship may also be rendered by the user agents such as browsers and e-mail clients in the same manner as VeriSign EV SSL shows certification authority in the green highlighted browser address bar. In e-mail client 'From' bar the relationships may be shown in a drop-down menu.

Conclusion: FOAF data is public information it may be used for strong authentication. If FOAF data is included in the construction of a digital certificate, other private factors must also be included for autentication of FOAF data. The other possible factors that may be used in the construction of digital certificate are: passport number, driving license number, etc. FOAF data may add authenticity to the information by virtue of relationship between the information originator and the foaf:knows people or other FOAF properties.

References:
[1] Miller, Libby. & Brickley, Dan. (2007). FOAF Vocabulary Specification 0.91.

Is Calais an alternative to RDFa annotation?

Reuters have launched a web service for annotation of content on web, they call it Calais.

We want to make all the world's content more accessible, interoperable and valuable. Some call it Web 2.0, Web 3.0, the semantic web or the Giant Global Graph - we call our piece of it Calais [1].

What Calais web service does?

This web service will accept the text content and scan it to find data for semantic annotation. The service shall find the appropriate metadata for semantic annotation. There is a provision that the content provider can suggest a vocabulary to be used for semantic annotation. The service shall store the RDF triples generated from the semantic annotation in a central repository and provide a Globally Unique Identifier (GUID) to the content provider. The service shall also provide these RDF triples to the content provider.

What should Calais web service user do?

The content provider who will use Calais web service must proivde the returned Calais GUID to those who need RDF triples corresponding to the published content. Any web application that needs RDF triples for the published content will find these triples in Calais central repository by providing the GUID.

Using the Calais GUID, any downstream consumer is able to retrieve this metadata via a simple call to Calais [1].

The content provider may also use RDFa to include the metadata terms from the RDF triples returned by the Calais web service for semantic annotation of the content.

What are the advantages of Calais web service?

The Calais web service has following advantages:

  1. It shall find appropriate metadata for semantic annotation of the input content. The content provider does not have to search for appropriate vocabularies and metadata for semantic annotation of the content.
  2. It shall generate and store RDF triples for the content in a central repository. The content provider does not need to include GRDDL transformations for RDF generation.

Conclusion: The Calais web service provides RDF triples for annotable data in the input content. The Calais web service does not use RDFa for semantic annotation of the content. The web content provider must hand annotate the web content by using RDFa and metadata from RDF triples returned by the Calais web service. An unanswered question is whether Calais web service will use normative metadata from a standard ontology. Use of RDFa and normative metadata are key to building semantic web/Giant Global Graph. The use of GUID may not be considered as a replacement to GRDDL transformation; the former will require that every time the content is updated the RDF triples stored in Calais repository must be updated and if GUID is changed then all users of this GUID must be notified. By using GRDDL transformation instead of GUID the users of RDF triples need not be notified about any changes to the content. The web server cache may store the latest updates to RDF triples.

References:
[1] Overview. 2008. Calais. <http://opencalais.mashery.com/Overview>