Archives for: January 2005

01/13/05

Permalink 09:32:23 pm, Categories: All, 550 words   English (US)

Semantic Web: A different perspective on what works and what doesn't

Peter Norvig's view published on AlwaysOn seems to be colored by a decidedly web search engine perspective. If we start looking at Enterprise Semantic Applications (semantic applications developed for targeted enterprise/corporate/scientific/engineering user base, whether the data comes from the enterprises, or is a syndicated/licensed content, or is a open Web content), you can start to see some exciting alternative perspectives and realties. Let us review Peter's belief that the point that there is not enough RDF and SW content. This is growing at an extremely rapid pace (I am sure others will put out numbers; and there are specialized search engines such as Swoggle that focus on Semantic Web content with rapidly growing size of indexed documents: check ). More importantly, the promise of Semantic Web is closely tied to having the tools for semantic annotations of heterogeneous content, i.e., create semantic metadata automatically. This is much easier to do when you have high quality domain ontologies that bound the scope of automatic extraction. And I really do not see content suppliers putting the metadata in (as we do not see Web page authors using metatags), or at least this will be optional and just one form of input. Instead, metadata will be created with respect to potential use (e.g., there are some definite concepts when we deal with WorldNews, USNews, TechnologyNews, and so on). Commercial technologies (example) can process millions of pages per day and extract semantic metadata, and all these can be represented as RDF (and that is a good idea because of the benefits esp. for high end semantic applications such as analytics). Granted this is hard to do on a panWeb scale where you have no single domain or even a limited set of domains, and huge diversity of users who may need to see the content from different perspectives. Even here, I believe much can be done, but it will take a little more time - maybe 3 years.

Let me next respond to the comment about ontologies. There are many cases within Enterprises and even for consumer applications (e.g., see examples; also expect to see use of Ontologies soon by Amazon and this types of companies). Rather than focusing on common sense or general purpose ontologies, the immediate future is with domain and task/application specific ontologies (latter are appropriate for enterprises, eg., SOX or Anti-Money Laundering (AML) applications). This is done now successfully, with line of business applications (e.g., AML application deployed at one of the largest banks). These types of ontologies routinely have millions of instances (look at SWETO, NCI ontology, GlycO, as very different types of examples: think of SWETO with about a million instance as a poor sample of what is being done for real world Enterprise class Semantic Applications representing only a 10th of population and with diffused focus that a typical enterprise deployment; with 767 classes, think of GlycO has well-focused scientific ontology developed by domain experts; NCI ontology involves more community involvement). Additional thoughts on SW adoption are in my earlier item on this blog.

Postscript: Just before posting this, I stumbled across Danny Ayer's views in response to Peter Norvig's item; I agree with those view almost completely. After my initial posting, I came across another set of comments by Tim Finin.

Permalink

01/07/05

Permalink 03:07:54 pm, Categories: All, 1286 words   English (US)

Why are we still pushing Semantic Web?

This was the question a panelist asked at the W3C Advisory Committee meeting that I attending at the beginning of December 2004. In other words, the panelist and others discussing this question were wondering, why is it taking so long for the industry to get it (its importance)? Or that, by now, we would have expected it to have seen much wider adoption, a clear indication that the Semantic Web is here for good, transforming the Web into its next logical incarnation.

The essence of my comment at that time was that the rate of progress is quite robust and pervasive, and there are prominent signs that the Semantic Web is not just a fad, that this time, semantics as applied to information (which predates the Semantic Web as defined today) is indeed likely to affect many businesses in not-too-distant a future, and even common Web user in intermediate future. Here is an extended perspective on the adoption of the Semantic Web, which also incorporates a nice dinner discussion that some of the Semantic Web technology/product vendors (who are members of the W3C) had with the W3C Semantic Web team members (Eric Miller and others).

Research

Although funding from NSF, DARPA and the premier funding agencies have now waned, DAML program gave excellent and timely start to the Semantic Web research in the US. The funding initiative moved to Europe with Framework V, and is firmly entrenched with Framework VI. The number of new conferences, conference attendance, sessions related to the Semantic Web in older and more established conferences, number of published papers and new scientific journals devoted to the Semantic Web (such as Web Semantics, Semantic Web & Information Systems, and Applied Ontology) all point to broad and increasingly entrenched interest in this new area.

Standards

One of the nicest things that have happened to our area is timely standards activity. Note the emphasis on "timely", as it is helpful to have basic standards before the area matures and before industry interest peaks, reducing the chances of clashes between the entrenched interests. Not having activities being taken to competing standards bodies, as is the case in Web Services area, helps too.

Technology and Products

One of the most exciting things to have happened in our area is the number of technologies commercialized from academic research (Taalee's MediaAnywhere A/V Semantic Search and Semagix's Freedom from University of Georgia's SCORE technology, Network Inference's relationship with University of Manchester, Ontoprise's relationship with Karlsruhe, to name a few). Now, at least twenty vendors claim to use or support Semantic Web technologies, and the list is growing quite rapidly. And perhaps most importantly, scientific and business communities are building targeted (i.e., with clear purpose) and large ontologies at an impressive pace.

Industry Recognition

The informative panel at the W3C 10th anniversary celebrations (http://www.w3.org/2004/09/W3C10-Program.html) on the "Web of Meaning" illustrated how the thought leaders and industry executives buy into the vision of the Semantic Web. Panelists Tim O'Reilly (O'Reilly Media, talk) and Bill Ruh (Cisco Systems, talk) presented a fairly encouraging perspective on how Semantic (Web) technologies are needed for key applications, such as Regulatory Compliance, B2B Exchange, Workflow and BPM, and Business Intelligence. What is interesting is that some of these are "selling aspirin" rather than "selling vitamins", something that does better in low to moderate economic growth environments.

I would add several other fields of rapid adoption, including life sciences (see the W3c workshop in Semantic Web for Life Sciences), bioinformatics, healthcare, content management, national intelligence and homeland security. Just look at the number of large ontologies that cover the broad range of schema size, descriptionbases (instances) and expressiveness of representation, developed by community or a small number of domain experts, that are now being put to practical use. Some illustrious examples are NCI Cancer Ontology with over 17,000 concepts, or GlycO ontology for complex Carbohydrates with 767 Classes that is up to 11 levels deep and utilizes all expressive power of OWL, or ontologies with over 10 million instances developed for enterprise semantic applications using Semagix Freedom. Researchers interested in finding ontoloiges to play with can consider TAP or SWETO that are based on real-world facts, or get their hands on software to generate synthetically generated ontologies.

At the industry events, such as those organized by TopQuadrant and MITRE, or the user group initiated events, such as those for the US Department of Defense or the Life Science Community, 100 to 300+ people have shown up, which indicated fairly high level of industry and user group interest.

Industry Deployment and Early Successes

Since some very early deployment examples that were discussed at the WWW2004 Developer's day, there are now increasing number of examples of deployments both in Enterprises (e.g., see my KMWorld talk) or for more 'common' web users. It is this topic what garnered the main attention during our dinner discussions (mentioned above). One exciting observation that came up is the stealth inclusion of the Semantic Web technologies in applications. Eric Miller gave the example of Creative Common's use of RDF (also see Shelly Parker's earlier article). This is an example of simpler SW applications involving embedding license metadata and validating it so millions of content items would in essence be using at least limited Semantic Web technology for enforcing licenses! Another example is that of semantic annotation of syndicated contents and Web Services (e.g., the WSDL-S semantic proposal (early draft, currently being revised in an academic-industry partnership) and corresponding tools (e.g., MWSAF and ASSAM) for annotation of Web Services). Such applications can quickly lead to a wide spread and pervasive use of RDF in a fairly short time. What is interesting is that some of the applications are not being deployed by early adoptors; instead the SW technologies have been part of the pain killer types of main-stream IT applications and solutions (such as Anti-Money Laundering, compliance and risk management)! Enecdotal successes are starting to come. For example, a compliance related semantic application(implemented with a semantic technology platform from Semagix) is live at one of the largest banks in the world in the line of business. And I have heard of companies such as Amazon inserting ontologies in their main stream applications, so we can expect to see large scale consumer centric applications exploiting essential components of Semantic Web in the near future.

Final thoughts

One perspective that some in the community, particularly Tim Berners-Lee-TBL, seem to promote it that Semantic Web is "not interesting in the smaller scale". As more and more things connected by a "semantic way" it becomes more and more important. This makes sense from the perspective of global scale Web and non-enterprise applications. But from an industry perspective, I believe Semantic Web is equally interesting at the intra- and inter-Enterprise scales, and for Enterprise applications. This view is the same as the adoption and importance of Web technologies in Intranets. If at all, given the ability to constrain or limit the domain, deeper domain semantics can be put to use, agreements to build ontologies can be reached faster, industry specific metadata standards can be readily used, and facts and knowledge to populate ontologies can be obtained more easily. Today's enterprises have millions of documents, and access to massive amounts of high-quality or targeted syndicated contents and data (e.g., through Lexis-Nexis, ChoicePoint, NewsML and RSS News Feeds, and so on). The ontologies developed to support targeted enterprise scale Semantic Applications are currently exploiting ontologies with millions to tens of millions entity and relationship instances. And yes, the promise of scaling these Enterprise and industry scale islands by interconnecting them (and achieve what TBL called network effect) exists anyways.

Permalink

Amit Sheth's Blog on Semantic Web: Now and Future

January 2005
Mon Tue Wed Thu Fri Sat Sun
<<  <   >  >>
          1 2
3 4 5 6 7 8 9
10 11 12 13 14 15 16
17 18 19 20 21 22 23
24 25 26 27 28 29 30
31

Search

Misc

Syndicate this blog XML

What is RSS?

powered by
b2evolution