The Semantic Organization: Knowing What You Know
How do we get from here to there?

Corporations have a tremendous amount of stored information. On top of this, new information is being created every day. A small but critical portion of this information is stored in highly structured and well-defined formats in relational databases. However, most of the information is on paper, in e-mail, in word processing documents, in spreadsheets, in PDF files, in engineering diagrams, and so on.

Ever since the initial XML draft in 1996, there has been an ongoing discussion of the semantic Web. A Google search for the exact phrase "the semantic Web" returns about 1.2 million Web pages, so the topic clearly continues to attract attention. However, the semantic Web itself is still a work in progress, mainly because very little information on the Web has been semantically tagged. It may be more prudent to start on a smaller scale than the Web.

There is clear precedent for this. Web services and the underlying technologies (UDDI, WSDL, SOAP) were all originally intended for the Web at large. However, they have been most successfully implemented inside organizations. Similarly, it makes sense to create a semantic organization rather than taking on the whole Web.

Here are two examples of what a semantic organization might look like:

  • A new RFP comes in to an advertising agency. It is from a consumer goods company that is looking to launch a new food product. The RFP manager could quickly locate similar RFPs that the agency has received, similar responses, similar project-related documents, and resumes of people who have worked on similar projects. All of these documents would be returned with the relevant passages highlighted. Documents for the launch of a food-grade plastic container would be successfully filtered out. In most organizations today, gathering all of this information can be half of the effort, and many relevant documents can be overlooked.
  • A human services agency has been asked to provide a summary of the impact of a change in eligibility laws. Searching all case documents provides a summary of the clients who will be impacted by the proposed changes. The specific combination of attributes that is needed can be searched for directly: for example, people who have lived in the community for more than 10 years, who have worked for more than 15 years, and who have one of several diagnosis codes. Without semantically tagged case documents, this would require a manual search of the documents.
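
The second example can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the element names (`case`, `client`, `yearsInCommunity`, `yearsWorked`, `diagnosisCode`), the diagnosis codes, and the sample documents are all hypothetical, and a real agency would use whatever vocabulary its common model defines.

```python
import xml.etree.ElementTree as ET

# Hypothetical semantically tagged case documents (illustrative vocabulary only).
CASE_DOCUMENTS = [
    """<case id="c1"><client>
         <yearsInCommunity>12</yearsInCommunity>
         <yearsWorked>20</yearsWorked>
         <diagnosisCode>D10</diagnosisCode>
       </client></case>""",
    """<case id="c2"><client>
         <yearsInCommunity>8</yearsInCommunity>
         <yearsWorked>25</yearsWorked>
         <diagnosisCode>D10</diagnosisCode>
       </client></case>""",
    """<case id="c3"><client>
         <yearsInCommunity>15</yearsInCommunity>
         <yearsWorked>16</yearsWorked>
         <diagnosisCode>D99</diagnosisCode>
       </client></case>""",
    """<case id="c4"><client>
         <yearsInCommunity>30</yearsInCommunity>
         <yearsWorked>40</yearsWorked>
         <diagnosisCode>D20</diagnosisCode>
         <diagnosisCode>D99</diagnosisCode>
       </client></case>""",
]


def is_impacted(case_xml, diagnosis_codes, min_community_years=10, min_work_years=15):
    """True if the client exceeds both year thresholds and has a listed code."""
    client = ET.fromstring(case_xml).find("client")
    years_in_community = int(client.findtext("yearsInCommunity"))
    years_worked = int(client.findtext("yearsWorked"))
    codes = {element.text for element in client.findall("diagnosisCode")}
    return (
        years_in_community > min_community_years
        and years_worked > min_work_years
        and bool(codes & set(diagnosis_codes))
    )


def impacted_case_ids(documents, diagnosis_codes):
    """Return the ids of all case documents matching the eligibility query."""
    return [
        ET.fromstring(doc).get("id")
        for doc in documents
        if is_impacted(doc, diagnosis_codes)
    ]


print(impacted_case_ids(CASE_DOCUMENTS, {"D10", "D20"}))  # -> ['c1', 'c4']
```

Because the attributes are tagged rather than buried in free text, the query is a simple filter instead of a manual read-through of every case file.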

Seven key areas are needed to support a semantic organization. The first is a common model of the organization. The remaining six are data tagging, data location, data relationships, data access, data storage, and data transformation. XML can play a role in three of these areas: data tagging, data location, and data relationships. XML is not the correct tool for the common model, data access, data storage, or data transformation. (See Table 1.)

How do we get from here to there? The first step in implementing a semantic organization is to look for the low-hanging fruit. There are numerous areas in any organization where simple tagging of some information will provide a tremendous benefit. For example, a company could tag all of the proposals that it has sent out with some simple information such as company, project, and type of project. In these cases, the common model would be informal. Going beyond this will require a greater effort and support from vendors. The biggest challenge is the creation of a common model.
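
As a rough sketch of this kind of low-hanging fruit, the following Python builds a small XML catalog that tags each proposal with its company, project, and type of project, and then looks proposals up by type. The element names, file paths, and company names are hypothetical, chosen only to illustrate an informal common model.

```python
import xml.etree.ElementTree as ET


def tag_proposal(path, company, project, project_type):
    """Build a small XML metadata record pointing at one proposal document."""
    record = ET.Element("proposal", attrib={"href": path})
    ET.SubElement(record, "company").text = company
    ET.SubElement(record, "project").text = project
    ET.SubElement(record, "projectType").text = project_type
    return record


def find_by_type(catalog, project_type):
    """Return the locations of all proposals tagged with the given type."""
    return [
        proposal.get("href")
        for proposal in catalog.findall("proposal")
        if proposal.findtext("projectType") == project_type
    ]


# Hypothetical proposals; in practice these records would be created
# when the document is written or sent out.
catalog = ET.Element("proposals")
catalog.append(tag_proposal("proposals/acme-snack.doc", "Acme Foods", "Snack bar launch", "product launch"))
catalog.append(tag_proposal("proposals/globex-site.doc", "Globex", "Web site redesign", "web design"))
catalog.append(tag_proposal("proposals/initech-drink.doc", "Initech", "Energy drink launch", "product launch"))

print(find_by_type(catalog, "product launch"))
```

Even three informal tags are enough to answer a question such as "which proposals covered product launches?" without opening each document.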

There are a number of tools on the market that support one or more of the areas required for a semantic organization. However, they tend to be special-purpose tools that require extensive setup. The process of tagging, storing, and retrieving documents should be built into the tools that we use every day. It should be a basic part of what everyone does.

Getting there will give organizations tremendous insight into what they know. All of the information that is being generated will be widely available and useful. The data will be unlocked and available to benefit the organization. Organizations will be able to know what they know.

About Michael Wacey
Michael Wacey is a partner with CSC Consulting and has been involved in the data processing industry since 1982. He has worked as a CTO, CIO, and project leader in numerous areas, including the telecommunications, pharmaceutical, chemical, and financial industries.

Reader Feedback

I'd re-edit this sentence from para 2
"This is mainly because very little information on the Web has not been semantically tagged"
Check that "not"

Trackback Added: Starting Small via the Semantic Organization; Michael Wacey argues in The Semantic Organization: Knowing What You Know that corporations have a tremendous amount of stored information and are likely to be the early adoption point for semantic Web capabilities, similar to the ways in which corpora...

Can we have more Semantic Web articles like this? Very informative.

