Issue 34: October 2000
XML becomes a Recommendation again!
You will be pleased to know that this is not a new XML language but it is significant in
that it factors back into the XML document the 109 Errata that have been
accumulating since the original version was issued.
Many New Candidate Recommendations this Month
October proved a very fruitful month with the following moving to Candidate Recommendation status:
- XBase
- XML Schema Parts 1 and 2
- XML-Signature Core Syntax and Processing
- Canonical XML
- Modularisation of XHTML
XBase
XBase is used by XLink and has similar functionality to the HTML BASE element. Suppose you have in XML:
<root xml:base="http://mycompany.org">
<reference xlink:type="simple" xlink:href="new.xml">what's new</reference>
</root>
The simple link would resolve to http://mycompany.org/new.xml.
The base attribute can be nested so that you have URLs depending on more than one base.
XML Schemas
XML Schema comes in three parts:
- Part 0: Primer
- Part 1: Structures
- Part 2: Datatypes
The Primer is a very good introduction with a good running example throughout the text.
The other two parts are fairly heavy going. Essentially an XML Schema provides a replacement
for much of an XML DTD but with stronger datatyping. Here is a simple XML document:
<exam course="cs203">
<student>Fred Smith</student>
<qapair>
<question>Who is the last King of England</question>
<answer>George VI</answer>
</qapair>
<qapair>
<question>How many queens were named Elizabeth </question>
<answer>Two</answer>
</qapair>
</exam>
This might have a DTD like:
<!ELEMENT exam (student , (qapair)*) >
<!ATTLIST exam course CDATA #REQUIRED>
<!ELEMENT student (#PCDATA) >
<!ELEMENT qapair (question,answer) >
<!ELEMENT question (#PCDATA) >
<!ELEMENT answer (#PCDATA) >
If this was expressed in XML Schemas it might look like:
<schema>
<element name="exam">
<complexType>
<sequence>
<element name="student" type="string" />
<element name="qapair" minOccurs="0" maxOccurs="unbounded">
<complexType>
<sequence>
<element name="question" type="string" />
<element name="answer" type="string" />
</sequence>
</complexType>
</element>
</sequence>
</complexType>
</element>
</schema>
The good news is that it is in XML so that everything you can do with XML you can do with XML Schemas.
The bad news is that they tend to be a bit more verbose. So far, we have not put any further constraints
on the datatypes. But if the Course Names had to be two capital letters followed by three digits,
we would need to add:
<schema>
<element name="exam">
<complexType>
<attribute name="paper" type="course" />
<simpleType name="course" />
<restriction base="string">
<pattern value="[A-Z]{2}\d{3}" />
</restriction>
</complexType>
</element>
</schema>
So one of the additional features that we get with XML Schemas is stronger datatyping
applied to the attributes. Some of the syntax used in the XML Schemas changed quite
late on so you will not see words like "bounded" in earlier versions.
XML Signature and Canonical XML
XML Signature Core Syntax and Processing describes how you would define a digital signature as part of a
document or detached from it. The signature is something like:
<Signature>
<SignedInfo>
<CanonicalizationMethod Algorithm="htpp://..." />
<SignatureMethod Algorithm="http://...." />
<Reference URI="http:www.w3.org/MyDocument.html" />
<Transforms>
<Transform Algorithm="http://www.w3.org/MyXSLTransform.xsl" />
</Transforms>
<DigestMethod Algorithm="http://..." />
<DigestValue>.....</DigestValue>
</Reference>
</SignedInfo>
<SignatureValue>....</SignatureValue>
<KeyInfo>
<KeyValue>
- - - -
</KeyValue>
</KeyInfo>
</Signature>
The SignedInfo section is the thing that is being signed. The requirement is that performing canonicalisation
and the signature method on MyDocument should produce the SignatureValue. KeyInfo indicates the key to be
used to validate the signature. The Transforms allow you to perform an XSLT transformation over the
document so that the part being signed is a subset of the complete document. So, for example, you can
embed the signature in the document yet sign everything other than the signature.
Canonicalization ensures that two documents that are effectively the same will have the
same value when the signature method is applied. The Canonical XML Candidate Recommendation
defines the canonical form of a document. For example, it specifies the order of attributes
in an element, gets rid of white space variations and ensures that null elements are the same
irrespective of the way they are written.
WWW10 Papers Due
The Tenth International World Wide Web Conference (WWW10) takes
place in Hong Kong, China from May 1-5, 2001.
The deadline for refereed paper submission is 13 November so no time to delay!
Membership of W3C Continues to Rise
The total number of W3C Members has risen to 478 (that is 30 more in the last two months). Recent new Members are:
-
Altova GmbH: this Austrian company, previously
called Icon Information Systems, develops and distributes XML Spy, an editor conversant with both DTDs and XML
Schema, and also able to edit and execute XSLT transformations.
-
Argo Interactive Group: this UK company's
main interest is the wireless internet with software to deliver content to most mobile phones
either in WAP or i_Mode format from the same content.
-
Avaya Communication: the enterprise networks spinoff from Lucent Technologies. It provides advanced communications systems for enterprises. Avaya offers converged voice and data, customer relationship management, messaging, and voice multi-service networking.
-
Avenue A: this Seattle based company is mainly involved in advertisment tracking. By looking at the cookie on a user's computer and comparing it to theirr database, Avenue A can see that that user has viewed a particular ad on a particular site at a particular time. If the user clicks on the ad, Avenue A can see that.
-
Barbadosoft BV: this Dutch company is a young company devoted to the development of advanced XML management tools. The Barbados Suite of products will be available early 2001.
-
Blast Radius: this is a Canadian web site design company that has customers like Nike, and has developed sites like www.zelda.com.
It received the Communication Arts Web Site of the Week award for its U-571 site. This is the one about how an American submarine crew carries out a daring mission to capture the Enigma machine from a Nazi U-Boat. It is interesting that the History part of the site does not seem to mention that it was a UK submarine and not a US one that carried out the mission. Having said that, it is an innovative site.
-
Chevron Information Technology Company: the Houston-based gas company that recently merged with Texaco.
-
CollegeNET, Inc.: the US portal for applying to college over the web. CollegeNET lets applicants complete, file, and pay for their admissions applications entirely through the internet. Over 500 US universities use the system for their admissions.
-
Comverse Network Systems Ltd: based in Woodbury, New York, it develops, manufactures and markets digital recording (including video recording) and quality monitoring solutions for call centers, government and law enforcement organizations.
-
Contivo, Inc.: based in Palo Alto, Contivo, provides auto-mapping solutions for B2B trading partner networks and eCommerce. Contivo's web service hosts a B2B thesaurus which defines and maintains common business content between trading partners. Representations of content, for business documents such as purchase orders, are managed through a rules dictionary and synonym library. The patented software looks for matches between the business documents and applies the appropriate rules when matches are found. This is known as auto-mapping. The thesaurus is self-learning and constantly growing with each deployment. Its knowledge-base can be extended to all participants of the Contivo eService network.
-
Defense Logistics Information Service (DLIS): US government Agency
dedicated to providing worldwide logistics support throughout the Department of Defense. The primary focus
of the Agency is to support the war fighter in time of war and in peace and to support relief effort during times of national emergency.
-
I-DNS.net International Inc.: provides multilingual internet
technology solutions (the name stands for Internationalised DNS) that allow people to register and
use domain names in their language of choice. For example, it is possible to register Japanese
domain names ending with
and
(corresponding to .com and .net) and
have them used either in Japanese or English.
-
iGeneration: based in San Francisco, their mission is to close the Internet skills gap by empowering a new generation to dream, architect and build the Internet era.
-
Internet Academy, Inc.: the first school in Japan specialising in the Internet.
-
Milo: a San Francisco-based company specialising in the design and development of Voice Web business solutions including Voice UI and dialog design, grammar development, VoiceXML engineering, application tuning and platform integration.
-
Open eBook Forum: an association of hardware and software companies, publishers, authors and users of electronic books whose goals are to establish common specifications for electronic book systems.
-
Popkin Software & System, Inc.: this New York based company's main product is System Architecture 2001, which offers a set of interrelated models and information matrices that enable users to model the entire enterprise from various perspectives - from high-level business objectives and hierarchical makeup to event-driven processes and data warehousing. It provides full support for UML 1.1, OMT, Booch and Jacobson Use Cases.
-
Savage Software: this Vancouver-based company markets a Scalable Vector Graphics (SVG) toolkit.
-
SmartPipes, Inc.: this Californian company markets SmartPipes Global IP Service which enables corporations to manage heterogeneous networks as one seamless logical network via advanced web-based interfaces, while providing a comprehensive set of integrated advanced IP Services.
-
Spatial Identifier AG (SID):
-
Symantec Corporation: the internet security company that sells the Norton utilities.
-
TIBCO Software. Inc.: this Palo Alto based company
provides e-business infrastructure through its ActiveEnterprise (business process integration and automation ),
ActivePortal(information aggregation and personalized interactivity via the Web and wireless devices ),
ActiveExchange (B2B commerce with other enterprises and through marketplaces) software.
-
Ubicco: the Ubiquitous Computing Company - is the Fi SYSTEM Group's Mobile Internet subsidiary.
UBICCO's main objective is to assist companies with the distribution of web content to non-PC terminals (both WAP and i-mode mobile phones, PDAs,
embedded computing in vehicles and public places, interactive TV set-top boxes etc)
-
Verascape: based in Illinois, Verascape develops complete voice systems for companies extending Web access to wireless and wireline phones.
-
Vrije Universiteit:this Dutch university is involved with the IBROW project to develop intelligent brokers that are able to distributively configure reusable components into knowledge systems through the Web.
-
Western Australian Dept. of Commerce and Trade, Office of Information and Communications: the role of OIC is to lead, facilitate, co-ordinate and work with agencies, businesses and communities, in order to maximise Western Australia's transformation to the Information Age.
-
XML Global Technologies, Inc.: this Vancouver-based company is a provider and developer of highly scalable XML based e-business software solutions designed to allow organisations to optimise information integration, retrieval and manipulation. It markets an XML-based search engine goXML and expressXCHG software aimed at maximising the effectiveness of information exchange while minimising capital expenditures. Ken Holman of Crane Softwrights Ltd is on its Technical Advisory Board.