Lecture 9: XML

Title slideFrom the strict rectangles of structured data to the more generous triangles of semistructured data. This morning’s lecture gave an overview of what might qualify data as semistructured; trees in general as a mathematical model of data; the particular form of trees in the XPath data model; and their textual respresentation in XML — the Extensible Markup Language.

Finally, some examples of real XML data: from musical scores to financial trading.

Link: Slides for Lecture 9


  1. Read XML Essentials from the World Wide Web Consortium (W3C).

    Link: XML Essentials — What is XML? What is XML used for?

  2. Read Sections 2.1–2.5 from Chapter 2 of Møller and Schwartzbach. I have sent a scanned PDF of this chapter to all students by email; there will also be printed copies outside the ITO in Forrest Hill; and you can find the whole book in the Library HUB.

  3. Find an SVG file and open it in a text editor to study its XML content.

  4. Find a Microsoft Office .docx file and look at the XML content in that. This format (OOXML) is in fact a zipped archive of XML files, so you will need to unzip it first. Depending on your platform, this may require renaming the .docx extension as .zip

    Link: Wikipedia on Microsoft’s OOXML format


To learn more about XML, try any of the following.