PB138 - Markup Languages Tomas Pitner March 22, 2013 To i • Query language for searching and extraction of XML nodes (elements, attributes) from a document and for an output XML document construction. • The XQuery is a basic XML query language at present time (and it seems in the future as well). • The W3C specification since March 2011, see http://www.w3.org/XML/Query. • Base on XPath 2.0 data model, operators and functions. • Supported by main database engines producers (IBM, MS, Oracle, etc) and where not) XQuery domain are: • queries, where extraction (selection) part is more complicated than the construction part. • Use the XSLT in the opposite case • or using the more general API (using DOM manipulation for example). To i Example of source document, XML Queries on it and their results. Petr Novak 1969-05-14 novak@myfriends.com characteristics lang="en">Very good friend Jaroslav Novaeek 1968-06-14 novacek@myfriends.com Another good friend< Figure: XQuery to the source documents. Task: "extract all surnames in the addressbook". Query is XPath expression - selects all lastname elements, doc('myaddresses. xml;)/addressbook/person/lastname sing Saxon 9.0j XSLT processor Saxon contains the XQuery processor since version 8.x as well. To process XQuery you need: • to install Saxon 9.0.0.4J for example ("j" means implementation in Java, there is a .NET implementation as well) by unpacking to folder c: /devel/saxon9-0-0-4j for example. • Change working directory to the folder: cd c:/devel/saxon9-0-0-4j • put the above mentioned query into a file (lastnames .xq). • store the above mentioned XML document containing "addressbook" into the file myaddresses .xml in the same directory. • Run: Java -classpath saxon9.jar net.sf.saxon.Query -o result.xml lastnames.xq from command line. The query to above mentioned document will create the file result.xml its content follows: Novak Novacek Horak Polak To i FLWOR is an acronym of an XQuery structure. It means: (F)or Initial query part that specifies query cycle including control variable. Results of XPath expression behind the keyword "in" are assigned to the variable. (L)et You can assign values of next variable that can be used later in this section. (W)here specifies selection condition ie. which nodes (values) selected by for section will be used. The condition can utilize the variables defined in the " let" section. (O)rder Defines how the nodes should be oredered. (R)eturn Defines what is returned, constructed from extracted nodes (values). e Condition used to select requested nodes can be specified either in an XPath expression in "for" clause or in the "where" clause. "Return Mr. Polaks birthdate." for $person in doc(;myaddresses.xml;)/addressbook/person where $person/lastname=)Polák5 return $person/date-of-birth Query returns: 1980-02-28 AXO .x • install (extract) Saxon with version 7.0 at least (8.x, 9.x) into some directory • change working directory to the Saxon directory and • run: Java -classpath saxon9.jar net.sf.saxon.Query -o result.xml queryfile.xq from command line. • There is a .NET Saxon implementation (means .DLL and .EXE files) Native XML database systems support XQuery as a query language often. Native XML Databases are for example: • Berkeley DB XML (http://www.sleepycat.com/products/index.shtml) • eXist (http://exist.sourceforge.net/) To i