PHP XML Expat Parser
The built-in Expat parser makes it possible to process XML documents in PHP.
What is XML?
XML is used to describe data, and its focus is on what the data is. The XML file describes the structure of the data.
In XML, there are no predefined tags. You must define your own tags.
To learn more about XML, please visit our XML Tutorial
What is Expat?
To read and update-create and process-an XML document, you need an XML parser.
There are two basic types of XML parsers:
- Tree-based parser: This parser transforms XML documents into a tree structure. It analyzes the entire document and provides access to elements in the tree, such as the Document Object Model (DOM).
- Event-based parser: Think of an XML document as a series of events. When a specific event occurs, the parser will call a function to handle it.
Expat parser is an event-based parser.
Event-based parsers focus on the content of XML documents, not their structure. Because of this, event-based parsers can access data faster than tree-based parsers.
Look at the following XML snippet:
The event-based parser reports the above XML as a series of three events:
- Starting element: from
- Start CDATA section, value: Jani
- Close element: from
The XML example above contains well-formed XML. However, this instance is invalid XML because there is no Document Type Declaration (DTD) associated with it.
However, this makes no difference when using the Expat parser. Expat is a parser that does not check for validity and ignores any DTDs.
As an event-based, non-validating XML parser, Expat is fast and lightweight, making it ideal for PHP web applications.
Note: The XML document must be well-formed, or Expat will generate an error.
XML Expat parser functions are a core part of PHP. No installation is required to use these functions.
The following XML file will be used in our example:
<body>Don't forget me this weekend!</body>
Initialize XML parser
We need to initialize the XML parser in PHP, define handlers for different XML events, and then parse this XML file.
//Initialize the XML parser
//Function to use at the start of an element
echo "-- Note --<br>";
echo "To: ";
echo "From: ";
echo "Heading: ";
echo "Message: ";
//Function to use at the end of an element
//Function to use when finding character data
//Specify element handler
//Specify data handler
//Open XML file
die (sprintf("XML Error: %s at line %d",
//Free the XML parser
The above code will output:
Message: Don't forget me this weekend!
How it works:
- Initialize the XML parser with the xml_parser_create () function
- Creating functions that work with different event handlers
- Add xml_set_element_handler () function to define which function to execute when the parser encounters the start and end tags
- Add xml_set_character_data_handler () function to define which function to execute when the parser encounters character data
- Parse the file "test.xml" with the xml_parse () function
- In case of errors, add xml_error_string () function to convert XML errors into text descriptions
- Call the xml_parser_free () function to free the memory allocated to the xml_parser_create () function