Frequently asked questions about getting started with XML (4)

黄舟

Dec 22, 2016, 05:39 PM


How do I deal with whitespace characters in the XML object model?

Sometimes the XML object model exposes TEXT nodes that contain only whitespace characters. Because the whitespace is trimmed when the node is displayed, these nodes look empty, which can be confusing. Consider the following XML example:

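    <?xml version="1.0"?>
    <!DOCTYPE person [
      <!ELEMENT person (#PCDATA|lastname|firstname)*>
      <!ELEMENT lastname (#PCDATA)>
      <!ELEMENT firstname (#PCDATA)>
    ]>
    <person>
      <lastname>Smith</lastname>
      <firstname>John</firstname>
    </person>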

This generates the following tree:

    Processing Instruction: xml
    DocType: person
    ELEMENT: person
        TEXT:
        ELEMENT: lastname
        TEXT:
        ELEMENT: firstname
        TEXT:

　The first name and last name are surrounded by TEXT nodes containing only whitespace characters, because the content model of the "person" element is MIXED; it contains the #PCDATA keyword. The MIXED content model specifies that text can exist between elements. Therefore, the following is also correct:

    <person>My last name is <lastname>Smith</lastname> and my first name is <firstname>John</firstname>
    </person>

The result is a tree similar to the following:

    ELEMENT: person
        TEXT: My last name is
        ELEMENT: lastname
        TEXT: and my first name is
        ELEMENT: firstname
        TEXT:

Without the whitespace around the words "is" and "and", the sentence would be unreadable. In a MIXED content model, therefore, text, whitespace characters, and elements are all significant. This is not the case for non-MIXED content models.

To make the TEXT nodes that contain only whitespace characters disappear, remove the #PCDATA keyword from the "person" element declaration:

    <!ELEMENT person (lastname,firstname)>

The result is the following clean tree:

    Processing Instruction: xml
    DocType: person
    ELEMENT: person
        ELEMENT: lastname
        ELEMENT: firstname
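The effect of whitespace-only TEXT nodes can be reproduced with a minimal sketch in Python. It assumes the standard-library xml.dom.minidom parser rather than the MSXML object model this FAQ describes. minidom is non-validating and keeps whitespace-only TEXT nodes whatever the DTD says, so the sketch removes them by hand. The helper names child_summary and strip_whitespace_text are hypothetical.

```python
from xml.dom import minidom

XML = """<person>
  <lastname>Smith</lastname>
  <firstname>John</firstname>
</person>"""


def child_summary(element):
    """Return a (kind, value) tuple for each direct child of element."""
    summary = []
    for node in element.childNodes:
        if node.nodeType == node.TEXT_NODE:
            summary.append(("TEXT", node.data))
        elif node.nodeType == node.ELEMENT_NODE:
            summary.append(("ELEMENT", node.tagName))
    return summary


def strip_whitespace_text(element):
    """Remove direct child TEXT nodes that hold only whitespace."""
    for node in list(element.childNodes):
        if node.nodeType == node.TEXT_NODE and not node.data.strip():
            element.removeChild(node)


doc = minidom.parseString(XML)
person = doc.documentElement

before = child_summary(person)   # whitespace TEXT nodes surround the elements
strip_whitespace_text(person)
after = child_summary(person)    # only the two ELEMENT children remain

print(before)
print(after)
```

Stripping like this is only safe for element-only content. In a MIXED content model the same whitespace may be part of the text and should be left alone.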

　What does an XML declaration do?

The XML declaration must be listed at the top of the XML document:

    <?xml version="1.0" encoding="UTF-8"?>

It specifies the following items:

The document is an XML document. MIME detectors can use this to recognize a file as text/xml when the MIME type is missing or has not been specified.

The document complies with the XML 1.0 specification. This will be important in the future if there are other versions of XML.

The character encoding of the document. The encoding attribute is optional and defaults to UTF-8.

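Note: The XML declaration must be on the first line of the XML document. For example, an XML file that starts with a comment and has the declaration on the second line:

    <!-- a comment before the declaration -->
    <?xml version="1.0"?>

produces the following parsing error:

    Invalid xml declaration.
    Line 0000002: <?xml version="1.0"?>
    Location 0000007: ------^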

Note: The XML declaration is optional. If you need to put comments or processing instructions at the top of the file, omit the XML declaration. The encoding then defaults to UTF-8.
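The placement rule can be checked with a minimal sketch in Python. It assumes the standard-library xml.dom.minidom parser (built on expat) rather than MSXML, so the error text differs from the message shown above. The sample documents and the parses helper are hypothetical.

```python
from xml.dom import minidom
from xml.parsers.expat import ExpatError

# Declaration first, then a comment: well-formed.
GOOD = '<?xml version="1.0"?>\n<!-- comment after the declaration -->\n<person/>'
# Comment first, then the declaration: not well-formed.
BAD = '<!-- comment before the declaration -->\n<?xml version="1.0"?>\n<person/>'
# No declaration at all: well-formed, because the declaration is optional.
NO_DECLARATION = '<!-- comment at the top -->\n<person/>'


def parses(xml_text):
    """Return True if xml_text is well-formed, False if the parser rejects it."""
    try:
        minidom.parseString(xml_text)
        return True
    except ExpatError as error:
        print("parse error:", error)
        return False


results = {
    "good": parses(GOOD),
    "bad": parses(BAD),
    "no_declaration": parses(NO_DECLARATION),
}
print(results)
```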

　How can I print my XML document in a readable format?

When you build a document from scratch with the DOM and save it as an XML file, everything is written on one line with no whitespace between the elements. This is the default behavior.

The default XSL stylesheet built into Internet Explorer 5 displays and prints XML documents in a readable format. For example, if you have IE5 installed, try viewing the nospace.xml file; the browser should display it as an indented tree.

Pretty-printing XML is tricky, especially when DTDs define different types of content models. For example, under the mixed content model (#PCDATA) you cannot insert whitespace, because it might change the meaning of the content. Consider the following XML:

    <B>E</B><I>lephant</I>

It would be wrong to output this as:

    <B>E</B>
    <I>lephant</I>

because the word boundaries are no longer correct.

All of this makes automated pretty-printing problematic. If you do need readable XML output, you can use the DOM to insert whitespace characters as text nodes at the appropriate locations.
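The difference between the default one-line output and an indented rendering can be sketched in Python. This assumes the standard-library xml.dom.minidom module, whose toprettyxml method inserts the whitespace for you; it is not the MSXML behavior described above. The element names are borrowed from the earlier "person" example.

```python
from xml.dom import minidom

# Build a small document from scratch with the DOM.
doc = minidom.Document()
person = doc.createElement("person")
doc.appendChild(person)
for tag, value in (("lastname", "Smith"), ("firstname", "John")):
    child = doc.createElement(tag)
    child.appendChild(doc.createTextNode(value))
    person.appendChild(child)

# Default serialization: everything on one line, no whitespace added.
compact = person.toxml()

# Indented serialization: the serializer adds line breaks and indentation.
pretty = person.toprettyxml(indent="  ")

print(compact)
print(pretty)
```

As with any pretty-printer, this is only safe for element-only content. Applying it to mixed content adds whitespace that may change the meaning of the text.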

How do I use namespaces in a DTD?

To use a namespace in a DTD, declare it as an xmlns attribute in the ATTLIST declaration of the element that uses it. The namespace declaration must be #FIXED. The same applies to namespaces on attributes.

　Namespaces and XML schemas

DTDs and XML schemas cannot be mixed. For example, the declaration xmlns:x CDATA #FIXED "x-schema:myschema.xml" will not cause the schema defined in myschema.xml to be used. DTDs and XML schemas are mutually exclusive.

How do I use XMLDSO in Visual Basic?

Use the following XML as an example:

    Mark Hanson 206 765 4583
    Jane Smith 425 808 1111

You can bind to an ADO recordset as follows:

　Create a new VB 6.0 project.

　Add references to Microsoft ActiveX Data Objects 2.1 or later, Microsoft Data Adapter Library, and Microsoft XML version 2.0.

Use the following code to load XML data into the XML DSO control:

    Dim dso As New XMLDSOControl
    Dim doc As IXMLDOMDocument
    Set doc = dso.XMLDocument
    ' Load the sample XML into doc here, using doc.Load with the path of your file.

Then bind the DSO to an ADO recordset through a DataAdapter:

    Dim da As New DataAdapter
    Set da.Object = dso
    Dim rs As New ADODB.Recordset
    Set rs.DataSource = da

Access the data:

    MsgBox rs.Fields("name").Value

The result shows the string "Mark Hanson".
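The idea of reading XML records as rows with named fields can be sketched in Python. This uses the standard-library xml.etree.ElementTree module and is not the XMLDSO or ADO API. Only the "name" field appears in the code above; the "people", "person", and "phone" tag names and the to_rows helper are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup for the two sample records; only "name" is confirmed by the FAQ.
PEOPLE_XML = """<people>
  <person><name>Mark Hanson</name><phone>206 765 4583</phone></person>
  <person><name>Jane Smith</name><phone>425 808 1111</phone></person>
</people>"""


def to_rows(xml_text):
    """Turn each child of the root into a dict mapping field name to text."""
    root = ET.fromstring(xml_text)
    return [{field.tag: field.text for field in record} for record in root]


rows = to_rows(PEOPLE_XML)

# Equivalent in spirit to rs.Fields("name").Value on the first record.
print(rows[0]["name"])
```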

How do I use the XML DOM in Java?

The IE5 version of MSXML.DLL must be installed. In Visual J++ 6.0, select Add COM Wrapper from the Project menu, and then select "Microsoft XML 1.0" from the list of COM objects. This generates the required Java wrappers in a new package called "msxml". Pre-built versions of these Java wrappers are also available for download. The classes can be used as follows:

    import com.ms.com.*;
    import msxml.*;

    public class Class1
    {
        public static void main(String[] args)
        {
            // Create the COM-backed document and load the file by URL.
            DOMDocument doc = new DOMDocument();
            doc.load(new Variant("file://d:/samples/ot.xml"));
            // Print the name of the root element.
            System.out.println("Loaded " + doc.getDocumentElement().getNodeName());
        }
    }

The code example loads the 3.8MB test file "ot.xml" from the Sun religion samples. The Variant class wraps the Win32 VARIANT basic type.
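For comparison, the same load-and-inspect step can be sketched in Python. It assumes the standard-library xml.dom.minidom module instead of the MSXML COM wrappers. The temporary sample.xml file and the root_name helper are hypothetical stand-ins, since "ot.xml" is not included here.

```python
import os
import tempfile
from xml.dom import minidom


def root_name(path):
    """Parse the file at path and return the name of its root element."""
    doc = minidom.parse(path)
    return doc.documentElement.nodeName


# Write a small stand-in document so the sketch is self-contained.
with tempfile.TemporaryDirectory() as folder:
    path = os.path.join(folder, "sample.xml")
    with open(path, "w", encoding="utf-8") as handle:
        handle.write("<person><lastname>Smith</lastname></person>")
    loaded = root_name(path)

print("Loaded " + loaded)
```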

You cannot use pointer comparisons on nodes, because you actually get a new wrapper every time you retrieve a node. So instead of the following code:

    IXMLDOMNode root1 = doc.getDocumentElement();
    IXMLDOMNode root2 = doc.getDocumentElement();
    if (root1 == root2) ...

use this code:

    if (ComLib.isEqualUnknown(root1, root2)) ...
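The pitfall is not specific to COM. A minimal Python sketch can imitate it with hypothetical wrapper classes around xml.dom.minidom nodes: each retrieval creates a fresh wrapper, so comparing the wrappers fails even though they refer to the same node. NodeWrapper, DocumentWrapper, and their methods are invented for illustration; only isSameNode is a real minidom call.

```python
from xml.dom import minidom


class NodeWrapper:
    """Hypothetical stand-in for a generated wrapper around a DOM node."""

    def __init__(self, node):
        self.node = node

    def is_same_node(self, other):
        # Compare the wrapped nodes, not the wrapper objects.
        return self.node.isSameNode(other.node)


class DocumentWrapper:
    """Hypothetical document wrapper that returns a new NodeWrapper per call."""

    def __init__(self, xml_text):
        self._doc = minidom.parseString(xml_text)

    def get_document_element(self):
        return NodeWrapper(self._doc.documentElement)


doc = DocumentWrapper("<person/>")
root1 = doc.get_document_element()
root2 = doc.get_document_element()

wrapper_identity = root1 is root2            # compares the wrapper objects
node_identity = root1.is_same_node(root2)    # compares the underlying nodes

print(wrapper_identity, node_identity)
```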

The total size of the .class wrappers is approximately 160KB. However, for full compliance with the W3C specification, only the IXMLDOM* wrappers should be used. The following classes are old IE 4.0 XML interfaces and can be removed from the msxml folder:

　_xml_error*

This reduces the size to 147KB. You can also delete the following items:

DOMFreeThreadedDocument: accesses XML documents from multiple threads in Java applications.

XMLHTTPRequest: uses the XML DAV HTTP extension to communicate with the server.

IXTLRuntime: defines the XSL stylesheet script object.

XMLDSOControl: binds to XML data in an HTML page.

XMLDOMDocumentEvents: returns callbacks during parsing.

This reduces the size to 116KB. To make it smaller still, consider that the DOM itself has two layers. The core layer consists of:

    DOMDocument, IXMLDOMDocument
    IXMLDOMNode*
    IXMLDOMNodeList*
    IXMLDOMNamedNodeMap*
    IXMLDOMDocumentFragment*
    IXMLDOMImplementation
    IXMLDOMParseError

Users may also need to retain the DTD information:

    IXMLDOMDocumentType
    IXMLDOMEntity
    IXMLDOMNotation

All node types in an XML document are IXMLDOMNodes, which provide the full functionality, but there are higher-level wrappers for each node type. Therefore, if you modify the DOMDocument wrapper and change these specific types to use IXMLDOMNode, all of the following interfaces can be removed:

    IXMLDOMAttribute
    IXMLDOMCDATASection
    IXMLDOMCharacterData
    IXMLDOMComment
    IXMLDOMProcessingInstruction
    IXMLDOMEntityReference
    IXMLDOMText

Deleting these reduces the size to 61KB. However, IXMLDOMElement is worth keeping, because its getAttribute and setAttribute methods are useful. Without it, you need to use:

　IXMLDOMNode.getAttributes().setNamedItem(...)
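The trade-off between the element-level methods and the node-level attribute map can be sketched in Python. This assumes the standard-library xml.dom.minidom module, which exposes the same W3C DOM calls under Python names; it is not the MSXML wrapper API. The attribute names "id" and "role" are made up for the example.

```python
from xml.dom import minidom

doc = minidom.parseString("<person/>")
person = doc.documentElement

# Element-level convenience method: one call sets the attribute.
person.setAttribute("id", "42")

# Node-level equivalent: build an Attr node, then store it in the attribute map.
attr = doc.createAttribute("role")
attr.value = "author"
person.attributes.setNamedItem(attr)

# Both attributes can be read back either way.
direct = person.getAttribute("id")
via_map = person.attributes.getNamedItem("role").value

print(direct, via_map)
print(person.toxml())
```

The node-level route takes three steps where setAttribute takes one, which is why the element interface is worth keeping.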
