Preserve the doctype when parsing and unparsing #351

Ravencentric · 2024-07-14T20:03:09Z

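Current behavior:
xmltodict.parse() discards the DOCTYPE declaration, so it is missing from the output of xmltodict.unparse().

Expected behavior:
xmltodict should preserve the DOCTYPE declaration across a parse/unparse round trip.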

Reproduction:

import xmltodict

xml = """\
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE library PUBLIC "-//Example//DTD Library 1.0//EN" "http://www.example.com/DTD/library.dtd">
<library>
    <book>
        <title>XML Basics</title>
        <author>John Doe</author>
        <published>2020</published>
    </book>
    <book>
        <title>Advanced XML Techniques</title>
        <author>Jane Smith</author>
        <published>2021</published>
    </book>
</library>
"""

parsed = xmltodict.parse(xml)
unparsed = xmltodict.unparse(parsed, pretty=True)
print(unparsed)
# Output (the DOCTYPE declaration is missing):
# <?xml version="1.0" encoding="utf-8"?>
# <library>
#         <book>
#                 <title>XML Basics</title>
#                 <author>John Doe</author>
#                 <published>2020</published>
#         </book>
#         <book>
#                 <title>Advanced XML Techniques</title>
#                 <author>Jane Smith</author>
#                 <published>2021</published>
#         </book>
# </library>
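
Possible workaround, as a rough sketch only: capture the DOCTYPE separately with the standard library's expat parser and put it back into the unparsed string after the XML declaration. The helper names extract_doctype and insert_doctype are hypothetical and not part of xmltodict, and this version assumes the DOCTYPE has no internal subset (any internal subset would be dropped).

```python
from xml.parsers import expat


def extract_doctype(xml_text):
    """Return the DOCTYPE declaration as a string, or None if there is none.

    Hypothetical helper, not part of xmltodict. An internal subset
    ([...] inside the DOCTYPE) is not reproduced.
    """
    found = {}

    def on_doctype(name, system_id, public_id, has_internal_subset):
        # expat reports the pieces of the declaration; store them for later.
        found["name"] = name
        found["system_id"] = system_id
        found["public_id"] = public_id

    parser = expat.ParserCreate()
    parser.StartDoctypeDeclHandler = on_doctype
    parser.Parse(xml_text, True)

    if not found:
        return None
    if found["public_id"] is not None:
        return '<!DOCTYPE {} PUBLIC "{}" "{}">'.format(
            found["name"], found["public_id"], found["system_id"]
        )
    if found["system_id"] is not None:
        return '<!DOCTYPE {} SYSTEM "{}">'.format(found["name"], found["system_id"])
    return "<!DOCTYPE {}>".format(found["name"])


def insert_doctype(xml_out, doctype):
    """Insert doctype after the XML declaration of xml_out, if it has one."""
    if doctype is None:
        return xml_out
    if xml_out.startswith("<?xml"):
        decl_end = xml_out.index("?>") + 2
        return xml_out[:decl_end] + "\n" + doctype + xml_out[decl_end:]
    # No XML declaration: the DOCTYPE goes first.
    return doctype + "\n" + xml_out
```

With the reproduction above, this would be used as insert_doctype(xmltodict.unparse(parsed, pretty=True), extract_doctype(xml)). It parses the input twice and works on strings only, so it is a stopgap rather than a replacement for built-in support.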
