xml Archives - Page 7 of 9

What is the fastest way to parse large XML docs in Python?

August 17, 2022 by Magenaut

I am currently running the following code based on Chapter 12.5 of the Python Cookbook:

Can ElementTree be told to preserve the order of attributes?

August 17, 2022 by Magenaut

I’ve written a fairly simple filter in python using ElementTree to munge the contexts of some xml files. And it works, more or less.

How can this function be rewritten to implement OrderedDict?

August 17, 2022 by Magenaut

I have the following function which does a crude job of parsing an XML file into a dictionary. Unfortunately, since Python dictionaries are not ordered, I am unable to cycle through the nodes as I would like. How do I change this so it outputs an ordered dictionary which reflects the original order of the … Read more

Remove namespace and prefix from xml in python using lxml

August 16, 2022 by Magenaut

I have an xml file I need to open and make some changes to, one of those changes is to remove the namespace and prefix and then save to another file.
Here is the xml:

How to output CDATA using ElementTree

August 16, 2022 by Magenaut

I’ve discovered that cElementTree is about 30 times faster than xml.dom.minidom and I’m rewriting my XML encoding/decoding code. However, I need to output XML that contains CDATA sections and there doesn’t seem to be a way to do that with ElementTree.

using lxml and iterparse() to parse a big (+- 1Gb) XML file

August 16, 2022 by Magenaut

I have to parse a 1Gb XML file with a structure such as below and extract the text within the tags “Author” and “Content”:

How do I use a default namespace in an lxml xpath query?

August 15, 2022 by Magenaut

I have an xml document in the following format:

How to use Python XML findall to find ”

August 14, 2022 by Magenaut

I’m trying to do a find all from a Word document for <v:imagedata r:id="rId7" o:title="1-REN"/> with namespace xmlns:v="urn:schemas-microsoft-com:vml" and I cannot figure out what on earth the syntax is.

How can I create a Word document using Python?

August 13, 2022 by Magenaut

I’d like to create a Word document using Python, however, I want to re-use as much of my existing document-creation code as possible. I am currently using an XSLT to generate an HTML file that I programatically convert to a PDF file. However, my client is now requesting that the same document be made available … Read more

Empty list returned from ElementTree findall

August 13, 2022 by Magenaut

I’m new to xml parsing and Python so bear with me. I’m using lxml to parse a wiki dump, but I just want for each page, its title and text.