I would like to process a huge xml file that is distributed across a HDFS file system, using the iterparse function from lxml.etree package.
iterparse
lxml.etree
I h