How can I get content using XML::Twig?

旧时模样 提交于 2019-12-10 13:41:38

问题


My aim is that start_tag_handler (see below) get the apps/title content when it finds an apps/title tag (see sample XML below).

And end_tag_handler gets the apps/logs content when it finds an apps/logs tag.

But instead this code returns null and exits.

This is the Perl code for parsing (using XML::Twig)###:

    #!/usr/local/bin/perl -w

    use XML::Twig;
    my $twig = XML::Twig->new(
                start_tag_handlers =>
                  { 'apps/title' => \&kicks
                  },
                twig_roots =>
                  { 'apps' => \&app
                  },
                end_tag_handlers =>
                  { 'apps/logs' => \&bye
                  }
                );
    $twig -> parsefile( "doc.xml");

    sub kicks {
        my ($twig, $elt) = @_;
        print "---kicks--- \n";
        print $elt -> text;
        print " \n";
    }

    sub app {
        my ($twig, $apps) = @_;
        print "---app--- \n";
        print $apps -> text;
        print " \n";
    }


    sub bye {
        my ($twig, $elt) = @_;
        print "bye \n";
        print $elt->text;
        print " \n";
    }

This is doc.xml###:

    <?xml version="1.0" encoding="UTF-8"?>
    <auto>
      <apps>
        <title>watch</title>
        <commands>set,start,00:00,alart,end</commands>
        <logs>csv</logs>
      </apps>
      <apps>
        <title>machine</title>
        <commands>down,select,vol_100,check,line,end</commands>
        <logs>dump</logs>
      </apps>
    </auto>

This is the output in the console###:

    C:\>perl parse.pl
    ---kicks---

    ---app---
    watchset,start,00:00,alart,endcsv
    ---kicks---

    ---app---
    machinedown,select,vol_100,check,line,enddump

回答1:


Check out the XML::Twig documentation for start_tag_handlers:

The handlers are called with 2 params: the twig and the element. The element is empty at that point, its attributes are created though.

At the time start_tag_handlers is called, the text content isn't even seen yet, since parsing of the start tag (e.g. <title>, not the end tag </title>) has only just completed.

The reason that end_tag_handlers don't supply element text is probably for symmetry :-).

What you want is probably to use twig_handlers instead:

my $twig = XML::Twig->new(
    twig_handlers => {
        'apps/title' => \&kicks,
        'apps/logs' => \&bye
    },
    twig_roots => {
        'apps' => \&app
    },
);

Output is:

---kicks--- 
watch 
bye 
csv 
---app--- 
watchset,start,00:00,alart,endcsv
---kicks--- 
machine 
bye 
dump 
---app--- 
machinedown,select,vol_100,check,line,enddump


来源:https://stackoverflow.com/questions/1447632/how-can-i-get-content-using-xmltwig

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!