# This is a .pm just to (try to) make some CPAN document converters # convert it happily as part of the dist's documentation tree. package HTML::Element::traverse; # ABSTRACT: discussion of HTML::Element's traverse method use warnings; use strict; our $VERSION = '5.07'; # VERSION from OurPkgVersion use HTML::Element (); 1; __END__ =pod =head1 NAME HTML::Element::traverse - discussion of HTML::Element's traverse method =head1 VERSION This document describes version 5.07 of HTML::Element::traverse, released August 31, 2017 as part of L. =head1 SYNOPSIS # $element->traverse is unnecessary and obscure. # Don't use it in new code. =head1 DESCRIPTION C provides a method C that traverses the tree and calls user-specified callbacks for each node, in pre- or post-order. However, use of the method is quite superfluous: if you want to recursively visit every node in the tree, it's almost always simpler to write a subroutine does just that, than it is to bundle up the pre- and/or post-order code in callbacks for the C method. =head1 EXAMPLES Suppose you want to traverse at/under a node $tree and give elements an 'id' attribute unless they already have one. You can use the C method: { my $counter = 'x0000'; $start_node->traverse( [ # Callbacks; # pre-order callback: sub { my $x = $_[0]; $x->attr('id', $counter++) unless defined $x->attr('id'); return HTML::Element::OK; # keep traversing }, # post-order callback: undef ], 1, # don't call the callbacks for text nodes ); } or you can just be simple and clear (and not have to understand the calling format for C) by writing a sub that traverses the tree by just calling itself: { my $counter = 'x0000'; sub give_id { my $x = $_[0]; $x->attr('id', $counter++) unless defined $x->attr('id'); foreach my $c ($x->content_list) { give_id($c) if ref $c; # ignore text nodes } }; give_id($start_node); } See, isn't that nice and clear? But, if you really need to know: =head1 THE TRAVERSE METHOD The C method is a general object-method for traversing a tree or subtree and calling user-specified callbacks. It accepts the following syntaxes: =over =item $h->traverse(\&callback) =item or $h->traverse(\&callback, $ignore_text) =item or $h->traverse( [\&pre_callback,\&post_callback] , $ignore_text) =back These all mean to traverse the element and all of its children. That is, this method starts at node $h, "pre-order visits" $h, traverses its children, and then will "post-order visit" $h. "Visiting" means that the callback routine is called, with these arguments: $_[0] : the node (element or text segment), $_[1] : a startflag, and $_[2] : the depth If the $ignore_text parameter is given and true, then the pre-order call I be happen for text content. The startflag is 1 when we enter a node (i.e., in pre-order calls) and 0 when we leave the node (in post-order calls). Note, however, that post-order calls don't happen for nodes that are text segments or are elements that are prototypically empty (like "br", "hr", etc.). If we visit text nodes (i.e., unless $ignore_text is given and true), then when text nodes are visited, we will also pass two extra arguments to the callback: $_[3] : the element that's the parent of this text node $_[4] : the index of this text node in its parent's content list Note that you can specify that the pre-order routine can be a different routine from the post-order one: $h->traverse( [\&pre_callback,\&post_callback], ...); You can also specify that no post-order calls are to be made, by providing a false value as the post-order routine: $h->traverse([ \&pre_callback,0 ], ...); And similarly for suppressing pre-order callbacks: $h->traverse([ 0,\&post_callback ], ...); Note that these two syntaxes specify the same operation