=head1 NAME
HTML::Mason::Devel - Mason Developer's Manual
=head1 DESCRIPTION
This manual is written for content developers who know HTML and at
least a little Perl. The goal is to write, run, and debug Mason
components.
If you are the webmaster (or otherwise responsible for the Mason
installation), you should also read L. There you will find information about site
configuration, performance tuning, component caching, and so on.
If you are a developer just interested in knowing more about Mason's
capabilities and implementation, then L is for you too.
We strongly suggest that you have a working Mason to play with as you
work through these examples. Other component examples can be found in
the C directory.
While Mason can be used for tasks besides implementing a dynamic web
site, that is what I people want to do with Mason, and is thus
the focus of this manual.
If you are planning to use Mason outside of the web, this manual will
still be useful, of course. Also make sure to read the L section of the administrator's manual.
=head1 HOW TO USE THIS MANUAL
If you are just learning Mason and want to get started quickly, we
recommend the following sections:
o L
o L
o L
o L
o L
o L (mainly C<< <%init> >>)
o L
o L
=head1 WHAT ARE COMPONENTS?
The component - a mix of Perl and HTML - is Mason's basic building
block and computational unit. Under Mason, web pages are formed by
combining the output from multiple components. An article page for a
news publication, for example, might call separate components for the
company masthead, ad banner, left table of contents, and article
body. Consider this layout sketch:
+---------+------------------+
|Masthead | Banner Ad |
+---------+------------------+
| | |
|+-------+|Text of Article ..|
|| || |
||Related||Text of Article ..|
||Stories|| |
|| ||Text of Article ..|
|+-------+| |
| +------------------+
| | Footer |
+---------+------------------+
The top level component decides the overall page layout, perhaps with
HTML tables. Individual cells are then filled by the output of
subordinate components, one for the Masthead, one for the Footer,
etc. In practice pages are built up from as few as one, to as many as
twenty or more components.
This component approach reaps many benefits in a web environment. The
first benefit is I: by embedding standard design
elements in components, you ensure a consistent look and make it
possible to update the entire site with just a few edits. The second
benefit is I: in a multi-person environment, one person
can edit the masthead while another edits the table of contents. A
last benefit is I: a component produced for one site
might be useful on another. You can develop a library of generally
useful components to employ on your sites and to share with others.
Most components emit chunks of HTML. "Top level" components, invoked
from a URL, represent an entire web page. Other, subordinate
components emit smaller bits of HTML destined for inclusion in top
level components.
Components receive form and query data from HTTP requests. When called
from another component, they can accept arbitrary parameter lists just
like a subroutine, and optionally return values. This enables a type
of component that does not print any HTML, but simply serves as a
function, computing and returning a result.
Mason actually compiles components down to Perl subroutines, so you
can debug and profile component-based web pages with standard Perl
tools that understand the subroutine concept, e.g. you can use the
Perl debugger to step through components, and B to
profile their performance.
=head1 IN-LINE PERL SECTIONS
Here is a simple component example:
<%perl>
my $noun = 'World';
my @time = localtime;
%perl>
Hello <% $noun %>,
% if ( $time[2] < 12 ) {
good morning.
% } else {
good afternoon.
% }
After 12 pm, the output of this component is:
Hello World, good afternoon.
This short example demonstrates the three primary "in-line" Perl
sections. In-line sections are generally embedded within HTML and
execute in the order they appear. Other sections (C<< <%init> >>,
C<< <%args> >>, etc.) are tied to component events like initialization,
cleanup, and argument definition.
The parsing rules for these Perl sections are as follows:
=over
=item 1.
Blocks of the form <% xxx %> are replaced with the result of
evaluating xxx as a single Perl expression. These are often used for
variable replacement. such as 'Hello, <% $name %>!'.
=item 2.
Lines beginning with a '%' character are treated as Perl.
=item 3.
Multiline blocks of Perl code can be inserted with the C<< <%perl> >>
.. C<< %perl> >> tag. The enclosed text is executed as Perl and the return
value, if any, is discarded.
The C<< <%perl> >> tag, like all block tags in Mason, is
case-insensitive. It may appear anywhere in the text, and may span any
number of lines. C<< <%perl> >> blocks cannot be nested inside one
another.
=back
=head2 Examples and Recommended Usage
B<% lines>
Most useful for conditional and loop structures - if, while, foreach,
, etc. - as well as side-effect commands like assignments. To improve
readability, always put a space after the '%'. Examples:
o Conditional code
% my $ua = $r->header_in('User-Agent');
% if ($ua =~ /msie/i) {
Welcome, Internet Explorer users
...
% } elsif ($ua =~ /mozilla/i) {
Welcome, Netscape users
...
% }
o HTML list formed from array
% foreach $item (@list) {
<% $item %>
% }
o HTML list formed from hash
% while (my ($key,$value) = each(%ENV)) {
<% $key %>: <% $value %>
% }
o HTML table formed from list of hashes
% foreach my $h (@loh) {
<% $h->{foo} %>
<% $h->{bar} %>
<% $h->{baz} %>
% }
B<< <% xxx %> >>
Most useful for printing out variables, as well as more complex
expressions. To improve readability, always separate the tag
and expression with spaces. Examples:
Dear <% $name %>: We will come to your house at <% $address %> in the
fair city of <% $city %> to deliver your $<% $amount %> dollar prize!
The answer is <% ($y+8) % 2 %>.
You are <% $age < 18 ? 'not' : '' %> permitted to enter this site.
B<< <%perl> xxx %perl> >>
Useful for Perl blocks of more than a few lines.
=head1 MASON OBJECTS
This section describes the various objects in the Mason universe.
If you're just starting out, all you need to worry about initially
are the request objects.
=head2 Request Objects
Two global per-request objects are available to all components: $r and
$m.
$r, the mod_perl request object, provides a Perl API to the current
Apache request. It is fully described in Apache.pod. Here is a
sampling of methods useful to component developers:
$r->uri # the HTTP request URI
$r->header_in(..) # get the named HTTP header line
$r->content_type # set or retrieve content-type
$r->header_out(..) # set or retrieve an outgoing header
$r->content # don't use this one! (see Tips and Traps)
$m, the Mason request object, provides an analogous API for
Mason. Almost all Mason features not activated by syntactic tags are
accessed via $m methods. You'll be introduced to these methods
throughout this document as they are needed. For a description of all
methods see B>.
Because these are always set inside components, you should not ever
define other variables with the same name, or else your code may fail
in strange and mysterious ways.
=head2 Component Objects
Mason provides an object API for components, allowing you to query a
component's various asociated files, arguments, etc. For a description
of all methods see
B>. Typically you
get a handle on a component object from request methods like C<<
$m->current_comp >> and C<< $m->fetch_comp >>.
Note that for many basic applications all you'll want to do with
components is call them, for which no object method is needed. See
next section.
=head2 System Objects
Many system objects share the work of serving requests in Mason:
L,
L,
L,
L,
and L are examples. The
administrator creates these objects and provides parameters that shape
Mason's behavior. As a pure component developer you shouldn't need to
worry about or access these objects, but occasionally we'll mention a
relevant parameter.
=head1 CALLING COMPONENTS
Mason pages often are built not from a single component, but from
multiple components that call each other in a hierarchical fashion.
=head2 Components that output HTML
To call one component from another, use the <& &> tag:
<& comp_path, [name=>value, ...] &>
=over
=item comp_path:
The component path. With a leading '/', the path is relative to the
component root (L). Otherwise, it is relative to the
location of the calling component.
=item name => value pairs:
Parameters are passed as one or more C<< name => value >> pairs,
e.g. S 'M. Jordan' >>>.
=back
comp_path may be a literal string (quotes optional) or a Perl expression
that evaluates to a string. To eliminate the need for quotes in most
cases, Mason employs some magic parsing: If the first character is
one of C<[\w/_.]>, comp_path is assumed to be a literal
string running up to the first comma or &>. Otherwise, comp_path
is evaluated as an expression.
Here are some examples:
# relative component paths
<& topimage &>
<& tools/searchbox &>
# absolute component path
<& /shared/masthead, color=>'salmon' &>
# this component path MUST have quotes because it contains a comma
<& "sugar,eggs", mix=>1 &>
# variable component path
<& $comp &>
# variable component and arguments
<& $comp, %args &>
# you can use arbitrary expression for component path, but it cannot
# begin with a letter or number; delimit with () to remedy this
<& (int(rand(2)) ? 'thiscomp' : 'thatcomp'), id=>123 &>
Several request methods also exist for calling components. C<< $m->comp >>
performs the equivalent action to <& &>:
$m->comp('/shared/masthead', color=>'salmon');
C<< $m->scomp >> is like the sprintf version of C<< $m->comp >>: it returns
the component output, allowing the caller to examine and modify it
before printing:
my $masthead = $m->scomp('/shared/masthead', color=>'salmon');
$masthead =~ ...;
$m->print($masthead);
=head2 Component Calls with Content
Components can be used to filter part of the page's content using an
extended component syntax.
<&| /path/to/comp &> this is the content &>
<&| comp, arg1 => 'hi' &> filters can take arguments &>
<&| comp &> content can include <% "tags" %> of all kinds &>
<&| comp1 &> nesting is also <&| comp2 &> OK &> &>
<&| SELF:method1 &> subcomponents can be filters &>
The filtering component can be called in all the same ways a normal
component is called, with arguments and so forth. The only difference
between a filtering component and a normal component is that a
filtering component is expected to fetch the content by calling
$m->content and do something with it.
The ending tag may optionally contain the name of the component, and Mason
will verify that it matches the name in the starting tag. This may
be helpful when the tags are far apart or nested. To avoid
ambiguous situations, this is only allowed when the component name
is an unquoted literal (starting with C<[\w/_.]>). For anything
more complicated, such as C<< <|& $var &> >> or C<< <&| 'name' &> >>,
the simple C<< &> >> form must be used.
<&| "outer" &>
<&| /inner/comp, arg=>'this' &>
<&| .mycomp &>
Yada yada yada
& .mycomp >
& /inner/comp >
&>
Here is an example of a component used for localization. Its content
is a series of strings in different languages, and it selects the
correct one based on a global C<$lang> variable, which could be setup
in a site-level autohandler.
<&| /i18n/itext &>
Hello, <% $name %> This is a string in EnglishSchoene Gruesse, <% $name %>, diese Worte sind auf Deutschellohay <% substr($name,2).substr($name,1,1).'ay' %>,
isthay isay igpay atinlay
&>
Here is the F component:
<% $text %>
<%init>
# this assumes $lang is a global variable which has been set up earlier.
local $_ = $m->content;
my ($text) = m{<$lang>(.*?)$lang>};
%init>
You can explicitly check whether a component has passed content by
checking the boolean C<< $m->has_content >>. This allows you to write
a component that will do different things depending on whether it was
passed content. However, before overloading a component in this way,
consider whether splitting the behavior into two distinct components
would work as well.
If a normal component which does not call C<< $m->content >> is called
with content, the content will not be output.
If you wrap a filtering component call around the entire component,
the result will be functionally similar to a C<< <%filter> >> section.
See also L.
=head2 Advanced Components Calls with Content
Internally C<< $m->content >> is implemented with a closure containing
the part of the component which is the content. In English, that
means that any mason tags and perl code in the content are evaluated
when C<< $m->content >> is called, and C<< $m->content >> returns the
text which would have been output by mason. Because the contents are
evaluated at the time that C<< $m->content >> is called, one can write
components which act as control structures or which output their
contents multiple times with different values for the variables (can
you say taglibs?).
The tricky part of using filter components as control structures is
setting up variables which can be accessed from both the filter
component and the content, which is in the component which calls the
filter component. The content has access to all variables in the
surrounding component, but the filtering component does not. There
are two ways to do this: use global variables, or pass a reference to
a lexical variable to the filter component.
Here is a simple example using the second method:
% my $var;
<&| list_items , list => \@items, var => \$var &>
<% $var %>
&>
list_items component:
<%args>
@list
$var
%args>
% foreach (@list) {
% $$var = $_; # $var is a reference
<% $m->content %>
% }
Using global variables can be somewhat simpler. Below is the same
example, with C<$var> defined as a global variable. The site
administrator must make sure that C<$var> is included in Mason's
allow_globals parameter. Local-izing C<$var> within the filter
component will allow the list_items component to be nested.
<&| list_items, list => \@items &>
<% $var %>
&>
list_items component:
<%args>
@list
%args>
% foreach (@list) {
% local $var = $_;
<% $m->content %>
% }
Besides remembering to include C<$var> in allow_globals, the
developers should take care not to use that variable is other places
where it might conflict with usage by the filter component.
Local-izing $var will also provide some protection against using it in
other places.
An even simpler method is to use the C<$_> variable. It is already
global, and is automatically local-ized by the foreach statement:
<&| list_items, list => \@items &>
<% $_ %>
&>
list_items component:
<%args>
@list
%args>
% foreach (@list) {
<% $m->content %>
% }
=head2 Components that Return Values
So far you have seen components used solely to output HTML. However,
components may also be used to return values.
While we will demonstrate how this is done, we strongly encourage you
to put code like this in modules instead. There are several reasons
why this is a good idea:
=over 4
=item *
You can re-use this code outside of Mason.
=item *
It is easy to preload module code when running under mod_perl, which
can lower memory usage.
=item *
Using Mason components as subroutines is slower than just using
modules to do the same thing.
=item *
It's easier to regression test module code.
=back
With that being said, there are times when you may want to write a
component which returns a value.
As an example, you might have a component C that analyzes
the user agent to determine whether it is a Netscape browser:
<%init>
my $ua = $r->header_in('User-Agent');
return ($ua =~ /Mozilla/i && $ua !~ /MSIE/i) ? 1 : 0;
%init>
Because components are implemented underneath with Perl subroutines,
they can return values and even understand scalar/list
context. e.g. The result of wantarray() inside a component will reflect
whether the component was called in scalar or list context.
The <& &> notation only calls a component for its side-effect, and
discards its return value, if any. To get at the return value of a
component, use the C<< $m->comp >> command:
% if ($m->comp('is_netscape')) {
Welcome, Netscape user!
% }
Mason adds a C to the bottom of each component to
provide an empty default return value. To return your own value from a
component, you I use an explicit C statement. You cannot
rely on the usual Perl trick of letting return values "fall through".
While it is possible for a component to generate output B return
values, there is very little reason for a component to do both. For
example, it would not be very friendly for C to output
"hi Mom" while it was computing its value, thereby surprising the
C statement! Conversely, any value returned by an output
generating component would typically be discarded by the <& &> tag
that invoked it.
=head2 Subrequests
You may sometimes want to have a component call go through all the
steps that the initial component call goes through, such as checking
for autohandlers and dhandlers. To do this, you need to execute a
subrequest.
A subrequest is simply a Mason Request object and has all of the
methods normally associated with one.
To create a subrequest you simply use the C<< $m->make_subrequest >>
method. This method can take any parameters belonging to
L, such as L or
L. Once you have a new request object you simply call its
C method to execute it, which takes exactly the same parameters
as the C method.
Since subrequests inherit their parent request's parameters, output
from a component called via a subrequest goes to the same desintation
as output from components called during the parent request. Of course,
you can change this.
Here are some examples:
<%perl>
my $req = $m->make_subrequest( comp => '/some/comp', args => [ id => 172 ] );
$req->exec;
%perl>
If you want to capture the subrequest's output in a scalar, you can
simply pass an L parameter to C<< $m->make_subrequest >>:
<%perl>
my $buffer;
my $req =
$m->make_subrequest
( comp => '/some/comp', args => [ id => 172 ], out_method => \$buffer );
$req->exec;
%perl>
Now C<$buffer> contains all the output from that call to F.
For convenience, Mason also provides an C<< $m->subexec >> method.
This method takes the same arguments as C<< $m->comp >> and internally
calls C<< $m->make_subrequest >> and then C on the created
request, all in one fell swoop. This is useful in cases where you
have no need to override any of the parent request object's
attributes.
By default, output from a subrequest appears inline in the calling
component, at the point where it is executed. If you wish to do
something else, you will need to explicitly override the subrequest's
L parameter.
Mason Request objects are only designed to handle a single call to
C. If you wish to make multiple subrequests, you must create
a new subrequest object for each one.
=head1 TOP-LEVEL COMPONENTS
The first component invoked for a page (the "top-level component")
resides within the DocumentRoot and is chosen based on the URL. For
example:
http://www.foo.com/mktg/prods.html?id=372
Mason converts this URL to a filename,
e.g. F. Mason loads and
executes that file as a component. In effect, Mason calls
$m->comp('/mktg/prods.html', id=>372)
This component might in turn call other components and execute some Perl
code, or it might contain nothing more than static HTML.
=head2 dhandlers
What happens when a user requests a component that doesn't exist? In
this case Mason scans backward through the URI, checking each
directory for a component named I ("default handler"). If
found, the dhandler is invoked and is expected to use
C<< $m->dhandler_arg >> as the parameter to some
access function, perhaps a database lookup or location in another
filesystem. In a sense, dhandlers are similar in spirit to Perl's
AUTOLOAD feature; they are the "component of last resort" when a URL
points to a non-existent component.
Consider the following URL, in which F exists but not the
subdirectory F nor the component F:
http://myserver/newsfeeds/LocalNews/Story1
In this case Mason constructs the following search path:
/newsfeeds/LocalNews/Story1 => no such thing
/newsfeeds/LocalNews/dhandler => no such thing
/newsfeeds/dhandler => found! (search ends)
/dhandler
The found dhandler would read "LocalNews/Story1" from
C<< $m->dhandler_arg >> and use it as a retrieval key into a
database of stories.
Here's how a simple /newsfeeds/dhandler might look:
<& header &>
<% $headline %>
<% $body %>
<& footer &>
<%init>
my $arg = $m->dhandler_arg; # get rest of path
my ($section, $story) = split("/", $arg); # split out pieces
my $sth = $DBH->prepare
(qq{SELECT headline,body FROM news
WHERE section = ? AND story = ?);
$sth->execute($section, $story);
my ($headline, $body) = $sth->fetchrow_array;
return 404 if !$headline; # return "not found" if no such story
%init>
By default dhandlers do not get a chance to handle requests to a
directory itself (e.g. F). These are automatically
deferred to Apache, which generates an index page or a FORBIDDEN
error. Often this is desirable, but if necessary the administrator
can let in directory requests as well; see the L section of the administrator's manual.
A component or dhandler that does not want to handle a particular
request may defer control to the next dhandler by calling C<<
$m->decline >>.
When using dhandlers under mod_perl, you may find that sometimes
Apache will not set a content type for a response. This usually
happens when a dhandler handles a request for a non-existent file or
directory. You can add a C<< >> or C<< >>
block containing a C directive to your Apache config file, or
you can just set the content type dynamically by calling C<<
$r->content_type >>.
The administrator can customize the file name used for dhandlers with
the L parameter.
=head2 autohandlers
Autohandlers allow you to grab control and perform some action just
before Mason calls the top-level component. This might mean adding a
standard header and footer, applying an output filter, or setting up
global variables.
Autohandlers are directory based. When Mason determines the top-level
component, it checks that directory and all parent directories for a
component called F. If found, the autohandler is called
first. After performing its actions, the autohandler typically calls
C<< $m->call_next >> to transfer control to the original intended
component.
C<< $m->call_next >> works just like C<< $m->comp >> except that the component path
and arguments are implicit. You can pass additional arguments to
C<< $m->call_next >>; these are merged with the original arguments, taking
precedence in case of conflict. This allows you, for example, to
override arguments passed in the URL.
Here is an autohandler that adds a common header and footer to each
page underneath its directory:
McHuffy Incorporated
% $m->call_next;
Copyright 1999 McHuffy Inc.
Same idea, using components for the header/footer:
<& /shared/header &>
% $m->call_next;
<& /shared/footer &>
The next autohandler applies a filter to its pages, adding an absolute
hostname to relative image URLs:
% $m->call_next;
<%filter>
s{(]+src=\")/} {$1http://images.mysite.com/}ig;
%filter>
Most of the time autohandler can simply call C<< $m->call_next >>
without needing to know what the next component is. However, should
you need it, the component object is available from
C<< $m->fetch_next >>. This is useful for calling the component manually,
e.g. if you want to suppress some original arguments or if you want to
use C<< $m->scomp >> to store and process the output.
What happens if more than one autohandler applies to a page? Prior to
version 0.85, only the most specific autohandler would execute. In
0.85 and beyond each autohandler gets a chance to run. The top-most
autohandler runs first; each C<< $m->call_next >> transfers control to the
next autohandler and finally to the originally called component. This
allows you, for example, to combine general site-wide templates and
more specific section-based templates.
Autohandlers can be made even more powerful in conjunction with
Mason's object-oriented style features: methods, attributes, and
inheritance. In the interest of space these are discussed in a
separate section, L.
The administrator can customize the file name used for autohandlers
with the L parameter.
=head2 dhandlers vs. autohandlers
dhandlers and autohandlers both provide a way to exert control over a
large set of URLs. However, each specializes in a very different
application. The key difference is that dhandlers are invoked only
when no appropriate component exists, while autohandlers are invoked
only in conjunction with a matching component.
As a rule of thumb: use an autohandler when you have a set of
components to handle your pages and you want to augment them
with a template/filter. Use a dhandler when you want to create a set
of "virtual URLs" that don't correspond to any actual components,
or to provide default behavior for a directory.
dhandlers and autohandlers can even be used in the same directory. For
example, you might have a mix of real URLs and virtual URLs to which
you would like to apply a common template/filter.
=head1 PASSING PARAMETERS
This section describes Mason's facilities for passing parameters to
components (either from HTTP requests or component calls) and for
accessing parameter values inside components.
=head2 In Component Calls
Any Perl data type can be passed in a component call:
<& /sales/header, s => 'dog', l => [2, 3, 4], h => {a => 7, b => 8} &>
This command passes a scalar ($s), a list (@l), and a hash (%h). The
list and hash must be passed as references, but they will be automatically
dereferenced in the called component.
=head2 In HTTP requests
Consider a CGI-style URL with a query string:
http://www.foo.com/mktg/prods.html?str=dog&lst=2&lst=3&lst=4
or an HTTP request with some POST content. Mason automatically parses
the GET/POST values and makes them available to the component as
parameters.
=head2 Accessing Parameters
Component parameters, whether they come from GET/POST or another
component, can be accessed in two ways.
1. Declared named arguments: Components can define an
C<< <%args> >> section listing argument names, types, and
default values. For example:
<%args>
$a
@b # a comment
%c
# another comment
$d => 5
$e => $d*2
@f => ('foo', 'baz')
%g => (joe => 1, bob => 2)
%args>
Here, I<$a>, I<@b>, and I<%c> are required arguments; the component generates
an error if the caller leaves them unspecified. I<$d>, I<$e>, I<@f> and I<%g> are
optional arguments; they are assigned the specified default values if
unspecified. All the arguments are available as lexically scoped ("my")
variables in the rest of the component.
Arguments are separated by one or more newlines. Comments may be used at
the end of a line or on their own line.
Default expressions are evaluated in top-to-bottom order, and one
expression may reference an earlier one (as $e references $d above).
Only valid Perl variable names may be used in C<< <%args> >>
sections. Parameters with non-valid variable names cannot be
pre-declared and must be fetched manually out of the %ARGS hash (see
below). One common example of undeclarable parameters are the
"button.x/button.y" parameters sent for a form submit.
2. %ARGS hash: This variable, always available, contains all of the
parameters passed to the component (whether or not they were
declared). It is especially handy for dealing with large numbers of
parameters, dynamically named parameters, or parameters with non-valid
variable names. %ARGS can be used with or without an
C<< <%args> >> section, and its contents are unrelated to what you
have declared in C<< <%args> >>.
Here's how to pass all of a component's parameters to another component:
<& template, %ARGS &>
=head2 Parameter Passing Examples
The following examples illustrate the different ways to pass and receive parameters.
1. Passing a scalar I with value 5.
In a URL: /my/URL?id=5
In a component call: <& /my/comp, id => 5 &>
In the called component, if there is a declared argument named...
$id, then $id will equal 5
@id, then @id will equal (5)
%id, then an error occurs
In addition, $ARGS{id} will equal 5.
2. Passing a list I with values red, blue, and green.
In a URL: /my/URL?colors=red&colors=blue&colors=green
In an component call: <& /my/comp, colors => ['red', 'blue', 'green'] &>
In the called component, if there is a declared argument named...
$colors, then $colors will equal ['red', 'blue', 'green']
@colors, then @colors will equal ('red', 'blue', 'green')
%colors, then an error occurs
In addition, $ARGS{colors} will equal ['red', 'blue', 'green'].
3. Passing a hash I with pairs Alice => 92 and Bob => 87.
In a URL: /my/URL?grades=Alice&grades=92&grades=Bob&grades=87
In an component call: <& /my/comp, grades => {Alice => 92, Bob => 87} &>
In the called component, if there is a declared argument named...
@grades, then @grades will equal ('Alice', 92, 'Bob', 87)
%grades, then %grades will equal (Alice => 92, Bob => 87)
In addition, $grade and $ARGS{grades} will equal
['Alice',92,'Bob',87] in the URL case, or {Alice => 92, Bob => 87}
in the component call case. (The discrepancy exists because, in a
query string, there is no detectable difference between a list or
hash.)
=head2 Using @_ instead
If you don't like named parameters, you can pass a traditional
list of ordered parameters:
<& /mktg/prods.html', 'dog', [2, 3, 4], {a => 7, b => 8} &>
and access them as usual through Perl's @_ array:
my ($scalar, $listref, $hashref) = @_;
In this case no C<< <%args> >> section is necessary.
We generally recommend named parameters for the benefits of
readability, syntax checking, and default value automation. However
using C<@_> may be convenient for very small components, especially
subcomponents created with C<< <%def> >>.
Before Mason 1.21, @_ contained I of the caller's arguments.
In Mason 1.21 and beyond, this unnecessary copying was eliminated and
@_ now contains I to the caller's arguments, just as with
regular Perl subroutines. For example, if a component updates $_[0],
the corresponding argument is updated (or an error occurs if it is not
updatable).
Most users won't notice this change because C<< <%args> >> variables
and the C<%ARGS> hash always contain copies of arguments.
See perlsub for more information on @_ aliasing.
=head1 INITIALIZATION AND CLEANUP
The following sections contain blocks of Perl to execute at specific
times.
=head2 <%init>
This section contains initialization code that executes as soon as the
component is called. For example: checking that a user is logged in;
selecting rows from a database into a list; parsing the contents of a
file into a data structure.
Technically an C<< <%init> >> block is equivalent to a C<< <%perl> >>
block at the beginning of the component. However, there is an
aesthetic advantage of placing this block at the end of the component
rather than the beginning.
We've found that the most readable components (especially for
non-programmers) contain HTML in one continuous block at the top, with
simple substitutions for dynamic elements but no distracting blocks of
Perl code. At the bottom an C<< <%init> >> block sets up the substitution
variables. This organization allows non-programmers to work with the
HTML without getting distracted or discouraged by Perl code. For example:
<% $headline %>
<% $headline %>
By <% $author %>, <% $date %>
<% $body %>
<%init>
# Fetch article from database
my $dbh = DBI::connect ...;
my $sth = $dbh->prepare("select * from articles where id = ?");
$sth->execute($article_id);
my ($headline, $date, $author, $body) = $sth->fetchrow_array;
# Massage the fields
$headline = uc($headline);
my ($year, $month, $day) = split('-', $date);
$date = "$month/$day";
%init>
<%args>
$article_id
%args>
=head2 <%cleanup>
This section contains cleanup code that executes just before the
component exits. For example: closing a database connection or closing
a file handle.
A << <%cleanup> >> block is equivalent to a C<< <%perl> >> block at
the end of the component. This means it will NOT execute if the
component explicitly returns, or if an abort or error occurs in that
component or one of its children. Because of this limitation, and
because Perl is usually so good about cleaning up at the end of a
lexical scope (e.g. component), C<< <%cleanup> >> sections are rarely
needed.
If you need code that is guaranteed to run when the component or
request exits, consider using a mod_perl cleanup handler, or creating
a custom class with a DESTROY method.
=head2 <%once>
This code executes once when the component is loaded. Variables
declared in this section can be seen in all of a component's code and
persist for the lifetime of the component.
This section is useful for declaring persistent component-scoped
lexical variables (especially objects that are expensive to create),
declaring subroutines (both named and anonymous), and initializing
state.
This code does not run inside a request context. You cannot call
components or access C<$m> or C<$r> from this section. Also, do not
attempt to C from a C<< <%once> >> section; the current
compiler cannot properly handle it.
Normally this code will execute individually from every HTTP child
that uses the component. However, if the component is preloaded, this
code will only execute once in the parent. Unless you have total control
over what components will be preloaded, it is safest to avoid
initializing variables that can't survive a fork(), e.g. DBI handles.
Use code like this to initialize such variables in the C<< <%init> >>
section:
<%once>
my $dbh; # declare but don't assign
...
%once>
<%init>
$dbh ||= DBI::connect ...
...
%init>
In addition, using C<$m> or <$r> in this section will not work in a
preloaded component, because neither of those variable exist when a
component is preloaded.
=head2 <%shared>
As with C<< <%once> >>, lexical (C) variables declared in this
section can be seen in all the rest of a component's code: the main
body, subcomponents, and methods. However, unlike C<< <%once> >>, the
code runs once per request (whenever the component is used) and its
variables last only until the end of the request.
A C<< <%shared> >> section is useful for initializing variables needed in, say, the
main body and one more subcomponents or methods. See L for an example of usage.
It's important to realize that you do not have access to the C<%ARGS>
hash or variables created via an C<< <%args> >> block inside a shared
section. However, you can access arguments via
L<$m-Erequest_args|HTML::Mason::Request/item_request_args>.
Additionally, you cannot call a components' own methods or
subcomponents from inside a C<< <%shared> >>, though you can call
other components.
Avoid using C<< <%shared> >> for side-effect code that needs to run at a
predictable time during page generation. You may assume only that
C<< <%shared> >> runs just before the first code that needs it and runs at
most once per request.
In the current implementation, the scope sharing is done with
closures, so variables will only be shared if they are visible at
compile-time in the other parts of the component. In addition, you
can't rely on the specific destruction time of the shared variables,
because they may not be destroyed until the first time the C<<
<%shared> >> section executes in a future request. C<< <%init> >>
offers a more predictable execution and destruction time.
Currently any component with a C<< <%shared> >> section incurs an
extra performance penalty, because Mason must recreate its
anonymous subroutines the first time each new request uses the
component. The exact penalty varies between systems and for most
applications will be unnoticeable. However, one should avoid using
C<< <%shared> >> when patently unnecessary, e.g. when an C<< <%init> >>
would work just as well.
Do not attempt to C from a C<< <%shared> >> section; the
current compiler cannot properly handle it.
=head1 EMBEDDED COMPONENTS
=head2 <%def I>
Each instance of this section creates a I embedded
inside the current component. Inside you may place anything that a
regular component contains, with the exception of C<< <%def> >>, C<< <%method> >>,
C<< <%once> >>, and C<< <%shared> >> tags.
The I consists of characters in the set C<[\w._-]>. To
call a subcomponent simply use its name in <& &> or C<< $m->comp >>. A
subcomponent can only be seen from the surrounding component.
If you define a subcomponent with the same name as a file-based
component in the current directory, the subcomponent takes
precedence. You would need to use an absolute path to call the
file-based component. To avoid this situation and for general clarity,
we recommend that you pick a unique way to name all of your
subcomponents that is unlikely to interfere with file-based
components. A commonly accepted practice is to start subcomponent
names with ".".
While inside a subcomponent, you may use absolute or relative paths to
call file-based components and also call any of your "sibling"
subcomponents.
The lexical scope of a subcomponent is separate from the main
component. However a subcomponent can declare its own C<< <%args> >> section
and have relevant values passed in. You can also use a C<< <%shared> >>
section to declare variables visible from both scopes.
In the following example, we create a ".link" subcomponent to produce a
standardized hyperlink:
<%def .link>
<% $label %>
<%args>
$site
$label=>ucfirst($site)
%args>
%def>
Visit these sites:
<& .link, site=>'yahoo' &>
<& .link, site=>'cmp', label=>'CMP Media' &>
<& .link, site=>'excite' &>
=head2 <%method I>
Each instance of this section creates a I embedded inside
the current component. Methods resemble subcomponents in terms of
naming, contents, and scope. However, while subcomponents can only be
seen from the parent component, methods are meant to be called from
other components.
There are two ways to call a method. First, via a path of the form
"comp:method":
<& /foo/bar:method1 &>
$m->comp('/foo/bar:method1');
Second, via the call_method component method:
my $comp = $m->fetch_comp('/foo/bar');
...
$comp->call_method('method1');
Methods are commonly used in conjunction with autohandlers to make
templates more flexible. See L for more information.
You cannot create a subcomponent and method with the same name.
This is mostly to prevent obfuscation and accidental errors.
=head1 FLAGS AND ATTRIBUTES
The C<< <%flags> >> and C<< <%attr> >> sections consist of key/value
pairs, one per line, joined by '=>'. In each pair, the key must be
any valid Perl "bareword identifier" (made of letters, numbers, and
the underscore character), and the value may be any scalar value,
including references. An optional comment may follow each line.
=head2 <%flags>
Use this section to set official Mason flags that affect the current
component's behavior.
Currently there is only one flag, C, which specifies the
component's I in the form of a relative or absolute component
path. A component inherits methods and attributes from its parent; see
L for
examples.
<%flags>
inherit=>'/site_handler'
%flags>
=head2 <%attr>
Use this section to assign static key/value attributes that can be
queried from other components.
<%attr>
color => 'blue'
fonts => [qw(arial geneva helvetica)]
%attr>
To query an attribute of a component, use the C method:
my $color = $comp->attr('color')
where $comp is a component object.
Mason evaluates attribute values once when loading the component.
This makes them faster but less flexible than methods.
=head1 FILTERING
This section describes several ways to apply filtering functions over
the results of the current component. By separating out and hiding a
filter that, say, changes HTML in a complex way, we allow
non-programmers to work in a cleaner HTML environment.
=head2 <%filter> section
The C<< <%filter> >> section allows you to arbitrarily filter the output of
the current component. Upon entry to this code, C<$_> contains the
component output, and you are expected to modify it in place. The code
has access to component arguments and can invoke subroutines, call
other components, etc.
This simple filter converts the component output to UPPERCASE:
<%filter>
tr/a-z/A-Z/
%filter>
The following navigation bar uses a filter to "unlink" and highlight
the item corresponding to the current page:
Home | Products |
Background | Financials |
Tech Support | Contact Us
<%filter>
my $uri = $r->uri;
s{(.*?)} {$1}i;
%filter>
This allows a designer to code such a navigation bar intuitively
without C statements surrounding each link! Note that the regular
expression need not be very robust as long as you have control over what
will appear in the body.
A filter block does not have access to variables declared in a
component's C<< <%init> >> section, though variables declared in the
C<< <%args> >>, C<< <%once> >> or C<< <%shared> >> blocks are
usable in a filter.
It should be noted that a filter cannot rely on receiving all of a
component's output at once, and so may be called multiple times with
different chunks of output. This can happen if autoflush is on, or if
a filter-containing component, or the components it calls, call the
C<< $m->flush_buffer() >> method.
You should never call Perl's C function inside a filter
section, or you will not see any output at all.
You can use L if you want to filter specific parts of a component rather
than the entire component.
=head1 COMMENT MARKERS
There are several ways to place comments in components, i.e. arbitrary
text that is ignored by the parser.
=head2 <%doc>
Text in this section is treated as a comment and ignored. Most useful
for a component's main documentation. One can easily write a program
to sift through a set of components and pull out their C<< <%doc> >>
blocks to form a reference page.
=head2 <% # comment... %>
A C<< <% %> >> tag is considered a comment if all of its lines are
either whitespace, or begin with a '#' optionally preceded by
whitespace. For example,
<% # This is a single-line comment %>
<%
# This is a
# multi-line comment
%>
=head2 %# comment
Because a line beginning with C<%> is treated as Perl, C<%#>
automatically works as a comment. However we prefer the C<< <% #
comment %> >> form over C<< %# >>, because it stands out a little more
as a comment and because it is more flexible with regards to preceding
whitespace.
=head2 % if (0) { }
Anything between these two lines
% if (0) {
...
% }
will be skipped by Mason, including component calls. While we don't
recomend this for comments per se, it is a useful notation for
"commenting out" code that you don't want to run.
=head2 HTML/XML/... comments
HTML and other markup languages will have their own comment markers, for example
C<< >>. Note two important differences with these comments versus the
above comments:
=over
=item *
They will be sent to the client and appear in the source of the page.
=item *
They do not block component calls and other code from running,
so don't try to use them to comment out code!
=back
=head1 OTHER SYNTAX
=head2 <%text>
Text in this section is printed as-is with all Mason syntax ignored.
This is useful, for example, when documenting Mason itself from a
component:
<%text>
% This is an example of a Perl line.
<% This is an example of an expression block. %>
%text>
This works for almost everything, but doesn't let you output
C<< %text> >> itself! When all else fails, use C<< $m->print >>:
% $m->print('The tags are <%text> and %text>.');
=head2 Escaping expressions
Mason has facilities for I the output from C<< <% %> >>
tags, on either a site-wide or a per-expression basis.
Any C<< <% %> >> expression may be terminated by a '|' and one or more
escape flags (plus arbitrary whitespace), separated by commas:
<% $file_data |h %>
The current valid flags are:
=over
=item * h
Escape HTML ('<' => '<', etc.) using C.
Before Perl 5.8.0 this module assumes that text is in the ISO-8859-1
character set; see L for how to override this escaping. After 5.8.0, the encoding
assumes that text is in Unicode.
=item * u
Escape a URL query string (':' => '%3A', etc.) - all but
[a-zA-Z0-9_.-]
=item * n
This is a special flag indicating that the default escape flags should
I be used for this substitution.
=back
The administrator may specify a set of default escape flags via the
L parameter. For example, if the administrator
sets L to C<['h']>, then all <% %> expressions
will automatically be HTML-escaped. In this case you would use the
C flag to turn off HTML-escaping for a specific expression:
<% $html_block |n %>
Multiple escapes can be specified as a comma-separated list:
<% $uri | u, n %>
The old pre-defined escapes, 'h', 'u', and 'n', can be used I
commas, so that this is legal:
<% $uri | un %>
However, this only works for these three escapes, and no others. If
you are using user-defined escapes as well, you I use a comma:
<% $uri | u, add_session %>
=head3 User-defined Escapes
Besides the default escapes mentioned above, it is possible for the
user to define their own escapes or to override the built-in 'h' and
'u' escapes.
This is done via the Interp object's L parameter or
L method. Escape
names may be any number of characters as long as it matches the regex
C^[\w-]+$/>. The one exception is that you cannot override the 'n'
flag.
Each escape flag is associated with a subroutine reference. The
subroutine should expect to receive a scalar reference, which should
be manipulated in place. Any return value from this subroutine is
ignored.
Escapes can be defined at any time but using an escape that is not
defined will cause an error when executing that component.
A common use for this feature is to override the built-in HTML
escaping, which will not work with non-ISO-8559-1 encodings. If
you are using such an encoding and want to switch the 'h' flag to do
escape just the minimal set of characters (C>, C>, C<&>,
C<">), put this in your Apache configuration:
PerlSetVar MasonEscapeFlags "h => \&HTML::Mason::Escapes::basic_html_escape"
Or, in a top-level autohandler:
$m->interp->set_escape( h => \&HTML::Mason::Escapes::basic_html_escape );
Or you could write your own escape function for a particular encoding:
$ah->interp->set_escape( h => \&my_html_escape );
And of course this can be used for all sorts of other things, like a
naughty words filter for the easily offended:
$interp->set_escape( 'no-naughty' => \&remove_naughty_words );
=head3 Manually applying escapes
You can manually apply one or more escapes to text using the L
method|HTML::Mason::Interp/item_apply_escapes>. e.g.
$m->interp->apply_escapes( 'some html content', 'h' );
=head2 Backslash at end of line
A backslash (\) at the end of a line suppresses the newline. In HTML
components, this is mostly useful for fixed width areas like C<<
>>
tags, since browsers ignore white space for the most part. An example:
foo
% if (1) {
bar
% }
baz
outputs
foo
bar
baz
because of the newlines on lines 2 and 4. (Lines 3 and 5 do not
generate a newline because the entire line is taken by Perl.)
To suppress the newlines:
foo\
% if (1) {
bar\
% }
baz
which prints
foobarbaz
=head1 DATA CACHING
Mason's data caching interface allows components to cache the results
of computation for improved performance. Anything may be cached, from
a block of HTML to a complex data structure.
Each component gets its own private, persistent data cache. Except
under special circumstances, one component does not access another
component's cache. Each cached value may be set to expire at a certain
time.
Data caching is implemented on top of DeWitt Clinton's C
package. Mason implements its own extended subclass of Dewitt's module
called L.
See that module's documentation for a companion API reference to this
section.
=head2 Basic Usage
The C<< $m->cache >> method returns an
L