=head1 NAME Statistics::Basic::LeastSquareFit - find the least square fit for two lists =head1 SYNOPSIS A machine to calculate the Least Square Fit of given vectors x and y. The module returns the alpha and beta filling this formula: $y = $beta * $x + $alpha for a given set of I and I co-ordinate pairs. Say you have the set of Cartesian coordinates: my @points = ( [1,1], [2,2], [3,3], [4,4] ); The simplest way to find the LSF is as follows: my $lsf = lsf()->set_size(int @points); $lsf->insert(@$_) for @points; Or this way: my $xv = vector( map {$_->[0]} @points ); my $yv = vector( map {$_->[1]} @points ); my $lsf = lsf($xv, $yv); And then either query the values or print them like so: print "The LSF for $xv and $yv: $lsf\n"; my ($yint, $slope) = my ($alpha, $beta) = $lsf->query; L is meant for finding a line of best fit. C<$beta> is the slope of the line and C<$alpha> is the y-offset. Suppose you want to draw the line. Use these to calculate the C for a given C or vice versa: my $y = $lsf->y_given_x( 7 ); my $x = $lsf->x_given_y( 7 ); (Note that C can sometimes produce a divide-by-zero error since it has to divide by the C<$beta>.) Create a 20 point "moving" L like so: use Statistics::Basic qw(:all nofill); my $sth = $dbh->prepare("select x,y from points where something"); my $len = 20; my $lsf = lsf()->set_size($len); $sth->execute or die $dbh->errstr; $sth->bind_columns( my ($x, $y) ) or die $dbh->errstr; my $count = $len; while( $sth->fetch ) { $lsf->insert( $x, $y ); if( defined( my ($yint, $slope) = $lsf->query ) { print "LSF: y= $slope*x + $yint\n"; } # This would also work: # print "$lsf\n" if $lsf->query_filled; } =head1 METHODS This list of methods skips the methods inherited from L (things like L, and L). =over 4 =item B Create a new L object. This function takes two arguments -- which can either be arrayrefs or L objects. This function is called when the L shortcut-function is called. =item B L is meant for finding a line of best fit. C<$beta> is the slope of the line and C<$alpha> is the y-offset. my ($alpha, $beta) = $lsf->query; =item B Automatically calculate the y-value on the line for a given x-value. my $y = $lsf->y_given_x( 7 ); =item B Automatically calculate the x-value on the line for a given y-value. my $x = $lsf->x_given_y( 7 ); C can sometimes produce a divide-by-zero error since it has to divide by the C<$beta>. This might be helpful: if( defined( my $x = eval { $lsf->x_given_y(7) } ) ) { warn "there is no x value for 7"; } else { print "x (given y=7): $x\n"; } =item B Return the L for the first vector used in the computation of alpha and beta. =item B Return the L object for the second vector used in the computation of alpha and beta. =item B Returns the L object for the first vector used in the computation of alpha and beta. =item B Returns the L object for the first vector used in the computation of alpha and beta. =item B Returns the L object used in the computation of alpha and beta. =back =head1 OVERLOADS This object is overloaded. It tries to return an appropriate string for the calculation, but raises an error in numeric context. In boolean context, this object is always true (even when empty). =head1 AUTHOR Paul Miller C<< >> =head1 COPYRIGHT Copyright 2009 Paul Miller -- Licensed under the LGPL =head1 SEE ALSO perl(1), L, L, L =cut