#!/usr/bin/perl -w
#
# dbsort.pm
# Copyright (C) 1991-2018 by John Heidemann <johnh@isi.edu>
#
# This program is distributed under terms of the GNU general
# public license, version 2. See the file COPYING
# in $dblibdir for details.
#
=head1 NAME
dbsort - sort rows based on the the specified columns
=head1 SYNOPSIS
dbsort [-M MemLimit] [-T TemporaryDirectory] [-nNrR] column [column...]
=head1 DESCRIPTION
Sort all input rows as specified by the numeric or lexical columns.
Dbsort consumes a fixed amount of memory regardless of input size.
(It reverts to temporary files on disk if necessary, based on the -M
and -T options.)
The sort should be stable, but this has not yet been verified.
For large inputs (those that spill to disk),
L<dbsort> will do some of the merging in parallel, if possible.
The B<--parallel> option can control the degree of parallelism,
if desired.
=head1 OPTIONS
General option:
=over 4
=item B<-M MaxMemBytes>
Specify an approximate limit on memory usage (in bytes).
Larger values allow faster sorting because more operations
happen in-memory, provided you have enough memory.
=item B<-T TmpDir>
where to put tmp files.
Also uses environment variable TMPDIR, if -T is
not specified.
Default is /tmp.
=item B<--parallelism N> or B<-j N>
Allow up to N merges to happen in parallel.
Default is the number of CPUs in the machine.
=back
Sort specification options (can be interspersed with column names):
=over 4
=item B<-r> or B<--descending>
sort in reverse order (high to low)
=item B<-R> or B<--ascending>
sort in normal order (low to high)
=item B<-t> or B<--type-inferred-sorting>
sort fields by type (numeric or leicographic), automatically
=item B<-T> or B<--no-type-inferred-sorting>
sort fields only as specified based on C<-n> or C<-N>
=item B<-n> or B<--numeric>
sort numerically
=item B<-N> or B<--lexical>
sort lexicographically
=back
=for comment
begin_standard_fsdb_options
This module also supports the standard fsdb options:
=over 4
=item B<-d>
Enable debugging output.
=item B<-i> or B<--input> InputSource
Read from InputSource, typically a file name, or C<-> for standard input,
or (if in Perl) a IO::Handle, Fsdb::IO or Fsdb::BoundedQueue objects.
=item B<-o> or B<--output> OutputDestination
Write to OutputDestination, typically a file name, or C<-> for standard output,
or (if in Perl) a IO::Handle, Fsdb::IO or Fsdb::BoundedQueue objects.
=item B<--autorun> or B<--noautorun>
By default, programs process automatically,
but Fsdb::Filter objects in Perl do not run until you invoke
the run() method.
The C<--(no)autorun> option controls that behavior within Perl.
=item B<--header H>
Use H as the full Fsdb header, rather than reading a header from
then input.
=item B<--help>
Show help.
=item B<--man>
Show full manual.
=back
=for comment
end_standard_fsdb_options
=head1 SAMPLE USAGE
=head2 Input:
#fsdb cid cname
10 pascal
11 numanal
12 os
=head2 Command:
cat data.fsdb | dbsort cname
=head2 Output:
#fsdb cid cname
11 numanal
12 os
10 pascal
# | dbsort cname
=head1 SEE ALSO
L<dbmerge(1)>,
L<dbmapreduce(1)>,
L<Fsdb(3)>
=cut
# WARNING: This code is derived from dbsort.pm; that is the master copy.
use Fsdb::Filter::dbsort;
my $f = new Fsdb::Filter::dbsort(@ARGV);
$f->setup_run_finish; # or could just --autorun
exit 0;
=head1 AUTHOR and COPYRIGHT
Copyright (C) 1991-2018 by John Heidemann <johnh@isi.edu>
This program is distributed under terms of the GNU general
public license, version 2. See the file COPYING
with the distribution for details.
=cut
1;