sustaining/oi_151a/sfw-gate: usr/src/cmd/bzip2/bzip2.1.sunman@87960ed158f9 (annotated)

0 b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	1	'\" t
11 87960ed158f9 Import sfw build 137 Cyril Plisko <cyril.plisko@grigale.com> parents: 0 diff changeset	2	.\" ident "@(#)bzip2.1.sunman 1.7 10/03/16 SMI"
0 b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	3	.\"
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	4	.\" modified to reference existing Solaris man pages, and to add note
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	5	.\" about source availability ([email protected])
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	6	.\"
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	7	.PU
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	8	.TH bzip2 1
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	9	.SH NAME
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	10	bzip2, bunzip2 \- a block-sorting file compressor, v1.0.5
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	11	.br
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	12	bzcat \- decompresses files to stdout
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	13	.br
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	14	bzip2recover \- recovers data from damaged bzip2 files
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	15
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	16	.SH SYNOPSIS
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	17	.ll +8
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	18	.B bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	19	.RB [ " \-cdfkqstvzVL123456789 " ]
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	20	[
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	21	.I "filenames \&..."
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	22	]
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	23	.ll -8
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	24	.br
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	25	.B bunzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	26	.RB [ " \-fkvsVL " ]
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	27	[
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	28	.I "filenames \&..."
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	29	]
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	30	.br
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	31	.B bzcat
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	32	.RB [ " \-s " ]
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	33	[
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	34	.I "filenames \&..."
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	35	]
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	36	.br
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	37	.B bzip2recover
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	38	.I "filename"
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	39
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	40	.SH DESCRIPTION
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	41	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	42	compresses files using the Burrows-Wheeler block sorting
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	43	text compression algorithm, and Huffman coding. Compression is
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	44	generally considerably better than that achieved by more conventional
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	45	LZ77/LZ78-based compressors, and approaches the performance of the PPM
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	46	family of statistical compressors.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	47
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	48	The command-line options are deliberately very similar to
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	49	those of
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	50	.I GNU gzip,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	51	but they are not identical.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	52
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	53	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	54	expects a list of file names to accompany the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	55	command-line flags. Each file is replaced by a compressed version of
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	56	itself, with the name "original_name.bz2".
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	57	Each compressed file
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	58	has the same modification date, permissions, and, when possible,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	59	ownership as the corresponding original, so that these properties can
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	60	be correctly restored at decompression time. File name handling is
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	61	naive in the sense that there is no mechanism for preserving original
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	62	file names, permissions, ownerships or dates in filesystems which lack
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	63	these concepts, or have serious file name length restrictions, such as
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	64	MS-DOS.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	65
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	66	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	67	and
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	68	.I bunzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	69	will by default not overwrite existing
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	70	files. If you want this to happen, specify the \-f flag.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	71
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	72	If no file names are specified,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	73	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	74	compresses from standard
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	75	input to standard output. In this case,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	76	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	77	will decline to
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	78	write compressed output to a terminal, as this would be entirely
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	79	incomprehensible and therefore pointless.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	80
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	81	.I bunzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	82	(or
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	83	.I bzip2 \-d)
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	84	decompresses all
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	85	specified files. Files which were not created by
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	86	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	87	will be detected and ignored, and a warning issued.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	88	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	89	attempts to guess the filename for the decompressed file
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	90	from that of the compressed file as follows:
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	91
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	92	       filename.bz2    becomes   filename
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	93	       filename.bz     becomes   filename
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	94	       filename.tbz2   becomes   filename.tar
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	95	       filename.tbz    becomes   filename.tar
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	96	       anyothername    becomes   anyothername.out
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	97
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	98	If the file does not end in one of the recognised endings,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	99	.I .bz2,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	100	.I .bz,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	101	.I .tbz2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	102	or
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	103	.I .tbz,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	104	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	105	complains that it cannot
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	106	guess the name of the original file, and uses the original name
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	107	with
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	108	.I .out
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	109	appended.
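
The naming rules above can be sketched as a small lookup. This is an illustration only, not the code bzip2 itself uses; the function name guess_output_name is hypothetical.

```python
def guess_output_name(compressed_name):
    """Return the decompressed file name implied by the rules listed above."""
    # Recognised endings and what replaces each of them.
    rules = [
        (".bz2", ""),
        (".bz", ""),
        (".tbz2", ".tar"),
        (".tbz", ".tar"),
    ]
    for suffix, replacement in rules:
        if compressed_name.endswith(suffix):
            return compressed_name[: -len(suffix)] + replacement
    # Unrecognised ending: keep the whole name and append ".out".
    return compressed_name + ".out"


for name in ("data.bz2", "data.bz", "backup.tbz2", "backup.tbz", "notes"):
    print(name, "becomes", guess_output_name(name))
```

The final fallback corresponds to the case in which bzip2 complains that it cannot guess the original name.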
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	110
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	111	As with compression, supplying no
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	112	filenames causes decompression from
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	113	standard input to standard output.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	114
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	115	.I bunzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	116	will correctly decompress a file which is the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	117	concatenation of two or more compressed files. The result is the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	118	concatenation of the corresponding uncompressed files. Integrity
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	119	testing (\-t)
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	120	of concatenated
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	121	compressed files is also supported.
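
The concatenation property can be checked without the command-line tools. The sketch below assumes Python 3.3 or later, whose standard bz2 module accepts multi-stream input; it demonstrates the file-format behaviour rather than bunzip2 itself.

```python
import bz2

first = b"alpha\n" * 100
second = b"beta\n" * 100

# Two independently compressed streams, joined byte for byte,
# as if two .bz2 files had been concatenated.
joined = bz2.compress(first) + bz2.compress(second)

# Decompressing the joined stream yields the joined originals.
restored = bz2.decompress(joined)
print(restored == first + second)
```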
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	122
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	123	You can also compress or decompress files to the standard output by
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	124	giving the \-c flag. Multiple files may be compressed and
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	125	decompressed like this. The resulting outputs are fed sequentially to
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	126	stdout. Compression of multiple files
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	127	in this manner generates a stream
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	128	containing multiple compressed file representations. Such a stream
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	129	can be decompressed correctly only by
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	130	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	131	version 0.9.0 or
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	132	later. Earlier versions of
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	133	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	134	will stop after decompressing
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	135	the first file in the stream.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	136
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	137	.I bzcat
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	138	(or
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	139	.I bzip2 \-dc)
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	140	decompresses all specified files to
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	141	the standard output.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	142
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	143	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	144	will read arguments from the environment variables
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	145	.I BZIP2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	146	and
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	147	.I BZIP,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	148	in that order, and will process them
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	149	before any arguments read from the command line. This gives a
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	150	convenient way to supply default arguments.
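
The ordering described above can be modelled as follows. This is a simplified sketch: the helper name effective_arguments is hypothetical, and splitting the variables on whitespace is an assumption about how their contents are tokenised.

```python
def effective_arguments(environ, command_line):
    """Return arguments in processing order: BZIP2, then BZIP, then the command line."""
    args = []
    for name in ("BZIP2", "BZIP"):
        # Assumption: each variable holds whitespace-separated arguments.
        args.extend(environ.get(name, "").split())
    return args + list(command_line)


env = {"BZIP": "-k", "BZIP2": "-9"}
print(effective_arguments(env, ["-v", "file.txt"]))
```

Because the environment is processed first, defaults placed in BZIP2 or BZIP come before anything typed on the command line.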
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	151
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	152	Compression is always performed, even if the compressed
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	153	file is slightly
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	154	larger than the original. Files of less than about one hundred bytes
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	155	tend to get larger, since the compression mechanism has a constant
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	156	overhead in the region of 50 bytes. Random data (including the output
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	157	of most file compressors) is coded at about 8.05 bits per byte, giving
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	158	an expansion of around 0.5%.
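
Both effects are easy to observe with Python's standard bz2 module, which uses the same compressed format. This is an illustrative measurement, not a benchmark: the sample inputs are arbitrary, and exact sizes may differ from the figures quoted above.

```python
import bz2
import random

# A very small input: the fixed overhead dominates, so the output is larger.
tiny = b"hello"
tiny_compressed = bz2.compress(tiny)
print(len(tiny), "bytes in,", len(tiny_compressed), "bytes out")

# Pseudo-random bytes stand in for incompressible data.
rng = random.Random(0)
noise = bytes(rng.getrandbits(8) for _ in range(200_000))
noise_compressed = bz2.compress(noise)
ratio = len(noise_compressed) / len(noise)
print(f"output/input size ratio for random data: {ratio:.4f}")
```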
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	159
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	160	As a self-check for your protection,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	161	.I
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	162	bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	163	uses 32-bit CRCs to
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	164	make sure that the decompressed version of a file is identical to the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	165	original. This guards against corruption of the compressed data, and
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	166	against undetected bugs in
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	167	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	168	(hopefully very unlikely). The
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	169	chance of data corruption going undetected is microscopic, about one
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	170	chance in four billion for each file processed. Be aware, though, that
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	171	the check occurs upon decompression, so it can only tell you that
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	172	something is wrong. It can't help you
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	173	recover the original uncompressed
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	174	data. You can use
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	175	.I bzip2recover
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	176	to try to recover data from
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	177	damaged files.
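
The CRC check can be seen in action by damaging a compressed stream on purpose. The sketch below uses Python's standard bz2 module and relies on one assumption about the stream layout, stated in the comments: the first block's stored CRC occupies bytes 10 to 13.

```python
import bz2

original = b"The quick brown fox jumps over the lazy dog.\n" * 50
compressed = bytearray(bz2.compress(original))

# Assumed layout: a 4-byte "BZh<level>" header, a 6-byte block marker,
# then the block's stored 32-bit CRC at offsets 10..13.
compressed[10] ^= 0x01  # flip one bit of the stored CRC

try:
    bz2.decompress(bytes(compressed))
    detected = False
except (OSError, ValueError):
    # The decompressor reports the mismatch; it cannot repair the data.
    detected = True

print("corruption detected:", detected)
```

As the text says, the check only reports that something is wrong; recovering usable data from a damaged file is the job of bzip2recover.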
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	178
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	179	Return values: 0 for a normal exit, 1 for environmental problems (file
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	180	not found, invalid flags, I/O errors, etc.), 2 to indicate a corrupt
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	181	compressed file, 3 for an internal consistency error (e.g., a bug) which
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	182	caused
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	183	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	184	to panic.
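
A script that wraps bzip2 might translate these return values into messages along the following lines. The table simply restates the paragraph above; the names EXIT_MEANINGS and describe_exit are hypothetical.

```python
# Return values as documented above.
EXIT_MEANINGS = {
    0: "normal exit",
    1: "environmental problem (file not found, invalid flags, I/O errors)",
    2: "corrupt compressed file",
    3: "internal consistency error",
}


def describe_exit(code):
    """Return a readable description for a bzip2 return value."""
    return EXIT_MEANINGS.get(code, "undocumented return value")


for code in (0, 1, 2, 3):
    print(code, describe_exit(code))
```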
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	185
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	186	.SH OPTIONS
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	187	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	188	.B \-c --stdout
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	189	Compress or decompress to standard output.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	190	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	191	.B \-d --decompress
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	192	Force decompression.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	193	.I bzip2,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	194	.I bunzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	195	and
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	196	.I bzcat
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	197	are
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	198	really the same program, and the decision about what actions to take is
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	199	made on the basis of which name is used. This flag overrides that
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	200	mechanism, and forces
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	201	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	202	to decompress.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	203	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	204	.B \-z --compress
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	205	The complement to \-d: forces compression, regardless of the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	206	invocation name.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	207	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	208	.B \-t --test
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	209	Check integrity of the specified file(s), but don't decompress them.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	210	This really performs a trial decompression and throws away the result.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	211	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	212	.B \-f --force
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	213	Force overwrite of output files. Normally,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	214	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	215	will not overwrite
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	216	existing output files. Also forces
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	217	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	218	to break hard links
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	219	to files, which it otherwise wouldn't do.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	220
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	221	bzip2 normally declines to decompress files which don't have the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	222	correct magic header bytes. If forced (\-f), however, it will pass
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	223	such files through unmodified. This is how GNU gzip behaves.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	224	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	225	.B \-k --keep
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	226	Keep (don't delete) input files during compression
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	227	or decompression.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	228	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	229	.B \-s --small
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	230	Reduce memory usage, for compression, decompression and testing. Files
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	231	are decompressed and tested using a modified algorithm which only
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	232	requires 2.5 bytes per block byte. This means any file can be
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	233	decompressed in 2300k of memory, albeit at about half the normal speed.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	234
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	235	During compression, \-s selects a block size of 200k, which limits
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	236	memory use to around the same figure, at the expense of your compression
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	237	ratio. In short, if your machine is low on memory (8 megabytes or
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	238	less), use \-s for everything. See MEMORY MANAGEMENT below.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	239	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	240	.B \-q --quiet
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	241	Suppress non-essential warning messages. Messages pertaining to
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	242	I/O errors and other critical events will not be suppressed.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	243	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	244	.B \-v --verbose
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	245	Verbose mode -- show the compression ratio for each file processed.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	246	Further \-v's increase the verbosity level, spewing out lots of
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	247	information which is primarily of interest for diagnostic purposes.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	248	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	249	.B \-L --license -V --version
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	250	Display the software version, license terms and conditions.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	251	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	252	.B \-1 (or \-\-fast) to \-9 (or \-\-best)
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	253	Set the block size to 100 k, 200 k .. 900 k when compressing. Has no
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	254	effect when decompressing. See MEMORY MANAGEMENT below.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	255	The \-\-fast and \-\-best aliases are primarily for GNU gzip
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	256	compatibility. In particular, \-\-fast doesn't make things
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	257	significantly faster.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	258	And \-\-best merely selects the default behaviour.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	259	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	260	.B \--
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	261	Treats all subsequent arguments as file names, even if they start
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	262	with a dash. This is so you can handle files with names beginning
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	263	with a dash, for example: bzip2 \-- \-myfilename.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	264	.TP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	265	.B \--repetitive-fast --repetitive-best
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	266	These flags are redundant in versions 0.9.5 and above. They provided
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	267	some coarse control over the behaviour of the sorting algorithm in
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	268	earlier versions, which was sometimes useful. 0.9.5 and above have an
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	269	improved algorithm which renders these flags irrelevant.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	270
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	271	.SH MEMORY MANAGEMENT
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	272	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	273	compresses large files in blocks. The block size affects
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	274	both the compression ratio achieved, and the amount of memory needed for
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	275	compression and decompression. The flags \-1 through \-9
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	276	specify the block size to be 100,000 bytes through 900,000 bytes (the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	277	default) respectively. At decompression time, the block size used for
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	278	compression is read from the header of the compressed file, and
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	279	.I bunzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	280	then allocates itself just enough memory to decompress
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	281	the file. Since block sizes are stored in compressed files, it follows
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	282	that the flags \-1 to \-9 are irrelevant to and so ignored
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	283	during decompression.
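
The point above can be illustrated with a short sketch. It uses Python's standard bz2 module, which is an assumption of this example and not part of the bzip2 command-line tools, to show that the block size digit travels in the stream header, so decompression takes no level argument. The helper name header_level is hypothetical.

```python
import bz2

def header_level(compressed: bytes) -> int:
    """Return the block-size digit (1-9) stored in a .bz2 stream header."""
    # A stream starts with the bytes "BZh" followed by one ASCII digit
    # giving the block size in units of 100,000 bytes.
    if compressed[:3] != b"BZh":
        raise ValueError("not a bzip2 stream")
    return compressed[3] - ord("0")

data = b"example data " * 1000
for level in (1, 5, 9):
    packed = bz2.compress(data, compresslevel=level)
    # Decompression takes no level argument; it reads the header instead.
    assert bz2.decompress(packed) == data
    print(level, header_level(packed))
```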
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	284
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	285	Compression and decompression requirements,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	286	in bytes, can be estimated as:
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	287
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	288	       Compression:   400k + ( 8 x block size )
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	289
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	290	       Decompression: 100k + ( 4 x block size ), or
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	291	                      100k + ( 2.5 x block size ) with \-s
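
The estimates above can be written out as a minimal sketch. The function names are hypothetical, and the code assumes that "k" means 1000 bytes, which is consistent with the 100,000 to 900,000 byte block sizes quoted earlier. These are estimates only, not measurements of any particular build.

```python
K = 1000  # assumption: the page's "k" figures mean 1000 bytes

def block_size(level: int) -> int:
    """Block size in bytes for flags -1 .. -9."""
    if not 1 <= level <= 9:
        raise ValueError("level must be between 1 and 9")
    return level * 100 * K

def compress_mem(level: int) -> int:
    """Estimated bytes needed to compress: 400k + 8 x block size."""
    return 400 * K + 8 * block_size(level)

def decompress_mem(level: int, small: bool = False) -> int:
    """Estimated bytes needed to decompress.

    100k + 4 x block size normally, 100k + 2.5 x block size with -s.
    """
    if small:
        return 100 * K + (5 * block_size(level)) // 2
    return 100 * K + 4 * block_size(level)

for level in (1, 9):
    print(level,
          compress_mem(level) // K,
          decompress_mem(level) // K,
          decompress_mem(level, small=True) // K)
```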
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	292
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	293	Larger block sizes give rapidly diminishing marginal returns. Most of
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	294	the compression comes from the first two or three hundred k of block
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	295	size, a fact worth bearing in mind when using
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	296	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	297	on small machines.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	298	It is also important to appreciate that the decompression memory
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	299	requirement is set at compression time by the choice of block size.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	300
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	301	For files compressed with the default 900k block size,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	302	.I bunzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	303	will require about 3700 kbytes to decompress. To support decompression
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	304	of any file on a 4 megabyte machine,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	305	.I bunzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	306	has an option to
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	307	decompress using approximately half this amount of memory, about 2300
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	308	kbytes. Decompression speed is also halved, so you should use this
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	309	option only where necessary. The relevant flag is -s.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	310
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	311	In general, try to use the largest block size memory constraints allow,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	312	since that maximises the compression achieved. Compression and
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	313	decompression speed are virtually unaffected by block size.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	314
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	315	Another significant point applies to files which fit in a single block
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	316	-- that means most files you'd encounter using a large block size. The
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	317	amount of real memory touched is proportional to the size of the file,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	318	since the file is smaller than a block. For example, compressing a file
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	319	20,000 bytes long with the flag -9 will cause the compressor to
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	320	allocate around 7600k of memory, but only touch 400k + 20000 * 8 = 560
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	321	kbytes of it. Similarly, the decompressor will allocate 3700k but only
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	322	touch 100k + 20000 * 4 = 180 kbytes.
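
The worked figures above can be reproduced with a small sketch. The function names are hypothetical, "k" is assumed to mean 1000 bytes, and capping the touched data at one block with min() is this example's reading of the paragraph, not a statement about the implementation.

```python
K = 1000  # assumption: "k" means 1000 bytes, as the worked figures suggest

def touched_when_compressing(file_size: int, level: int = 9) -> int:
    """Approximate bytes actually touched while compressing one file."""
    block = level * 100 * K
    # Only the part of the block that the file fills is ever used.
    return 400 * K + 8 * min(file_size, block)

def touched_when_decompressing(file_size: int, level: int = 9) -> int:
    """Approximate bytes actually touched while decompressing one file."""
    block = level * 100 * K
    return 100 * K + 4 * min(file_size, block)

print(touched_when_compressing(20_000) // K)    # 560
print(touched_when_decompressing(20_000) // K)  # 180
```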
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	323
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	324	Here is a table which summarises the maximum memory usage for different
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	325	block sizes. Also recorded is the total compressed size for 14 files of
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	326	the Calgary Text Compression Corpus totalling 3,141,622 bytes. This
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	327	column gives some feel for how compression varies with block size.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	328	These figures tend to understate the advantage of larger block sizes for
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	329	larger files, since the Corpus is dominated by smaller files.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	330
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	331	           Compress   Decompress   Decompress   Corpus
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	332	    Flag     usage      usage       -s usage     Size
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	333
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	334	     -1      1200k       500k         350k      914704
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	335	     -2      2000k       900k         600k      877703
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	336	     -3      2800k      1300k         850k      860338
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	337	     -4      3600k      1700k        1100k      846899
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	338	     -5      4400k      2100k        1350k      845160
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	339	     -6      5200k      2500k        1600k      838626
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	340	     -7      6100k      2900k        1850k      834096
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	341	     -8      6800k      3300k        2100k      828642
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	342	     -9      7600k      3700k        2350k      828642
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	343
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	344	.SH RECOVERING DATA FROM DAMAGED FILES
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	345	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	346	compresses files in blocks, usually 900 kbytes long. Each
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	347	block is handled independently. If a media or transmission error causes
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	348	a multi-block .bz2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	349	file to become damaged, it may be possible to
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	350	recover data from the undamaged blocks in the file.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	351
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	352	The compressed representation of each block is delimited by a 48-bit
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	353	pattern, which makes it possible to find the block boundaries with
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	354	reasonable certainty. Each block also carries its own 32-bit CRC, so
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	355	damaged blocks can be distinguished from undamaged ones.
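
To make the boundary search concrete, here is a rough sketch that scans a compressed stream bit by bit for the 48-bit block pattern. It is not the code used by bzip2recover. It assumes Python's standard bz2 module to produce test input and relies on the pattern value 0x314159265359 used by the bzip2 format. The function name find_block_starts is hypothetical. A chance match inside compressed data is possible in principle, which is why the text above speaks only of reasonable certainty.

```python
import bz2
import os

BLOCK_MAGIC = 0x314159265359  # 48-bit pattern that starts each compressed block
MASK = (1 << 48) - 1

def find_block_starts(compressed: bytes) -> list:
    """Return the bit offsets at which the 48-bit block pattern begins."""
    starts = []
    window = 0
    bit_index = 0
    for byte in compressed:
        for shift in range(7, -1, -1):
            # Shift in one bit at a time, most significant bit first,
            # because blocks need not begin on a byte boundary.
            window = ((window << 1) | ((byte >> shift) & 1)) & MASK
            bit_index += 1
            if bit_index >= 48 and window == BLOCK_MAGIC:
                starts.append(bit_index - 48)
    return starts

# Incompressible input at the smallest block size spans more than one block.
packed = bz2.compress(os.urandom(150_000), compresslevel=1)
print(find_block_starts(packed))
```

The first offset reported is 32, because the first block follows the four-byte stream header directly; later blocks usually start at offsets that are not multiples of eight.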
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	356
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	357	.I bzip2recover
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	358	is a simple program whose purpose is to search for
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	359	blocks in .bz2 files, and write each block out into its own .bz2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	360	file. You can then use
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	361	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	362	\-t
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	363	to test the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	364	integrity of the resulting files, and decompress those which are
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	365	undamaged.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	366
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	367	.I bzip2recover
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	368	takes a single argument, the name of the damaged file,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	369	and writes a number of files "rec00001file.bz2",
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	370	"rec00002file.bz2", etc., containing the extracted blocks.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	371	The output filenames are designed so that the use of
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	372	wildcards in subsequent processing -- for example,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	373	"bzip2 -dc rec*file.bz2 > recovered_data" -- processes the files in
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	374	the correct order.
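
The ordering property described above can be sketched as follows. The example assumes Python's standard bz2, glob, os and tempfile modules, and the helper join_recovered is hypothetical. It writes stand-in files named in the rec00001file.bz2 style and does no damage checking, so in real use each file would first be tested and damaged ones left out.

```python
import bz2
import glob
import os
import tempfile

def join_recovered(directory: str, pattern: str = "rec*file.bz2") -> bytes:
    """Decompress recovered block files in name order and join the results."""
    # sorted() orders the zero-padded names lexicographically, which for
    # equal-width numbers is also numeric order.
    paths = sorted(glob.glob(os.path.join(directory, pattern)))
    parts = []
    for path in paths:
        with open(path, "rb") as handle:
            parts.append(bz2.decompress(handle.read()))
    return b"".join(parts)

# Demonstration with stand-in files written in scrambled order.
with tempfile.TemporaryDirectory() as tmp:
    for number, text in [(10, b"three"), (2, b"two "), (1, b"one ")]:
        name = os.path.join(tmp, "rec%05dfile.bz2" % number)
        with open(name, "wb") as handle:
            handle.write(bz2.compress(text))
    print(join_recovered(tmp))  # b'one two three'
```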
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	375
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	376	.I bzip2recover
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	377	should be of most use dealing with large .bz2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	378	files, as these will contain many blocks. It is clearly
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	379	futile to use it on damaged single-block files, since a
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	380	damaged block cannot be recovered. If you wish to minimise
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	381	any potential data loss through media or transmission errors,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	382	you might consider compressing with a smaller
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	383	block size.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	384
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	385	.SH PERFORMANCE NOTES
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	386	The sorting phase of compression gathers together similar strings in the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	387	file. Because of this, files containing very long runs of repeated
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	388	symbols, like "aabaabaabaab ..." (repeated several hundred times) may
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	389	compress more slowly than normal. Versions 0.9.5 and above fare much
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	390	better than previous versions in this respect. The ratio between
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	391	worst-case and average-case compression time is in the region of 10:1.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	392	For previous versions, this figure was more like 100:1. You can use the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	393	\-vvvv option to monitor progress in great detail, if you want.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	394
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	395	Decompression speed is unaffected by these phenomena.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	396
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	397	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	398	usually allocates several megabytes of memory to operate
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	399	in, and then charges all over it in a fairly random fashion. This means
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	400	that performance, both for compressing and decompressing, is largely
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	401	determined by the speed at which your machine can service cache misses.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	402	Because of this, small changes to the code to reduce the miss rate have
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	403	been observed to give disproportionately large performance improvements.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	404	I imagine
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	405	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	406	will perform best on machines with very large caches.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	407
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	408	.SH CAVEATS
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	409	I/O error messages are not as helpful as they could be.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	410	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	411	tries hard to detect I/O errors and exit cleanly, but the details of
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	412	what the problem is sometimes seem rather misleading.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	413
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	414	This manual page pertains to version 1.0.5 of
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	415	.I bzip2.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	416	Compressed data created by this version is entirely forwards and
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	417	backwards compatible with the previous public releases, versions
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	418	0.1pl2, 0.9.0, 0.9.5, 1.0.0, 1.0.1, 1.0.2, 1.0.3 and 1.0.4 but with the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	419	following exception: 0.9.0 and above can correctly decompress multiple
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	420	concatenated compressed files. 0.1pl2 cannot do this; it will stop
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	421	after decompressing just the first file in the stream.
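
The concatenation behaviour can be seen in a minimal sketch. It uses Python's standard bz2 module (version 3.3 or later) as a stand-in decoder, which is an assumption of this example; it does not exercise the bzip2 program itself.

```python
import bz2

first = bz2.compress(b"first file\n")
second = bz2.compress(b"second file\n")

# Two complete .bz2 streams placed back to back, as "cat a.bz2 b.bz2" would.
combined = first + second

# A decoder that handles concatenation returns both payloads in order.
print(bz2.decompress(combined))  # b'first file\nsecond file\n'
```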
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	422
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	423	.I bzip2recover
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	424	versions prior to 1.0.2 used 32-bit integers to represent
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	425	bit positions in compressed files, so they could not handle compressed
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	426	files more than 512 megabytes long. Versions 1.0.2 and above use
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	427	64-bit ints on some platforms which support them (GNU supported
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	428	targets, and Windows). To establish whether or not bzip2recover was
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	429	built with such a limitation, run it without arguments. In any event
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	430	you can build yourself an unlimited version if you can recompile it
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	431	with MaybeUInt64 set to be an unsigned 64-bit integer.
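
The 512 megabyte figure follows from simple arithmetic, sketched below. The function name is hypothetical, and the example assumes an unsigned counter and that "megabyte" here means 2**20 bytes.

```python
def max_file_bytes(position_bits: int) -> int:
    """Largest file whose every bit position fits in an unsigned integer."""
    # An n-bit unsigned integer can number 2**n bit positions,
    # and eight bit positions make one byte.
    return (1 << position_bits) // 8

MIB = 1 << 20
print(max_file_bytes(32) // MIB)  # 512
```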
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	432
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	433
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	434
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	435	.SH AUTHOR
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	436	Julian Seward, jseward@bzip.org.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	437
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	438	http://www.bzip.org
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	439
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	440	The ideas embodied in
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	441	.I bzip2
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	442	are due to (at least) the following
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	443	people: Michael Burrows and David Wheeler (for the block sorting
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	444	transformation), David Wheeler (again, for the Huffman coder), Peter
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	445	Fenwick (for the structured coding model in the original
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	446	.I bzip,
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	447	and many refinements), and Alistair Moffat, Radford Neal and Ian Witten
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	448	(for the arithmetic coder in the original
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	449	.I bzip).
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	450	I am much
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	451	indebted for their help, support and advice. See the manual in the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	452	source distribution for pointers to sources of documentation. Christian
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	453	von Roques encouraged me to look for faster sorting algorithms, so as to
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	454	speed up compression. Bela Lubkin encouraged me to improve the
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	455	worst-case compression performance.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	456	Donna Robinson XMLised the documentation.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	457	The bz* scripts are derived from those of GNU gzip.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	458	Many people sent patches, helped
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	459	with portability problems, lent machines, gave advice and were generally
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	460	helpful.
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	461	.SH ATTRIBUTES
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	462	See
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	463	.BR attributes (5)
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	464	for descriptions of the following attributes:
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	465	.sp
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	466	.TS
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	467	box;
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	468	cbp-1 \| cbp-1
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	469	l \| l .
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	470	ATTRIBUTE TYPE ATTRIBUTE VALUE
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	471	=
11 87960ed158f9 Import sfw build 137 Cyril Plisko <cyril.plisko@grigale.com> parents: 0 diff changeset	472	Availability compress/bzip2
0 b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	473	=
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	474	Interface Stability Committed
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	475	.TE
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	476	.PP
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	477	.SH NOTES
b34509ac961f Import sfw repo b126 Cyril Plisko <cyril.plisko@grigale.com> parents: diff changeset	478	Source for bzip2 is available on http://opensolaris.org.

author	Cyril Plisko <cyril.plisko@grigale.com>
	Tue, 06 Apr 2010 16:00:14 +0300
changeset 11	87960ed158f9
parent 0	b34509ac961f
child 48	b3a54d4b169c
permissions	-rw-r--r--