--- wget.1-orig 2006-12-20 01:32:53.000000000 -0500 +++ wget.1 2006-12-20 01:34:16.000000000 -0500 @@ -171,7 +171,7 @@ arguments. Thus you may write: .PP .Vb 1 -\& wget -r --tries=10 http://fly.srk.fer.hr/ -o log +\& wget -r --tries=10 http:\&//fly.srk.fer.hr/ -o log .Ve .PP The space between the option accepting an argument and the argument may @@ -825,7 +825,7 @@ .PD The opposite of \fB\-nd\fR\-\-\-create a hierarchy of directories, even if one would not have been created otherwise. E.g. \fBwget \-x -http://fly.srk.fer.hr/robots.txt\fR will save the downloaded file to +http:\&//fly.srk.fer.hr/robots.txt\fR will save the downloaded file to \&\fIfly.srk.fer.hr/robots.txt\fR. .IP "\fB\-nH\fR" 4 .IX Item "-nH" @@ -834,13 +834,13 @@ .IX Item "--no-host-directories" .PD Disable generation of host-prefixed directories. By default, invoking -Wget with \fB\-r http://fly.srk.fer.hr/\fR will create a structure of +Wget with \fB\-r http:\&//fly.srk.fer.hr/\fR will create a structure of directories beginning with \fIfly.srk.fer.hr/\fR. This option disables such behavior. .IP "\fB\-\-protocol\-directories\fR" 4 .IX Item "--protocol-directories" Use the protocol name as a directory component of local file names. For -example, with this option, \fBwget \-r http://\fR\fIhost\fR will save to +example, with this option, \fBwget \-r http:\&//\fR\fIhost\fR will save to \&\fBhttp/\fR\fIhost\fR\fB/...\fR rather than just to \fIhost\fR\fB/...\fR. .IP "\fB\-\-cut\-dirs=\fR\fInumber\fR" 4 .IX Item "--cut-dirs=number" @@ -899,7 +899,7 @@ you're mirroring a remote site that uses \fB.asp\fR pages, but you want the mirrored pages to be viewable on your stock Apache server. Another good use for this is when you're downloading CGI-generated materials. A \s-1URL\s0 -like \fBhttp://site.com/article.cgi?25\fR will be saved as +like \fBhttp:\&//site.com/article.cgi?25\fR will be saved as \&\fIarticle.cgi?25.html\fR. .Sp Note that filenames changed in this way will be re-downloaded every time @@ -1042,7 +1042,7 @@ .Vb 3 \& wget --header='Accept-Charset: iso-8859-2' \e \& --header='Accept-Language: hr' \e -\& http://fly.srk.fer.hr/ +\& http:\&//fly.srk.fer.hr/ .Ve .Sp Specification of an empty string as the header value will clear all @@ -1053,7 +1053,7 @@ localhost, but to specify \fBfoo.bar\fR in the \f(CW\*(C`Host\*(C'\fR header: .Sp .Vb 1 -\& wget --header="Host: foo.bar" http://localhost/ +\& wget --header="Host: foo.bar" http:\&//localhost/ .Ve .Sp In versions of Wget prior to 1.10 such use of \fB\-\-header\fR caused @@ -1142,13 +1142,13 @@ \& # Log in to the server. This can be done only once. \& wget --save-cookies cookies.txt \e \& --post-data 'user=foo&password=bar' \e -\& http://server.com/auth.php +\& http:\&//server.com/auth.php .Ve .Sp .Vb 3 \& # Now grab the page or pages we care about. \& wget --load-cookies cookies.txt \e -\& -p http://server.com/interesting/article.php +\& -p http:\&//server.com/interesting/article.php .Ve .Sp If the server is using session cookies to track user authentication, @@ -1394,7 +1394,7 @@ pages through a proxy, e.g.: .Sp .Vb 1 -\& wget -r -nd --delete-after http://whatever.com/~popular/page/ +\& wget -r -nd --delete-after http:\&//whatever.com/~popular/page/ .Ve .Sp The \fB\-r\fR option is to retrieve recursively, and \fB\-nd\fR to not @@ -1434,7 +1434,7 @@ Example: if the downloaded file \fI/foo/doc.html\fR links to \&\fI/bar/img.gif\fR (or to \fI../bar/img.gif\fR), then the link in \&\fIdoc.html\fR will be modified to point to -\&\fIhttp://\fIhostname\fI/bar/img.gif\fR. +\&\fIhttp:\&//\fIhostname\fI/bar/img.gif\fR. .RE .RS 4 .Sp @@ -1493,7 +1493,7 @@ If one executes the command: .Sp .Vb 1 -\& wget -r -l 2 http:///1.html +\& wget -r -l 2 http:\&///1.html .Ve .Sp then \fI1.html\fR, \fI1.gif\fR, \fI2.html\fR, \fI2.gif\fR, and @@ -1503,21 +1503,21 @@ where to stop the recursion. However, with this command: .Sp .Vb 1 -\& wget -r -l 2 -p http:///1.html +\& wget -r -l 2 -p http:\&///1.html .Ve .Sp all the above files \fIand\fR \fI3.html\fR's requisite \fI3.gif\fR will be downloaded. Similarly, .Sp .Vb 1 -\& wget -r -l 1 -p http:///1.html +\& wget -r -l 1 -p http:\&///1.html .Ve .Sp will cause \fI1.html\fR, \fI1.gif\fR, \fI2.html\fR, and \fI2.gif\fR to be downloaded. One might think that: .Sp .Vb 1 -\& wget -r -l 0 -p http:///1.html +\& wget -r -l 0 -p http:\&///1.html .Ve .Sp would download just \fI1.html\fR and \fI1.gif\fR, but unfortunately @@ -1528,7 +1528,7 @@ \&\fB\-r\fR and \fB\-l\fR: .Sp .Vb 1 -\& wget -p http:///1.html +\& wget -p http:\&///1.html .Ve .Sp Note that Wget will behave as if \fB\-r\fR had been specified, but only @@ -1539,7 +1539,7 @@ likes to use a few options in addition to \fB\-p\fR: .Sp .Vb 1 -\& wget -E -H -k -K -p http:/// +\& wget -E -H -k -K -p http:\&/// .Ve .Sp To finish off this topic, it's worth knowing that Wget's idea of an @@ -1620,7 +1620,7 @@ and its requisites, using a command-line like: .Sp .Vb 1 -\& wget --ignore-tags=a,area -H -k -K -r http:/// +\& wget --ignore-tags=a,area -H -k -K -r http:\&/// .Ve .Sp However, the author of this option came across a page with tags like @@ -1681,7 +1681,7 @@ Say you want to download a \s-1URL\s0. Just type: .Sp .Vb 1 -\& wget http://fly.srk.fer.hr/ +\& wget http:\&//fly.srk.fer.hr/ .Ve .IP "\(bu" 4 But what will happen if the connection is slow, and the file is lengthy? @@ -1692,7 +1692,7 @@ insure that the whole file will arrive safely: .Sp .Vb 1 -\& wget --tries=45 http://fly.srk.fer.hr/jpg/flyweb.jpg +\& wget --tries=45 http:\&//fly.srk.fer.hr/jpg/flyweb.jpg .Ve .IP "\(bu" 4 Now let's leave Wget to work in the background, and write its progress @@ -1700,7 +1700,7 @@ shall use \fB\-t\fR. .Sp .Vb 1 -\& wget -t 45 -o log http://fly.srk.fer.hr/jpg/flyweb.jpg & +\& wget -t 45 -o log http:\&//fly.srk.fer.hr/jpg/flyweb.jpg & .Ve .Sp The ampersand at the end of the line makes sure that Wget works in the @@ -1738,14 +1738,14 @@ document, saving the log of the activities to \fIgnulog\fR: .Sp .Vb 1 -\& wget -r http://www.gnu.org/ -o gnulog +\& wget -r http:\&//www.gnu.org/ -o gnulog .Ve .IP "\(bu" 4 The same as the above, but convert the links in the \s-1HTML\s0 files to point to local files, so you can view the documents off\-line: .Sp .Vb 1 -\& wget --convert-links -r http://www.gnu.org/ -o gnulog +\& wget --convert-links -r http:\&//www.gnu.org/ -o gnulog .Ve .IP "\(bu" 4 Retrieve only one \s-1HTML\s0 page, but make sure that all the elements needed @@ -1754,7 +1754,7 @@ references the downloaded links. .Sp .Vb 1 -\& wget -p --convert-links http://www.server.com/dir/page.html +\& wget -p --convert-links http:\&//www.server.com/dir/page.html .Ve .Sp The \s-1HTML\s0 page will be saved to \fIwww.server.com/dir/page.html\fR, and @@ -1768,20 +1768,20 @@ .Sp .Vb 2 \& wget -p --convert-links -nH -nd -Pdownload \e -\& http://www.server.com/dir/page.html +\& http:\&//www.server.com/dir/page.html .Ve .IP "\(bu" 4 Retrieve the index.html of \fBwww.lycos.com\fR, showing the original server headers: .Sp .Vb 1 -\& wget -S http://www.lycos.com/ +\& wget -S http:\&//www.lycos.com/ .Ve .IP "\(bu" 4 Save the server headers with the file, perhaps for post\-processing. .Sp .Vb 2 -\& wget --save-headers http://www.lycos.com/ +\& wget --save-headers http:\&//www.lycos.com/ \& more index.html .Ve .IP "\(bu" 4 @@ -1793,12 +1793,12 @@ .Ve .IP "\(bu" 4 You want to download all the GIFs from a directory on an \s-1HTTP\s0 -server. You tried \fBwget http://www.server.com/dir/*.gif\fR, but that +server. You tried \fBwget http:\&//www.server.com/dir/*.gif\fR, but that didn't work because \s-1HTTP\s0 retrieval does not support globbing. In that case, use: .Sp .Vb 1 -\& wget -r -l1 --no-parent -A.gif http://www.server.com/dir/ +\& wget -r -l1 --no-parent -A.gif http:\&//www.server.com/dir/ .Ve .Sp More verbose, but the effect is the same. \fB\-r \-l1\fR means to @@ -1813,7 +1813,7 @@ It would be: .Sp .Vb 1 -\& wget -nc -r http://www.gnu.org/ +\& wget -nc -r http:\&//www.gnu.org/ .Ve .IP "\(bu" 4 If you want to encode your own username and password to \s-1HTTP\s0 or @@ -1831,14 +1831,14 @@ to files? .Sp .Vb 1 -\& wget -O - http://jagor.srce.hr/ http://www.srce.hr/ +\& wget -O - http:\&//jagor.srce.hr/ http:\&//www.srce.hr/ .Ve .Sp You can also combine the two options and make pipelines to retrieve the documents from remote hotlists: .Sp .Vb 1 -\& wget -O - http://cool.list.com/ | wget --force-html -i - +\& wget -O - http:\&//cool.list.com/ | wget --force-html -i - .Ve .Sh "Very Advanced Usage" .IX Subsection "Very Advanced Usage" @@ -1850,7 +1850,7 @@ .Sp .Vb 2 \& crontab -\& 0 0 * * 0 wget --mirror http://www.gnu.org/ -o /home/me/weeklog +\& 0 0 * * 0 wget --mirror http:\&//www.gnu.org/ -o /home/me/weeklog .Ve .IP "\(bu" 4 In addition to the above, you want the links to be converted for local @@ -1861,7 +1861,7 @@ .Sp .Vb 2 \& wget --mirror --convert-links --backup-converted \e -\& http://www.gnu.org/ -o /home/me/weeklog +\& http:\&//www.gnu.org/ -o /home/me/weeklog .Ve .IP "\(bu" 4 But you've also noticed that local viewing doesn't work all that well @@ -1873,13 +1873,13 @@ .Vb 3 \& wget --mirror --convert-links --backup-converted \e \& --html-extension -o /home/me/weeklog \e -\& http://www.gnu.org/ +\& http:\&//www.gnu.org/ .Ve .Sp Or, with less typing: .Sp .Vb 1 -\& wget -m -k -K -E http://www.gnu.org/ -o /home/me/weeklog +\& wget -m -k -K -E http:\&//www.gnu.org/ -o /home/me/weeklog .Ve .SH "FILES" .IX Header "FILES" @@ -1904,7 +1904,7 @@ .IP "2." 4 Try to repeat the bug in as simple circumstances as possible. E.g. if Wget crashes while downloading \fBwget \-rl0 \-kKE \-t5 \-Y0 -http://yoyodyne.com \-o /tmp/log\fR, you should try to see if the crash is +http:\&//yoyodyne.com \-o /tmp/log\fR, you should try to see if the crash is repeatable, and if will occur with a simpler set of options. You might even try to start the download at the page where the crash occurred to see if that page somehow triggered the crash.