<?xml version="1.0"?>
<rss version="2.0">
	<channel>
		<title>Seeking a tool that downloads all files from an internet directory</title>
		<link>http://www.allegro.cc/forums/view/585600</link>
		<description>Allegro.cc Forum Thread</description>
		<webMaster>matthew@allegro.cc (Matthew Leverton)</webMaster>
		<lastBuildDate>Fri, 26 May 2006 14:48:36 +0000</lastBuildDate>
	</channel>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>Hi! I have found a specific site (<a href="http://fun.barnal.de/videos">http://fun.barnal.de/videos</a>) and want to download all files from this page to view them offline. Question is: how?</p><p>I tried it with HTTrack, but I can&#39;t resume the download after I have stopped it.</p><p>Do you know a tool (must be available for Linux) that could do this for me?
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (Michael Faerber)</author>
		<pubDate>Thu, 25 May 2006 03:40:25 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>Wget.
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (gnolam)</author>
		<pubDate>Thu, 25 May 2006 04:24:24 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>If you use Firefox you could try the &quot;Download them all&quot; extension. I&#39;m not sure about the name, but it&#39;s popular so you&#39;ll find it.
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (kentl)</author>
		<pubDate>Thu, 25 May 2006 04:45:07 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>downTHEMall is the name of the extension... at least the one I have.
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (BAF)</author>
		<pubDate>Thu, 25 May 2006 06:07:49 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><div class="quote_container"><div class="title">man wget said:</div><div class="quote"><p>
</p><pre>o   Retrieve the first two levels of wuarchive.wustl.edu, saving them to /tmp.

           wget -r -l2 -P/tmp <a href="ftp://wuarchive.wustl.edu/">ftp://wuarchive.wustl.edu/</a></pre><p>
</p></div></div><p>
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (Kitty Cat)</author>
		<pubDate>Thu, 25 May 2006 07:57:59 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>wget or httrack
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (ReyBrujo)</author>
		<pubDate>Thu, 25 May 2006 08:02:07 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>I used to use httrack. It was teh awesome.
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (miran)</author>
		<pubDate>Thu, 25 May 2006 10:09:08 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>The problem with wget is that I cannot resume the download after I have stopped it. HTTrack seemed to offer that option, but it seemed too random: sometimes it worked, sometimes it started redownloading the whole page again.</p><p>DownThemAll however seems to work fine! So, if nobody proposes a better program, I will use this.
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (Michael Faerber)</author>
		<pubDate>Thu, 25 May 2006 15:36:16 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>-c resumes partially downloaded files. -N makes sure only new files get downloaded. What&#39;s the problem?
</p></div>]]>
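A minimal sketch of how the -c flag mentioned above might be combined with a recursive download of the directory from the opening post. The function name resume_mirror_cmd and the target directory "videos" are assumptions for illustration, not anything posted in the thread. The helper only prints the command so it can be inspected before being run by hand.

```shell
#!/bin/sh
# Hypothetical helper (the name is an assumption): prints, but does not run,
# a wget command line that resumes an interrupted recursive download.
resume_mirror_cmd() {
    url=$1
    dest=${2:-.}   # save into the current directory unless told otherwise
    # -r   retrieve recursively
    # -np  never ascend to the parent directory
    # -c   continue partially downloaded files
    # -P   directory prefix to save files under
    # Note: arguments containing spaces would need extra quoting.
    printf 'wget -r -np -c -P %s %s\n' "$dest" "$url"
}

resume_mirror_cmd "http://fun.barnal.de/videos/" "videos"
```

Re-running the printed command after an interruption should let wget pick up partial files where it left off, provided the server supports ranged requests. Adding -N, as suggested above, would additionally skip files that are not newer than the local copies.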
		</description>
		<author>no-reply@allegro.cc (gnolam)</author>
		<pubDate>Thu, 25 May 2006 15:38:21 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>I second wget.
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (Evert)</author>
		<pubDate>Thu, 25 May 2006 16:45:53 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>&quot;DownThemAll!&quot; is really quite nice for in-Firefox use.
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (Mars)</author>
		<pubDate>Thu, 25 May 2006 16:51:20 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>Hey, gnolam, you really helped me with your &quot;-c&quot; option. I suppose I have to read the man pages more often. <img src="http://www.allegro.cc/forums/smileys/smiley.gif" alt=":)" /></p><p>So I&#39;ll use wget now! Thanks for your help!
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (Michael Faerber)</author>
		<pubDate>Thu, 25 May 2006 17:24:42 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>Now if only someone made a frontend for wget that would make it actually behave like it should. Wget is another fine example of opensource at its prime: Needlessly complicated, poorly documented and it doesn&#39;t work like it&#39;s supposed to.
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (jhuuskon)</author>
		<pubDate>Fri, 26 May 2006 13:59:02 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><div class="quote_container"><div class="title">Quote:</div><div class="quote"><p>
Now if only someone made a frontend for wget that would make it actually behave like it should. Wget is another fine example of opensource at its prime: Needlessly complicated, poorly documented and it doesn&#39;t work like it&#39;s supposed to.
</p></div></div><p>
Care to elaborate?<br />I&#39;ve used it without any problems for weeks before even looking at the manpage (which I needed only when I wanted to make a local copy of a website). The manpage itself is detailed and lists all options very clearly.
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (Evert)</author>
		<pubDate>Fri, 26 May 2006 14:14:55 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>I tried numerous times to download an image gallery (an HTML page that links to the JPEGs). However, it only downloads the index page and stops, regardless of the recursion options specified. Another funky thing: even when I tell wget to retain only downloaded JPEGs, it keeps the index.</p><p>The help file (yes, I&#39;ve tried it in Windows) lists all options all right, but the explanations are arbitrary at best and the examples, while demonstrating the flexibility of Wget well, are totally useless from a practical point of view.
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (jhuuskon)</author>
		<pubDate>Fri, 26 May 2006 14:32:31 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><div class="quote_container"><div class="title">man wget said:</div><div class="quote"><p>
</p><pre>o   You have a file that contains the URLs you want to download?  Use the -i
    switch:

           wget -i &lt;file&gt;</pre><p>
</p></div></div><p>
</p><div class="quote_container"><div class="title">man wget also said:</div><div class="quote"><p>
</p><pre>-F
--force-html
    When input is read from a file, force it to be treated as an HTML file.  This
    enables you to retrieve relative links from existing HTML files on your local
    disk, by adding &quot;&lt;base href=&quot;url&quot;&gt;&quot; to HTML, or using the --base command-line
    option.

-B URL
--base=URL
    Prepends URL to relative links read from the file specified with the -i option.</pre><p>
</p></div></div><p>
If the images are all the same extension and in the same directory on the site:
</p><div class="quote_container"><div class="title">Quote:</div><div class="quote"><p>
</p><pre>o   You want to download all the GIFs from a directory on an HTTP server.  You
    tried wget <a href="http://www.server.com/dir/*.gif">http://www.server.com/dir/*.gif</a>, but that didn&#39;t work because HTTP
    retrieval does not support globbing.  In that case, use:

            wget -r -l1 --no-parent -A.gif <a href="http://www.server.com/dir/">http://www.server.com/dir/</a>

    More verbose, but the effect is the same.  -r -l1 means to retrieve
    recursively, with maximum depth of 1.  --no-parent means that references to
    the parent directory are ignored, and -A.gif means to download only the GIF
    files.  -A &quot;*.gif&quot; would have worked too.</pre><p>
</p></div></div><p>
That method won&#39;t work if the site&#39;s robots.txt disallows the directory, though, because wget obeys robots.txt when retrieving recursively.
</p></div>]]>
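As a rough sketch of the -i and -B route quoted above, the gallery page could be saved locally and its image links pulled out into a list for wget. The helper name list_jpg_links is hypothetical, and the sketch assumes the links are written as href="..." with double quotes and that grep supports -o (GNU and BSD grep do). The server URL is the placeholder from the man page excerpt.

```shell
#!/bin/sh
# Hypothetical helper (the name is an assumption): print the .jpg links
# found in a saved HTML page, one per line.
# Assumes double-quoted href attributes and a grep that supports -o.
list_jpg_links() {
    page=$1
    grep -o 'href="[^"]*\.jpg"' "$page" | sed -e 's/^href="//' -e 's/"$//'
}

# Usage (not run here): feed the list to wget on standard input.
# -i - reads URLs from stdin, -B supplies the base for relative links,
# and -c lets an interrupted run be resumed.
#   list_jpg_links index.html | wget -c -B http://www.server.com/dir/ -i -
```

This sidesteps recursion entirely, so it may help in cases where -r stops at the index page, at the cost of having to fetch and save the index first.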
		</description>
		<author>no-reply@allegro.cc (Kitty Cat)</author>
		<pubDate>Fri, 26 May 2006 14:46:02 +0000</pubDate>
	</item>
	<item>
		<description><![CDATA[<div class="mockup v2"><p>Didn&#39;t you think I tried that? It just didn&#39;t work. I even forged the user agent and told it to ignore robots.txt, but to no avail.
</p></div>]]>
		</description>
		<author>no-reply@allegro.cc (jhuuskon)</author>
		<pubDate>Fri, 26 May 2006 14:48:36 +0000</pubDate>
	</item>
</rss>
