First: Get a URL
- Go to http://web.archive.org/
- Enter the old domain into the search bar. [need to include screenshot]
- Click on an old snapshot of the website. Then, copy the URL of the snapshot. [need to include GIF]. The URL should look something like this – http://web.archive.org/web/20060118051114/http://www.example.com/.
Second: Configure Screaming Frog
Open Screaming Frog.
Set the mode to Spider. [need to include gif]
Open configuration settings of Screaming Frog.
In the “Include” setting, enter the web.archive.org snapshot URL. [explain what this does]
- Modify the URL so that it uses regex. [explain what the regex does] Example: http://web.archive.org/web/(.*)/http://www.example.com/(.*).
Tell Screaming Frog to crawl outside the current folder.
Uncheck the options to crawl images, css, js, and swf.
Third: Start the Crawl
Paste the web.archive.org snapshot URL into Screaming Frogs address bar.
Start the crawl.