Crawling Private Pages of Password Protected Websites
How to Crawl a Password Protected Website

What is a password protected website? Why would a website be password protected? Why would I want to crawl a password protected website? These are common questions, especially if you work in website design, development, or SEO. Let’s take each question one at a time, before looking at a few others as well.

What is Password Protection?

Many websites use password protection as a security measure for information that should not be available to the public at large. Without it, the information would be accessible from any computer; with it, a visitor must supply a password (usually along with an authorized user name) before gaining access.

Some websites are entirely password protected, whereas others are only partially so—meaning there’s a staff or membership area.


Why use Password Protection?

It may seem confusing to come across a password-protected site, especially if the reason is not immediately clear. If you’re a website owner or manager, you may be asking the same question—“Why should I password protect my site?”

After all, password protection prevents ordinary visitors from viewing your site, which restricts traffic. The whole point of having a site up is to get as much traffic as possible, with as much visibility as you can generate, right?

Actually, there are three major reasons why you as a website manager may want—or even need—to password protect your website. If you come across one as a user, you can be sure that it’s one of the three following reasons for the website being password protected:


Why Crawl Password Protected Websites?

Crawling a password protected website may sound illegal—after all, it’s password protected for a reason—but there are a number of fully legitimate reasons for wanting to crawl your protected site.

Let’s take a look at why you would want to crawl your website while it’s under password protection.


Best Practices for Crawling Password Protected Websites

Before we get into the details of how to crawl a password protected website, there are some practices that you want to bear in mind and adhere to.

Let us build it for you

Every DYNO Mapper subscription comes with authentication support.

Submit a support ticket and include the following information.

We will supply you with an import code within 24 to 48 hours (business hours are Monday through Friday, 9 am - 5 pm Eastern Time).

After you have received your import code, add it to DYNO Mapper.

  1. Click Create Project

    [Screenshot: Create Project]


  2. Click CREATE under Create from URL

    [Screenshot: Create from URL]


  3. Click CLOSE WIZARD - you'll need to close the wizard so that you can edit the authentication settings

    [Screenshot: Close Wizard]


  4. Open the Authentication Options section, and click the Advanced Custom System Login icon

    [Screenshot: Add Plugin]


  5. Click IMPORT, add the code that we supplied, and then click IMPORT again

    [Screenshot: Import]

    [Screenshot: Import Code]


That's it. You can now TEST your Custom System Login with your login credentials. Once you have confirmed that authentication succeeds, you can crawl your site: repeat steps 1-3, add your user credentials, and start crawling, as follows.

  1. Click Create Project
  2. Click CREATE under Create from URL
  3. Click CLOSE WIZARD
  4. Add the LOGIN URL

    [Screenshot: Login URL]

  5. Open the Authentication Options section, select the new Custom System Login in the System dropdown, and add your login credentials
  6. START CRAWLING
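For readers curious what an authenticated crawl involves in general, here is a minimal Python sketch using only the standard library. It is not how DYNO Mapper works internally, which this article does not describe. It assumes a simple form-based login that sets a session cookie, and the field names `username` and `password` are placeholders; read the real names from your own login page.

```python
import http.cookiejar
import urllib.parse
import urllib.request


def build_opener_with_cookies():
    """Return an opener that stores cookies, so the session set at login is reused."""
    jar = http.cookiejar.CookieJar()
    opener = urllib.request.build_opener(urllib.request.HTTPCookieProcessor(jar))
    return opener, jar


def build_login_request(login_url, username, password,
                        user_field="username", pass_field="password"):
    """Build (but do not send) the POST request that submits the login form.

    user_field and pass_field are assumed form field names.
    """
    form = urllib.parse.urlencode({user_field: username, pass_field: password})
    return urllib.request.Request(login_url, data=form.encode("utf-8"), method="POST")


def crawl_private_pages(login_url, username, password, private_urls):
    """Log in once, then fetch each private URL with the same cookie session."""
    opener, _jar = build_opener_with_cookies()
    opener.open(build_login_request(login_url, username, password), timeout=30).close()
    pages = {}
    for url in private_urls:
        with opener.open(url, timeout=30) as response:
            pages[url] = response.read()
    return pages
```

Only `crawl_private_pages` touches the network. Sites that use CSRF tokens, JavaScript-driven logins, or multi-step authentication need more than this sketch covers.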


Can I build it myself? Yes

What Do I Need to Know?

1. Learn about CSS selectors and HTML

When building a Custom System Login, you’re going to need some basic knowledge of both CSS and HTML, which are the languages used to describe and target the elements of your login page. We’ll give you a breakdown of the four best sites for learning CSS and HTML to help you get started.

Theoretically, you could ask your developers to set this up for you. However, building it yourself keeps your Custom System Login and its credentials under your own control, which is usually preferable for security. And don’t think you’ll have to enroll yourself in a college or university course to learn how!
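To make the idea of a CSS selector concrete, here is a small Python sketch that reads a login form and prints one simple selector per element. The markup in `LOGIN_HTML` is invented for illustration, and `selector_for` and `FormFieldFinder` are hypothetical helper names; your own login page will have different ids, names, and classes.

```python
from html.parser import HTMLParser

# Hypothetical login page markup, used only for illustration.
LOGIN_HTML = """
<form id="login-form" action="/login" method="post">
  <input type="text" name="username" id="user">
  <input type="password" name="password">
  <button type="submit" class="btn primary">Sign in</button>
</form>
"""


def selector_for(tag, attrs):
    """Return a simple CSS selector, preferring id, then name, then type."""
    if attrs.get("id"):
        return "#" + attrs["id"]
    if attrs.get("name"):
        name = attrs["name"]
        return f'{tag}[name="{name}"]'
    if attrs.get("type"):
        type_ = attrs["type"]
        return f'{tag}[type="{type_}"]'
    return tag


class FormFieldFinder(HTMLParser):
    """Collect a selector for every form, input, and button start tag."""

    def __init__(self):
        super().__init__()
        self.selectors = []

    def handle_starttag(self, tag, attrs):
        if tag in ("form", "input", "button"):
            self.selectors.append(selector_for(tag, dict(attrs)))


finder = FormFieldFinder()
finder.feed(LOGIN_HTML)
for selector in finder.selectors:
    print(selector)
```

An id selector such as `#user` is the most specific choice when the element has an id. Attribute selectors such as `input[name="password"]` are a reasonable fallback when it does not.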

Here are the four best online tutorials and courses, which are available 100% free of charge.


2. Learn how to use a browser inspector tool

You’re going to need a browser inspector tool to find the information in your page’s code that is necessary for building your Custom System Login, such as the elements that make up your login form.

We’ve found the six best browser inspector tools, specifically designed and developed for the most popular browsers in use.


3. Build a Custom System Login

Create a DYNO Mapper account if you do not already have one. Tiered pricing is available based on the page count of your project. After you have logged in to DYNO Mapper, follow these instructions.

  1. Click Create Project
  2. Click CREATE under Create from URL
  3. Click CLOSE WIZARD - you'll need to close the wizard so that you can edit the authentication settings
  4. Open the Authentication Options section, and click the Advanced Custom System Login icon
  5. Click CREATE
  6. Add a Title for your Custom Login
  7. Add each necessary action, then click SAVE & EXIT

    [Screenshot: Actions]
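Before entering actions, it can help to write the login sequence down and check it. The Python sketch below models a login as an ordered list of steps, each tied to a CSS selector. This article does not specify DYNO Mapper's actual action types or import format, so the `fill` and `click` kinds, the `{username}` and `{password}` placeholders, and every name in the code are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoginAction:
    kind: str        # "fill" or "click" (hypothetical action kinds)
    selector: str    # CSS selector that locates the element
    value: str = ""  # text to type; only used by "fill"


def build_login_actions(user_selector, pass_selector, submit_selector):
    """Return the usual three-step sequence: user name, password, submit."""
    return [
        LoginAction("fill", user_selector, "{username}"),
        LoginAction("fill", pass_selector, "{password}"),
        LoginAction("click", submit_selector),
    ]


def validate_actions(actions):
    """Return a list of problems; an empty list means the sequence looks plausible."""
    problems = []
    for i, action in enumerate(actions, start=1):
        if action.kind not in ("fill", "click"):
            problems.append(f"action {i}: unknown kind {action.kind!r}")
        if not action.selector.strip():
            problems.append(f"action {i}: empty selector")
        if action.kind == "fill" and not action.value:
            problems.append(f"action {i}: fill needs a value")
    if not actions or actions[-1].kind != "click":
        problems.append("last action should click the submit control")
    return problems


actions = build_login_actions("#user", 'input[name="password"]', 'button[type="submit"]')
print(validate_actions(actions))
```

Writing the sequence out first makes it easier to spot a missing selector before you test the login.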

 

Author: Garenne Bigby
Website: http://garennebigby.com
Founder of DYNO Mapper and Former Advisory Committee Representative at the W3C.
