The Ultimate Guide to White Hat SEO using Scrapebox – Onely Blog
More than a year ago, on my G+ profile, I posted about something that I found funny: using Scrapebox for white hat. A lot has changed during this year, and we now know we need to focus more and more on the quality of backlinks instead of quantity. This means that we have to rethink which tools we should use and how they can help us maximize our SEO.
Personally, like Bartosz mentioned in his blog post on LRT, I find Scrapebox very useful for every single SEO task I do connected with link analysis or link building.
Scrapebox – a forbidden word in SEO
I bet everybody knows Scrapebox, more or less. In short – it’s a tool used for mass scraping, harvesting, pinging and posting tasks in order to maximize the amount of links you can gain for your website to help it rank better in Google. A lot of webmasters and blog owners treat Scrapebox like a spam machine, but in fact it is only a tool, and what it’s actually used for depends on the “driver”.
Now, due to all the Penguin updates, a lot of SEO agencies have changed their minds about linkbuilding and have started to use Scrapebox as support for their link audits or outreach.
Scrapebox – general overview
You can skip this section if you know Scrapebox already. If not – here is some basic information about the most important functions you can use.
Scrapebox is cheap. Even without the discount code, it costs $97. You can order ScrapeBox here.
In this field, you can put the footprint you want to use for harvesting blogs/domains/other resources. You can choose from the Custom option and predefined platforms. Personally, I love to use the “Custom footprint” option because it allows you to get more out of each harvest task.
Here, you can post keywords related to your harvest. For example, if you want to get WordPress blogs about flowers and gardening, you can post “flowers” and “gardening” along with the custom footprint “Powered by WordPress”. It will give you a list of blogs containing these keywords and this footprint.
The URL’s Harvested box shows the total amount of websites harvested. Using the option number 6, you can get even more from each results list.
Select Engines & Proxies allows you to choose which search engines you want to get results from, and how many results to harvest. For link detox needs or competition analysis, I recommend making use of Bing and Yahoo as well (different search engines give different results, which means more information harvested). Also, you can post the list of proxies you want to use and manage them by checking whether they are alive, not blocked by Google and so on. After that, you can filter your results and download them as a file for further usage.
Comment Poster allows you to post comments to a blog list you have harvested, but in our White Hat tasks – we do not use it. Instead of that, we can use it to ping our links to get them indexed faster.
Scrapebox – Addons
By default, Scrapebox allows you to use a lot of different addons to get more and more from your links. You can find them by clicking “Addons” in the top menu in the main interface. Here is our list of addons:
To get more addons, you can click on “Show available addons”. Also, remember about premium plugins, which can boost your SEO a lot.
Keyword Scraper – the very beginning of your link building
One of the most useful things in Scrapebox that I use all the time is the integrated Google suggested keywords scraper. It works very simply and very quickly gives you a list of keywords you should definitely use while optimizing your website content or preparing a new blog post. To do this, just click on the “Scrape” button in the “Harvester” box and select “Keyword Scraper”. You will see a Keyword Scraper window like this one:
The fun starts right now. On the left side, simply put a list of keywords related to your business or blog and select Keyword Scraper Sources. Later, select the search engine you want to have research done on and hit the “Scrape” button.
As you can see on the screenshot above, you can also select the total “level” for the keyword scraper. For most keyword research tasks, it’s okay to have it on 2. When you are targeting a specific niche, you can adjust it up to 4 (for example, for cooking blogs, level 4 returns more keywords related to specific recipes or kitchen tips and tricks). Remember that the higher the level you choose, the longer it will take to see results.
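One way to picture the “level” setting is as repeated expansion: every level feeds the previous level's suggestions back in as new seeds. The sketch below is not how ScrapeBox is implemented. `SUGGESTIONS` and `fake_suggest` are hypothetical stand-ins for a search engine's suggest service, used only to show why each extra level returns more keywords and takes longer.

```python
from typing import Callable, Iterable

# Hypothetical canned data standing in for a real suggest service.
SUGGESTIONS = {
    "cooking": ["cooking tips", "cooking recipes"],
    "cooking tips": ["cooking tips for beginners"],
    "cooking recipes": ["cooking recipes easy", "cooking recipes chicken"],
    "cooking recipes easy": ["cooking recipes easy dinner"],
}


def fake_suggest(keyword: str) -> list[str]:
    """Return canned suggestions; a real scraper would query a search engine here."""
    return SUGGESTIONS.get(keyword, [])


def scrape_keywords(seeds: Iterable[str], level: int,
                    suggest: Callable[[str], list[str]] = fake_suggest) -> list[str]:
    """Expand the seed keywords `level` times, keeping first-seen order without duplicates."""
    results = list(dict.fromkeys(seeds))
    known = set(results)
    frontier = list(results)
    for _ in range(level):
        next_frontier = []
        for keyword in frontier:
            for suggestion in suggest(keyword):
                if suggestion not in known:
                    known.add(suggestion)
                    results.append(suggestion)
                    next_frontier.append(suggestion)
        # Only the newly found keywords are expanded on the next level.
        frontier = next_frontier
    return results
```

With this toy data, level 1 returns three keywords for the seed "cooking", level 2 returns six and level 3 returns seven, at the cost of one more round of lookups per level.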
After that, do a quick overview of the results you’ve got – if you see some superfluous keywords you don’t want to have in your keywords list, use “Remove” from the drop down list to remove keywords containing/not containing specified string or entries from a specified source.
If the list is ready – you can send it to ScrapeBox for further usage or just copy and save to your notepad for later.
Now: let’s start our Outreach – scrape URLs with Scrapebox
So: we have our keyword research done (after checking the total amount of traffic that keywords can bring to your domain) – now let’s see if we can get some interesting links from specified niche websites.
After sending our URL list to ScrapeBox we can now start searching for specified domains we would like to get links from.
Footprints – what they are and how to build them
Footprints are (in a nutshell) pieces of code or sentences that appear in a website’s code or in text. For example when somebody creates a WordPress blog, he has “Powered by WordPress” in his footer by default. Each CMS can have its very own footprints connected both with content or the URL structure. To learn more about footprints, you should test top Content Management Systems or forum boards to check if they index any repeatable pieces of code.
How to build footprints for ScrapeBox
Firstly, learn more about Google Search Operators. For your basic link building tasks you should know and understand these three search operators:
Inurl: – shows URLs containing a specified string in their address
Intitle: – shows URLs which have a title optimized for a specified text string
Site: – lists domains/URLs/links from a specified domain, ccTLD etc.
So if you already know this, do a test search answering questions related to your business right now:
Do I need dofollow links from blogs and bloggers related to my niche?
Do I need backlinks from link directories to boost my SEO for one specified money keyword?
Should these links be dofollow only?
On which platforms can I easily share my product/services and why?
Got it? Nice! Now let’s move to the next step – creating our footprint:
So let’s say that you are the owner of a marketing blog related to CPC campaigns and conversion rate optimization. The best idea to get new customers for your services is:
Manual commenting on specified blogs
Creating and posting guest posts on other marketing blogs related to your business
Being in top business link directories which allow you to post a lot of information about your business
Let’s state that we need top 100 links where we can post a comment/get in touch with bloggers and contact them for any guest postings.
From our experience and after we did keyword research with Keyword Scraper in ScrapeBox, we’ve noticed that the top platform for blogging about marketing is WordPress – both on our own domain and on free platforms.
To get the top 100 blogs related to our needs you can simply use:
“Powered by WordPress” + AdWords AND
This means that we want to search for WordPress blogs on Polish TLD domains with “AdWords” in any part of the site. However, the results may not be well-targeted unless you use advanced search operators, which let you specify where on the page the string has to be found.
Use footprints in ScrapeBox
Now, after you’ve learned the basics of footprints, you can use them to get specific platforms which will allow you to post a link to your website (or find new customers if you would like to guest blog sometimes).
To do that, simply put them here:
You can combine footprints with advanced search engine commands like site:, inurl: or intitle: to get only the URLs you need.
Advanced search operators and footprints have to be connected with the keywords we want to target so as to find more, better pages to link from.
For example you can search only for domains () containing specified keyword in URL (inurl) and title (intitle). Now the URL list will be shorter, but it will contain only related keywords matching our needs.
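Here is a minimal sketch of how a footprint, a keyword and the three operators could be glued into one query string. The helper name `build_query` and the sample values are made up for illustration; the operators themselves are the ones described above.

```python
def build_query(footprint: str = "", keyword: str = "", site: str = "",
                inurl: str = "", intitle: str = "") -> str:
    """Join a quoted footprint, a keyword and optional operators into one search query."""
    parts = []
    if footprint:
        parts.append(f'"{footprint}"')  # quotes ask for the exact phrase
    if keyword:
        parts.append(keyword)
    if site:
        parts.append(f"site:{site}")
    if inurl:
        parts.append(f"inurl:{inurl}")
    if intitle:
        parts.append(f"intitle:{intitle}")
    return " ".join(parts)


if __name__ == "__main__":
    # Hypothetical example values.
    print(build_query("Powered by WordPress", "AdWords",
                      inurl="adwords", intitle="adwords"))
```

Each extra operator narrows the list, so it is worth generating a few variants and comparing how many results each one harvests.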
For your product or service outreach, you can harvest a lot of interesting blogs hosted on free blog network sites or on sites specific to your language. Links from these pages will have different IP addresses, so they can be really valuable for your rankings.
Find Guest Blogging opportunities using ScrapeBox
By using simple footprints like:
“guest blogger” or “guest post” (to search only for links where somebody posted a guest post already – you can also use the allinurl search operator because a lot of blogs have a “guest posts” category which can be found in its URL structure)
Later, combine it with your target keywords and get ready to mail and post fresh guest posts to share your knowledge and services with others!
Check the value of the harvested links using ScrapeBox
Now, when your keyword research is done and you have harvested your very first links list, you can start with checking some basic information about the links. Aside from ScrapeBox, you will also need MozAPI.
Start with trimming to domain
In general, our outreach is supposed to help us build relationships and find customers. This means that you shouldn’t be only looking at a specific article, but rather the whole domain in general. To do that, select the “Trim to root” option from the Manage Lists box:
Later, remove duplicates by clicking the Remove/Filter button and select “Remove duplicate URLs”.
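If you ever need to do the same two steps outside ScrapeBox, they are easy to approximate. This is a rough sketch, assuming every URL in the list includes its scheme (http:// or https://); the function names are my own and the exact output format of ScrapeBox's “Trim to root” may differ.

```python
from urllib.parse import urlsplit


def trim_to_root(url: str) -> str:
    """Reduce a URL to scheme + host, dropping the path, query string and fragment."""
    parts = urlsplit(url.strip())
    return f"{parts.scheme}://{parts.netloc.lower()}/"


def remove_duplicates(urls):
    """Drop repeated entries while keeping the first occurrence of each."""
    return list(dict.fromkeys(urls))


def trim_and_dedupe(urls):
    """Trim every URL to its root, then remove the duplicates that trimming creates."""
    return remove_duplicates(trim_to_root(url) for url in urls)
```

Trimming first and de-duplicating second matters: two different articles on the same blog only collapse into one entry after both have been trimmed.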
Check Page Rank in ScrapeBox
Start with checking Page Rank – even if it’s not the top ranking factor right now, it still provides basic information about the domain. If the domain has a Page Rank higher than 1 or 2, this means that it’s trusted and has links from other related/high PR sources.
To check Page Rank in ScrapeBox, simply click on “Check Page Rank” button and select “Get domain Page Rank”:
To be 100% sure that each domain has a legitimate PR – use “ScrapeBox Fake Page Rank Checker”. You can find it in the Addons section in your ScrapeBox main window.
I tend to say that it’s not a good idea to believe in any 3rd party tools results about Link Trust (because it’s hard to measure if link is trusted or not), although it’s another great sign if a link’s every single result is “green”.
To check Domain Authority in ScrapeBox you can use the Page Authority addon. You can find it in your Addons list in ScrapeBox. To get it to work you will have to get your very own Moz API information (the window will appear after you select the addon).
This provides a quick overview of your links list. You can get information about the Page/Domain Authority, MozRank and the amount of external links pointing to the domain/page. With that, you can see if a URL is worthy of your link building tactics and all the work you plan to put in or not.
Remember: Do not rely on MozRank or Page/Domain authority only.
To get top links, try to look for average ones – a lot of backlinks with medium MozRank/Page/Domain authority.
Email scraping from a URL list using ScrapeBox
After you’ve harvested your first link list, you will probably want to get in touch with bloggers to start your outreach campaign. To do this effectively, use the Scrapebox Email Scraper feature. Simply click on the Grab/Check button and select to grab emails from harvested URLs or from a local list:
The results may not be perfect, but they can really give you a lot of useful information. You can export data to a text file and sort them by email addresses to find connections between domains.
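Sorting by email address to spot connections can also be scripted. The sketch below is only an illustration: the regular expression is deliberately simple (it will miss obfuscated addresses), and `pages` is a hypothetical mapping from a domain to text you have already downloaded from it.

```python
import re
from collections import defaultdict

# Simple pattern for illustration; real-world address matching is messier.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(text: str) -> list[str]:
    """Return the unique, lowercased email addresses found in a block of text."""
    return sorted({match.lower() for match in EMAIL_RE.findall(text)})


def group_domains_by_email(pages: dict[str, str]) -> dict[str, list[str]]:
    """Map each email address to the sorted list of domains whose text contains it."""
    owners = defaultdict(set)
    for domain, text in pages.items():
        for email in extract_emails(text):
            owners[email].add(domain)
    return {email: sorted(domains) for email, domains in owners.items()}
```

Any address that maps to more than one domain hints that the same person or network runs those sites.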
Merge and remove duplicates using ScrapeBox
If you are running a link detox campaign, it’s strongly recommended to use more than one backlink source to get all of the data needed, for example, to lift a penalty. If you have more than 40 thousand links in each file, you will probably want to merge them into one file and dig into it later.
To do this quickly, install the DupeRemove addon from the available addon list. After running it, this window will pop up:
Now simply choose “Select source files to merge” and go directly to the folder with the different text files with URL addresses. Later press “Merge files” to have them all in one text file.
To remove duplicate URLs or domains, choose “Select Source file” and pick where to export the non-duplicated URLs/domains. Voila! You have one file containing every single backlink you need to analyze.
For those who like to do things in smaller parts – you have the option of splitting a large file into smaller ones. Select your text file with backlinks and choose how many lines per file it should contain. From my point of view, it’s very effective to split your link file into groups of 1000 links per file. It’s very comfortable and gives you the chance to manage your link analysis tasks.
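The merge, de-duplicate and split steps can be sketched in a few lines if you want to script them. This is not the addon's code, just a minimal sketch of the same idea; the function names and the `links_001.txt` style output names are assumptions of mine.

```python
from pathlib import Path


def merge_unique(lists):
    """Merge several URL lists into one, dropping blank lines and duplicates."""
    merged = {}
    for urls in lists:
        for url in urls:
            url = url.strip()
            if url:
                merged.setdefault(url, None)  # first occurrence wins
    return list(merged)


def split_chunks(urls, lines_per_file=1000):
    """Split a list into consecutive chunks of at most `lines_per_file` entries."""
    return [urls[i:i + lines_per_file] for i in range(0, len(urls), lines_per_file)]


def merge_files(paths, target_dir, lines_per_file=1000):
    """Merge text files of URLs and write numbered chunk files; returns the paths written."""
    merged = merge_unique(Path(p).read_text(encoding="utf-8").splitlines() for p in paths)
    out_dir = Path(target_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for index, chunk in enumerate(split_chunks(merged, lines_per_file), start=1):
        out_path = out_dir / f"links_{index:03d}.txt"
        out_path.write_text("\n".join(chunk) + "\n", encoding="utf-8")
        written.append(out_path)
    return written
```

For example, 2,500 unique links with the default of 1000 lines per file end up in three files of 1000, 1000 and 500 links.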
ScrapeBox Meta Scraper
ScrapeBox allows you to scrape titles and descriptions from your harvested list. To do that, choose the Grab/Check option then, from the drop down menu, “Grab meta info from harvested URLs”:
Here, you can take a look at some example results:
You can export this data to a CSV file and use it to check how many pages use an exact match keyword in the title or optimize it some other way (i.e., do the keywords look natural to Google and not Made For SEO?).
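To show what the exact match check boils down to, here is a small sketch using only the Python standard library. It assumes you already have the page HTML as a string; `grab_meta` and `has_exact_match` are names I made up, and the check is a plain case-insensitive substring test.

```python
from html.parser import HTMLParser


class MetaParser(HTMLParser):
    """Collect the <title> text and the meta description of one HTML page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.description = ""
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and (attrs.get("name") or "").lower() == "description":
            self.description = (attrs.get("content") or "").strip()

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def grab_meta(html: str) -> dict:
    """Return the title and meta description found in an HTML string."""
    parser = MetaParser()
    parser.feed(html)
    return {"title": parser.title.strip(), "description": parser.description}


def has_exact_match(title: str, keyword: str) -> bool:
    """True if the keyword phrase appears in the title, ignoring case."""
    return keyword.lower() in title.lower()
```

Counting how many titles in a list return True for your money keyword gives a quick feel for how heavily a niche is optimized.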
Check if links are dead or alive with ScrapeBox
If you want to be pretty sure that every single internal/external link is alive, you can use the “ScrapeBox Alive Checker” addon. First – if you haven’t done this yet – install the Alive Checker addon.
Later, to use it, head to the Addons list and select ScrapeBox Alive Check.
If you were previously harvesting URLs – simply load them from Harvester. If not, you can load them from the text file.
Now, let’s begin with Options:
Also, remember to have the checkbox for “Follow relocation” checked.
The results can be seen here:
If a link returns an HTTP status code other than 200 or 301, ScrapeBox treats it as “Dead”.
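The rule itself is simple enough to sketch. The snippet below is my own approximation, not the addon's code: `is_alive` encodes the 200/301 rule, and `fetch_status` is a hypothetical helper that sends a HEAD request without following redirects, so a 301 is reported as 301. Some servers reject HEAD requests, so treat the result as a hint only.

```python
import urllib.error
import urllib.request

ALIVE_CODES = {200, 301}  # the two codes counted as alive in the rule above


def is_alive(status_code) -> bool:
    """Apply the rule: only 200 and 301 count as alive."""
    return status_code in ALIVE_CODES


class _NoRedirect(urllib.request.HTTPRedirectHandler):
    """Stop urllib from following redirects so the original status code is kept."""

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        return None


def fetch_status(url: str, timeout: float = 10.0):
    """Return the HTTP status code for `url`, or None if no response was received."""
    opener = urllib.request.build_opener(_NoRedirect)
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener.open(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        return error.code  # 3xx, 4xx and 5xx responses arrive here
    except (OSError, ValueError):
        return None  # DNS failure, timeout, refused connection, malformed URL
```

A list can then be classified with something like `is_alive(fetch_status(url))` for each URL, ideally with a delay between requests.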
Check which internal links are not indexed yet
So if you are working on some big onsite changes connected with the total amount of internal pages, you will probably want to be pretty sure that Google re-indexes everything. To be sure that everything is as it should be, you can use Screaming Frog SEO Spider and ScrapeBox.
So start crawling your page in Screaming Frog, using the very basic setup in the crawler setting menu:
If you are crawling a huge domain – you can use a Deep Crawl tool instead of the Screaming Frog SEO Spider.
Later, when your crawl is done, save the results to a file, then either open it and copy the URLs to the clipboard or import the file into ScrapeBox with one click:
When your import is done, simply hit the Check Indexed button and select the Google Indexed option.
Remember to set up the Random Delay option for index checking and the total amount of connections based on your internet connection. Mostly, I use 25 connections and a Random Delay between each query sent by ScrapeBox to be sure that my IP/proxy addresses won’t be blocked by Google.
After that, you will get a pop up with information about how many links are indexed or not, and there will be an extra column added to your URLs harvested box with information about whether they are Indexed or not:
You can export unindexed URLs for further investigation.
Get more backlinks straight from Google using ScrapeBox
Sometimes it’s not enough to download backlink data from Google Webmaster Tools or some other software made for that (although Bartosz found a real nice “glitch” in Webmaster Tools to get more links).
In this case – especially when you are fighting a manual penalty for your site and Google has refused to lift it – go deep into these links and find a pattern that is the same for every single one.
For example – if you are using automatic link building services with spun content, sometimes you can find a sentence or string that is not spun. You can use it as a footprint, harvest results from Google, and check if your previous disavow file contained those links or not.
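Checking the harvested results against your previous disavow file is easy to script. The sketch below relies on the disavow file format (one URL or one `domain:` entry per line, with `#` starting a comment). Treating subdomains as covered by a `domain:` entry is an assumption of this sketch, and the function names are mine.

```python
from urllib.parse import urlsplit


def parse_disavow(text: str):
    """Split a disavow file into a set of disavowed domains and a set of disavowed URLs."""
    domains, urls = set(), set()
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if line.lower().startswith("domain:"):
            domains.add(line[len("domain:"):].strip().lower())
        else:
            urls.add(line)
    return domains, urls


def not_yet_disavowed(harvested, disavow_text: str):
    """Return the harvested URLs that the disavow file does not cover."""
    domains, urls = parse_disavow(disavow_text)
    missing = []
    for url in harvested:
        host = (urlsplit(url).hostname or "").lower()
        covered = url in urls or any(
            host == domain or host.endswith("." + domain) for domain in domains
        )
        if not covered:
            missing.append(url)
    return missing
```

Whatever comes back from `not_yet_disavowed` is the set of links your earlier file missed and that deserves a manual look.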
And another example – some people create free templates for WordPress and share them with others to both help people have nicely designed blogs and obtain free dofollow links from a lot of different TLDs. Here is an example:
“Responsive Theme powered by WordPress”
This returns every single domain using this theme from CyberChimps. If you combine it with the keywords you used when linking to your site, you will probably get a very big, nice list of WordPress blogs with more closely related results.
Check external links on your link lists
After you have done your first scrape for a custom-made footprint, it’s good to know the quality of the links you have found. And once again – ScrapeBox and its amazing list of Addons will help you!
“Outbound Link Checker” is an addon which will check links line by line and list both internal and external links. Because the addon supports multithreading, you can check thousands of links at the same time.
To use “Outbound Link Checker” go to your Addons list and select Outbound Link Checker:
Next, choose to load a URL list from ScrapeBox or from an external file.
After that, you will see something like this:
The magic starts now – simply press the “Start” button.
Now you can filter the results if they contain more than X outgoing links. Later, you can also check the authority of those links and how valuable they are.
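To make the idea concrete, here is a rough sketch of what counting internal and external links on one page involves. It is not the addon's logic: it assumes you already have the page HTML, it only compares exact host names (so a subdomain counts as external), and the names `split_links` and `too_many_outbound` are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def split_links(page_url: str, html: str):
    """Return (internal, external) lists of absolute http(s) links found in `html`."""
    collector = LinkCollector()
    collector.feed(html)
    page_host = (urlsplit(page_url).hostname or "").lower()
    internal, external = [], []
    for href in collector.hrefs:
        absolute = urljoin(page_url, href)  # resolve relative links
        parts = urlsplit(absolute)
        if parts.scheme not in ("http", "https"):
            continue  # skip mailto:, javascript: and similar
        host = (parts.hostname or "").lower()
        (internal if host == page_host else external).append(absolute)
    return internal, external


def too_many_outbound(page_url: str, html: str, max_external: int) -> bool:
    """True if the page has more than `max_external` external links."""
    return len(split_links(page_url, html)[1]) > max_external
```

A page with hundreds of external links is usually a poor place for your link, which is exactly what the “more than X outgoing links” filter is for.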
As you can see – ScrapeBox in the Penguin era is still a powerful tool which will speed up your daily SEO tasks if used properly. Even if you do not want to post comments or links manually, it can still help you find links where you can get both traffic and customers.
Free Scrapebox Tutorial – Craig Campbell SEO
Scrapebox is a tool that’s been around for many years and I’d be surprised if there is an SEO out there who is not using this tool. However, for the average user it can be quite a complex tool: it is hard to navigate, and you will most likely be unaware of how to pull the data you are looking to scrape.
So I’ve decided to provide an in-depth, FREE Scrapebox tutorial/guide that will hopefully help you with this.
It has been broken down into sections so that you are not overloaded with all this information at one time.
Introduction to Scrapebox
Scrapebox is often called the “Swiss army knife of SEO” and to be fair it probably is. The things this tool can do are unbelievable. It is an old tool that is updated regularly but retains its very simple interface, which makes it one of the best tools even in 2017.
Now you can get the tool from here, it costs $97. You often get people whining and moaning at the cost of a tool, but for what this tool does it is very much worth the $97 they charge, which gives you a lifetime licence. If you want a tool to provide data that is worth using then there is always going to be a cost, as the developers have to constantly work on the tool to keep it working and they can’t provide that for FREE.
Above is the interface you will see when you get Scrapebox. It all looks simple and easy to use. I have made a video to show you what all of the options do, which will give you an overview of exactly what Scrapebox can do for you.
You can either download this directly onto your PC or you can use a VPS. Now that it’s downloaded you need to buy yourself some private proxies.
These are what Scrapebox will use to get the information from search engines. The problem here is that Google notices when it is being hit by the same IP address in quick succession. The way to combat this is to buy private proxies (only you have access to them), and these will rotate so Google won’t pick up on them. Proxies can be bought from a few places, one place I’d recommend is:
I recommend that you get at least 20 proxies to begin with, you can always add more later if you need more but this is a good amount to start with. Proxies are paid for monthly, every month the company you have used should send you a new list by email.
Now you have your proxies it’s time to add them to Scrapebox. Your proxies can be saved directly into Scrapebox through a simple copy and paste feature.
To save your proxies click on the ‘manage’ button at the bottom left of Scrapebox (highlighted in red in picture above). This will open up a window that allows you to paste your proxies. There is an option to harvest proxies, this will find you a set of public proxies you can use but I would not advise this. Most public proxies have been abused and Google will blacklist them.
When you purchase your proxies you will most likely receive them in a format like so: ipaddress:port:username:password. If this is the format you’ve received them in then go ahead and paste your proxies into Scrapebox. If not then you need to arrange them in this format for Scrapebox to accept them.
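If your list arrives in another layout, a few lines of script can rearrange it. This is only a sketch: `parse_proxy` checks the ipaddress:port:username:password layout described above, while the `username:password@ipaddress:port` input handled by `from_at_format` is my assumption about one alternative a seller might use.

```python
from typing import NamedTuple


class Proxy(NamedTuple):
    host: str
    port: int
    username: str
    password: str


def parse_proxy(line: str) -> Proxy:
    """Parse one ipaddress:port:username:password line; raises ValueError if malformed."""
    fields = line.strip().split(":", 3)  # the password itself may contain colons
    if len(fields) != 4 or not all(fields):
        raise ValueError(f"expected ipaddress:port:username:password, got {line!r}")
    host, port, username, password = fields
    return Proxy(host, int(port), username, password)


def from_at_format(line: str) -> str:
    """Convert username:password@ipaddress:port into ipaddress:port:username:password."""
    credentials, _, address = line.strip().rpartition("@")
    if ":" not in credentials or ":" not in address:
        raise ValueError(f"expected username:password@ipaddress:port, got {line!r}")
    return f"{address}:{credentials}"
```

Running every line through `parse_proxy` before pasting is a cheap way to catch a truncated or mangled entry.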
To add them, copy the proxies from the email that you received them in and click on the option that says ‘load from clipboard’. Now all of your proxies will be in front of you, and it’s best to quickly test them. (If one or two fail, re-test the failed proxies and they should pass.) Any proxies that pass will be highlighted in green and any that fail in red.
One final thing is to filter out any that don’t work with Google, any that don’t work will be completely useless and will hinder your scrapes.
Once you’re happy with all the proxies on your list make sure you save them to Scrapebox.
Now your proxies are set, there are a few settings you can choose to adjust. For the most part all of the settings can stay at their defaults, but if you have a lot of proxies you could adjust the number of proxy connections that connect to Google. To do this head into the tab ‘connections, timeout and other settings’. If you are using 50 proxies then you should probably use somewhere between 5 and 10 connections.
Once the proxies are set you need to understand footprints before beginning any kind of scrape. The next topic is all about your Scrapebox footprints.
Footprints regularly appear on webpages. For example the words “powered by WordPress” or “leave a comment”: the first can be seen on a lot of sites using WordPress as it appears in its default themes, and the second can be seen on most blogs. This is what makes a footprint. If you were planning on finding WordPress sites to blog on then you can enter this footprint along with a few niche keywords and Scrapebox will find plenty of sites for you.
What you want to begin doing is making your own good footprints. Having great footprints is key when using Scrapebox, these will give you the best results possible. Building your own takes a bit of time and research but once you’ve got a few good ones you can use them over and over again. When looking for footprints you can use some of these common operators: inurl:, intitle:, intext:.
Here is a list of common footprints for you to download: Footprints List
The best way to test your footprint is simply Googling it; you can then judge it on how many results appear. If it’s only throwing up 1000-2000 results, then your footprint is useless. You need to try and find one that has a good amount of results before it becomes of any use to you. Creating a list of footprints and separating them by blog platform etc. is a good idea before you start scraping. Once you’re happy you can begin putting them to the test.
Now you’ve got a good set of footprints to use it’s time to start scraping.
When it comes to scraping you could carry out scrapes that take ten minutes or ones that take a day. This is where having a VPS comes in pretty handy, as you can close the connection and leave the scrape running all day without it taking up any of your PC’s power. If you do decide to run it locally you will only be able to run it on a Windows PC. You also have the option to run it in Parallels on your Mac, but make sure you allocate your RAM appropriately if you choose to do so.
Whenever you decide to start a scrape you need to make sure you plan out everything. A few factors you need to decide on are:
The amount of proxies
The amount of connections
Amount of queries
Speed of your Proxies
The default settings should be good to go, it’s all about how many keywords you put in which will determine how long a scrape will run. Depending on whether you are using public or private proxies and how many you have, then you could change the amount of connections you use.
The next part is the keyword box.
You can paste all of your keywords in here which will be used alongside your footprint. So if your footprint is simply “leave a comment” and you have a few keywords like “window repairs” and “door repairs”, then your searches will look like this:
“leave a comment” “window repairs”
“leave a comment” “door repairs”
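The pairing shown above is just every footprint combined with every keyword. As a small sketch (the function name `build_searches` is my own, and it uses straight quotes):

```python
from itertools import product


def build_searches(footprints, keywords):
    """Pair every footprint with every keyword, quoting both for exact-phrase matching."""
    return [f'"{footprint}" "{keyword}"' for footprint, keyword in product(footprints, keywords)]


if __name__ == "__main__":
    for search in build_searches(["leave a comment"], ["window repairs", "door repairs"]):
        print(search)
```

The number of searches is the number of footprints multiplied by the number of keywords, which is worth keeping in mind when you estimate how long a scrape will run.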
The next step is to make sure that you check the box that says “use proxies” otherwise the scrape won’t run.
You will also see ‘results’ in this section. This is quite a straightforward setting. This is the amount of results it will pull from each search engine, so if you’re scraping only for a few sites you might want to choose 50 results for each of your keywords. However if you’re scraping for a massive amount of sites then you should use 1000, which is the maximum per keyword.
The only problem here is that good footprints can have over 100,000 results. To narrow down your results to the best possible you have to start using stop words. For example if you search using the footprints “leave a comment” and “window repairs” you get over 30,000 results.
Now if we add the stop word “about” to the search then we get just over 20,000 results. These words allow you to narrow down your search through Google’s index so you can get exactly what you are looking for.
Here is an example of a few stop words you could use:
Now that you’ve got your footprints, keywords and stop words to go with them it’s time to start your scrape. If you’re running a scrape with a mass amount of keywords then you should leave it running on your VPS to do its thing (if you’re using your PC then you should run it overnight).
When you come back it should be finished, showing you a full list of URLs that have been scraped. If your list hasn’t finished scraping yet, you can leave it longer or stop it manually. If you do stop it you will be given a list of the keywords that have finished and the ones that haven’t. Any keywords that come back with nothing have either not been searched yet or returned no results.
Once you’ve stopped your scrape you can export all the uncompleted keywords so you can pick it up where you left off.
Scrapebox can only have 1,000,000 results in its harvester at a time. It will automatically create a folder named with the date and time that contains the 1,000,000 and continue onto the next batch. This is great but there will be a really high number of duplicate URLs. These are easy to remove when you have a bunch of URLs under the harvester limit. You can do this by opening the ‘Remove/Filter’ section and removing duplicate domains and URLs, but if you’re working with multiple files you’ll need to use dupremove – it is a free Scrapebox add-on that can combine millions of URLs and remove any duplicate URLs and domains across all of the files.
Dupremove also allows you to split your files into smaller files, which are a whole lot easier to manage. To begin using this you need to click on the ‘Addons’ tab and select ‘Show Available Addons’. Scroll down to dupremove and install the add-on. Once it’s installed you’ll see the window in the picture above. It’s a really simple process: all you do is select the files containing the URLs, which are found in the Scrapebox harvester folder (remember to export the existing list in the harvester there as well). Merge the files and name the result whatever you want. I call it ‘merged list’. Then you add the list as the source file and name the new target file, which I would call ‘final merged’. Now you can split the files up, setting the amount of URLs that you want per file.
If you want to scan a site individually then you can enter the URL(including) into the custom footprint box and it will show you the whole list of all its pages. From here you can use the email grabber which I will show later on in the course or check for backlinks on the site.
There are quite a few things you can do with your harvested URL list, a lot of the features are shown on the right hand side.
Remove/Filter – This area allows you to clean up any unnecessary URLs and sort your list into an order that you prefer. In this section you would remove your duplicate domains and URLs and can even remove URLs containing certain words, numbers of a certain length and can even remove sub domains.
Trim – You can trim any URLs to its root domain removing any sub pages within the site from your harvested list.
Check Metrics – This allows you to check the URL and domain metrics on Yandex.
Check Indexed – Once you’ve got a harvested URL list it’s a good idea to check if the sites are indexed on Google. You can also check if they’re indexed on Yahoo or Bing.
Grab and Check – From here you can get quite a lot out of your URLs. You can grab anything from comments to images and emails.
Import URL List – You can import a list from either a file or paste it from a copied list of URLs to the harvester from your PC, this can also be added onto the existing URLs in the harvester.
Export URL List – There are quite a few file formats you can export to, including text, Excel, and HTML.
Import/Export URL’s & PR – This allows you to import and export the URLs with their page rank.
More List Tools – Features like randomising the list’s order are available in here.
The next topic I will go on to explain is keywords in Scrapebox.
Scrapebox has a few decent tools you can use for some keyword research.
This tool allows you to enter a few keywords and the tool will look through the selected search engines and find keywords similar to the ones you’ve entered. First you need to click on the scrape button just underneath the keyword section, then you’ll be presented with a window that looks like this:
If you click on the menu at the bottom left you can select the search engines that you want the scraper to take it from.
Now that you’ve selected your sources it’s time to enter a few words that you want to scrape. I’ve typed in the words ‘window repairs’ and ‘door repairs’, clicked ‘start’ and up comes the suggested list.
This is a list of 34 unique keywords that have been generated by Scrapebox. You need to click on the ‘export’ button to save the keywords to Scrapebox and then start the process over again with your new 34. You can keep doing this until you’ve finally got your ideal amount. Now you can install the Google Competition Finder.
Google Competition Finder
This is another add-on that is freely available in Scrapebox. With this you can check how many pages have been indexed by Google for each of your keywords. It is very useful in finding out how competitive each of your keywords is. To use this head to the ‘Addons’ tab and once you’ve installed it, load your keywords from Scrapebox into it.
Now your keywords are ready, make sure that you check the box for exact match. This means your keywords will use quotes to ensure the most accurate results. You can adjust the amount of connections you are using as well but I recommend about ’10’ to be safe.
Once it’s finished export the file as an excel file, this is done at the bottom right hand site by clicking on the ‘Export’ button. Open up the excel file and sort the files from lowest to highest so you can get a better idea of how competitive each of your exact match keywords are. You could from here start to sort your keywords into high and low if you have a really high volume of words.
The stats you gain from here aren’t 100% accurate, but they give you a good indication of your Google competition. If the number of indexed pages for a keyword is pretty low, that is a good sign it will be easier to rank for.
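If you would rather do the sorting and the high/low split outside Excel, here is a minimal Python sketch. It assumes you have already read the export into (keyword, indexed pages) pairs; the function name and the threshold are made up for illustration, and where you draw the line between low and high is your call.

```python
def bucket_by_competition(rows, threshold):
    """Sort (keyword, indexed_pages) pairs from lowest to highest and
    split them into low and high competition lists."""
    ordered = sorted(rows, key=lambda row: row[1])
    low = [kw for kw, pages in ordered if pages <= threshold]
    high = [kw for kw, pages in ordered if pages > threshold]
    return low, high


# Made-up numbers, purely to show the shape of the data.
rows = [
    ("window repairs", 250000),
    ("sash window repairs leeds", 1200),
    ("door repairs", 480000),
]
low, high = bucket_by_competition(rows, threshold=50000)
```

Both lists come back sorted from least to most competitive, so the easiest targets sit at the top of the low list.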
Scrapebox TDNAM Scraper
This is a handy tool for finding expired domains. The only problem with this tool is that it takes all of its domains from the GoDaddy auction marketplace. There are plenty of other marketplaces across the web, which leaves this tool somewhat limited. Overall, this is a decent tool to use if you don’t want to pay for any other auction tool.
To use it, import your keywords from Scrapebox and the tool will find any expired domains relating to these keywords. Once the scan is finished, export your potential domains into a simple text file.
With every domain you think about buying, you should do your own manual checks to be sure that it is worth buying. The problem with using Scrapebox is that it still uses PR as its main metric. This used to be a great way to determine how powerful a site is, but since the infamous Penguin update everything has changed. PR hasn’t been updated in years, so make sure that you check with tools such as Ahrefs and Majestic SEO to get a better idea before you commit to buying a domain.
One tool it does have that can be quite useful is ‘Scrapebox page authority’. This tool lets you import all of the domains you collected earlier and bulk check their Page Authority (PA) and Domain Authority (DA). Again, this isn’t the most important metric, but it is useful.
To begin using this tool, head into the ‘account setup’ page, where you will need to enter your access ID and your API information. You can get this information from the Moz website.
Create a free Moz account, then click on the button that says “Generate Mozscape API key” and up will come your access ID and secret key. (You have to be signed in to generate the key or it will just take you back to the overview page.)
Now paste this information into the ‘account setup’ page. You can also add one of your proxies, using the format access ID|secret key|proxy-details.
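To make the pipe-separated format concrete, here is a small Python sketch that splits such a line into its parts. The names `MozAccount` and `parse_account_line` are hypothetical, the sample values are placeholders, and the proxy part is treated as an opaque string because the exact proxy syntax isn’t spelled out here.

```python
from typing import NamedTuple, Optional


class MozAccount(NamedTuple):
    access_id: str
    secret_key: str
    proxy: Optional[str]   # None when no proxy details are given


def parse_account_line(line: str) -> MozAccount:
    """Split 'access ID|secret key|proxy-details' into its fields."""
    # maxsplit=2 keeps any further '|' characters inside the proxy part
    parts = [part.strip() for part in line.strip().split("|", 2)]
    if len(parts) < 2 or not parts[0] or not parts[1]:
        raise ValueError("expected at least 'access ID|secret key'")
    proxy = parts[2] if len(parts) == 3 and parts[2] else None
    return MozAccount(parts[0], parts[1], proxy)


# Placeholder credentials, not real ones.
account = parse_account_line("my-access-id|my-secret-key|127.0.0.1:8080")
```

A line without the third field still parses, which matches the proxy being optional.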
There is a lot more that goes into finding powerful expired domains, but for now this is just about what Scrapebox can do to filter out some of the useless domains before you go ahead with some proper research.
6. Where to place Links
Scrapebox can be used to find lots of link building opportunities. For example, say you find a WordPress blog that is on page 4 or 5 for a keyword that you want to rank for. All you would do is create a profile, comment on a few articles and then place a link back to your own website.
With Scrapebox you can use a few footprints to scan a massive number of sites for specific platforms. To do this, harvest a few URLs using a list of keywords. Once the list is harvested, delete any duplicate URLs and open up the ‘Scrapebox page scanner’.
The page scanner will check each of your websites and display the platforms they use.
If you click on the ‘edit’ button at the bottom, you can edit the footprints currently used. Scrapebox contains quite a few footprints by default, which are simply a bunch of popular platform names. You can add any footprints you’ve created here as well, but note that the footprints used here are not the same as your typical harvesting ones. The difference is that the scanner searches the source code of each site for the footprints that identify the platform being used. If you take the time to build some really good footprints, you can find any platform you want.
Load in your harvested URLs and start the process. The scan will run and the platforms will begin to show in the results tab. Once it’s finished, click on the export button and it will create a file separating your sites by platform.
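The idea of matching footprints against a page’s source code, then grouping sites by platform, can be sketched in Python like this. It assumes you already have each page’s HTML as a string. The footprints below are illustrative examples only, not Scrapebox’s built-in list, and the function names are made up.

```python
# Illustrative footprints only; these are NOT Scrapebox's defaults.
EXAMPLE_FOOTPRINTS = {
    "WordPress": ["wp-content/", "powered by wordpress"],
    "Drupal": ["drupal.settings", "sites/default/files"],
}


def detect_platforms(html, footprints=EXAMPLE_FOOTPRINTS):
    """Return every platform with at least one footprint in the source."""
    source = html.lower()   # case-insensitive substring matching
    return [
        platform
        for platform, markers in footprints.items()
        if any(marker.lower() in source for marker in markers)
    ]


def group_by_platform(pages, footprints=EXAMPLE_FOOTPRINTS):
    """Map platform name -> list of URLs; `pages` maps URL -> HTML."""
    grouped = {}
    for url, html in pages.items():
        matches = detect_platforms(html, footprints) or ["Unknown"]
        for platform in matches:
            grouped.setdefault(platform, []).append(url)
    return grouped
```

Plain substring matching is the simplest approach that works. The more specific your footprints are to a platform’s markup, the fewer false matches you will get.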
Once you’ve got a list you’re happy with, run the checks discussed in the previous topics, such as Page Authority (PA) and Domain Authority (DA), along with your own additional external research, to help narrow down which sites have the best link building opportunities.
One thing about commenting using Scrapebox is that it can leave obvious footprints if it’s not done properly. Comment blasting like this should never be done on your money site; if you do any commenting like this, it should be done on your third tier sites. When you harvest a list of sites, you need to check the PR of each site and how many outbound links it has. Sites with few outbound links and high PR are what you are looking for. Problems start to occur if you don’t take the time to prepare good spun comments and enough fake emails, because you could leave a footprint.
Once you have a decent set of URLs, it’s time to create your profiles. This is what you’ll need:
List of sites to place links
Use the keyword scraper to gather up a massive list from all of the sources and grab no less than 100. You can also add in some generic anchors and then save your file as a simple text file.
The next step is creating a whole list of fake emails. Luckily Scrapebox has a great little tool for this exact purpose. Under the tools tab open up the ‘name and email generator’.
All you have to do here is change the ‘generate’ sections to about 20,000, check the box to include numbers in the email and change the ‘Domains for Emails @’ to one of the email services. Once you’ve done this, transfer your names and emails into the poster, then create another list of 20,000 emails, this time with a different email service.
One of the easiest ways to get a massive list of spun comments is taking comments from similar sites in your niche and spinning those. Scrapebox has a feature available that allows you to grab comments from your harvested URLs.
Click on the ‘Grab/Check’ button, then ‘grab comments from harvested URL List’. This will tell you what platform each site is using and how many comments it’s managed to scrape. Once it’s finished save your list as a text file.
Now you need to use a spinner for all of your comments. You can get article spinners from $50 a year, but for comment spinning you don’t have to spend a fortune on one of these tools. The one I would recommend costs $47 a year and is great for comment spinning.
Once you’ve got a tool you want to use, go through the settings, which are pretty straightforward. Once you have all of your spun comments, save them as a text file and put them into the comments poster.
Create a list of sites that you want the backlinks for and name it
Auto Approve List
If you search for ‘auto approve website lists’ on Google, you should get plenty of results. Just grab one of these lists.
You need to adjust the timeout setting of the comment poster. To do this, click on the settings tab and head into ‘Connections, Timeout and Other Settings’.
In the ‘timeouts’ section, set the poster timeout to 120 seconds. This gives the poster enough time to load bigger sites with lots of comments.
You can either choose the fast poster or manual poster. With manual poster you can go through each site and post comments individually. Once everything is set up click ‘start poster’. Now you can work your way down the list and add each comment. Scrapebox will automatically fill in the form as best it can which is quite helpful in saving some time. Of course if you decide to manually post it will take a lot more time but you will be able to decide exactly what is being posted on each site.
If you are using fast poster, check the box and let it run in the background.
8. Email Scraping
Scrapebox is great at scraping email addresses from harvested URL lists. First enter your footprint, add in all of your keywords and start scraping. Once you have your list, delete any duplicate URLs. Now head to the option that says ‘Grab/Check’ and choose ‘Grab emails from harvested URL List’.
You can set it to take only one email per domain or grab all that are available. (I recommend taking all of them, as with one per domain it may take an email from a user in the comments instead of an email relevant to the site.) Click start and it will begin going through each of the sites looking for any email addresses. When it’s finished, you can save all of them into a text file. You also have the option to filter out emails containing certain words before you save the file, which is great for narrowing down the results to your exact requirements.
Verifying Your Emails
The next stage is to verify that they are all working emails. I do this with a tool called ‘GSA Email Verifier’. When all of your emails are ready, you can import the text file into this tool. You can run a quick test or a full test on each email; the difference is that the quick test pings the server for a response, while the full test tries to connect with the server. You are best running a full test for more reliable results. Once the results are in, you have the option to export only the working emails into a text file.
Scrapebox comes with a whole array of free add-ons. It’s a good idea to familiarise yourself with them in case you ever need to use them. Go to the ‘add-ons’ tab and take a look through the list. Any plugins already installed show up in green and ones available are in red.
This plugin needs the Moz API information. Once it is entered, you can scan up to 1000 URL backlinks.
The add-on allows you to check the status of each of the sites and will even follow any redirects, telling you where the final destination is.
Google Image Grabber
It allows you to scrape Google Images based on the keywords you’ve entered. You can either download these to your PC or preview them.
This plugin allows you to extract both internal and external links from sites and save them to a text file.
The plugin will scrape a URL’s XML or AXD sitemap instead of scraping Google’s index pages. There is also a feature called ‘deep crawl’ which will scrape the URLs within the sitemap and any sites found within that.
Malware and Phishing Filter
This allows you to check sites for malware or if they have had it within the last 90 days. You can use your harvested URL list or load in a file. Any sites that contain malware can be filtered out to help keep your entire list risk free.
There are also some great link building opportunities with this tool. If you do find malware on someone’s site, you can contact the webmaster and inform them of the problem. You could even explain how to fix the issue or link them to an article that does. A lot of the time you might find that they will be willing to link to your website. You probably shouldn’t ask for a link, but it’s entirely up to you how you approach the situation.
Just a simple audio add-on that is available for entertainment.
The program is simply a playable chess game.
This plugin allows you to check all your active connections. It’s useful for monitoring your connections if any problems occur.
Checks your upstream and downstream speed.
Broken Links Checker
An add-on that checks your list of URLs and extracts all of their links. Once this is done, it will check to see if these links are dead or alive.
Fake PR Checker
This add-on works hand in hand with the TDNAM scraper. It will bulk check all of your sites’ PR values and tell you which ones are real and which are fake.
Google Cache Extractor
Will allow you to check the Google cache date of each URL and save it as a file.
This will bulk check social metrics for networks such as Twitter, Facebook, LinkedIn and Google+. You can export these into many different formats.
It will pull up all of your sites’ Whois data, such as the registrar’s name, email and domain expiration date.
This will check the backlinks for a bulk list of URLs and determine which are dofollow links and which are nofollow links.
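For anyone curious what a dofollow/nofollow check boils down to, here is a rough Python sketch using only the standard library. It assumes you already have the HTML of the linking page. The class and function names are made up, and this is not how the add-on itself is implemented. A link counts as nofollow when its rel attribute contains the nofollow token.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class BacklinkParser(HTMLParser):
    """Collect links pointing at `target_domain` and label each one."""

    def __init__(self, target_domain):
        super().__init__()
        self.target_domain = target_domain.lower()
        self.results = []   # list of (href, "dofollow" or "nofollow")

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attr = dict(attrs)
        href = attr.get("href") or ""
        host = (urlparse(href).hostname or "").lower()
        # Match the domain itself or any of its subdomains (e.g. www.)
        if host == self.target_domain or host.endswith("." + self.target_domain):
            rel_tokens = (attr.get("rel") or "").lower().split()
            label = "nofollow" if "nofollow" in rel_tokens else "dofollow"
            self.results.append((href, label))


def classify_backlinks(html, target_domain):
    parser = BacklinkParser(target_domain)
    parser.feed(html)
    parser.close()
    return parser.results
```

Relative links are ignored because they have no hostname, which is fine here since a backlink from another site always carries a full URL.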
You will be able to submit your sites to various other sites such as Whois sites. This will help get all of your sites indexed by Google a lot faster.
This will find articles from directories based on the keywords you enter. You can then save them to your computer as a simple text file.
Anchor Text Checker
You can enter your domain into this tool along with the URLs of sites that link back to you. It will then take the anchor text from each of your links and display them with a percentage of how often each occurs.
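The percentage breakdown is easy to reproduce yourself once you have a list of anchor texts. Here is a minimal Python sketch; the function name and sample anchors are made up for illustration.

```python
from collections import Counter


def anchor_text_distribution(anchors):
    """Return (anchor text, percentage) pairs, most common first."""
    cleaned = [a.strip().lower() for a in anchors if a.strip()]
    counts = Counter(cleaned)
    total = len(cleaned)
    return [
        (text, round(100 * count / total, 1))
        for text, count in counts.most_common()
    ]


distribution = anchor_text_distribution(
    ["Window Repairs", "window repairs", "click here", "acme glazing"]
)
```

Anchors are lowercased before counting so that ‘Window Repairs’ and ‘window repairs’ are treated as the same anchor. Drop the `.lower()` call if you want them counted separately.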
Mobile Site Tester
Instead of individually entering every page on your site, you can use the mobile site tester to go through your pages automatically. It will tell you whether each page has passed or failed, along with scores for each of the attributes that factor in.
This add-on will bulk check the final destination of your site’s URLs.
Automator (Premium Plugin)
With this plugin you can set up a lot of the features to run automatically without you there. It is quite simple to set up: all you need to do is double click on the features you want automated to add them to the list. It will carry out the task at the top first and work downwards. For example, I have set up an email scrape as follows.
First you harvest the URLs, then delete duplicate domains and then grab the emails. I have put on a delay of a couple of seconds before it starts the process over again.
You can also set up an email notification at the end of the process, then go in and get the emails saved in the destination you choose. The command parameters section holds the settings for the individual processes. Set them in any way that suits you.
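As an illustration of what the ‘delete duplicate domains’ step in that chain does, here is a small Python sketch that keeps the first URL seen for each domain. The function name is made up, and treating www.example.com and example.com as the same domain is my assumption here, not a statement about how Scrapebox handles it.

```python
from urllib.parse import urlparse


def remove_duplicate_domains(urls):
    """Keep only the first URL seen for each hostname."""
    seen = set()
    kept = []
    for url in urls:
        url = url.strip()
        # URLs without a scheme have no hostname and are skipped
        host = (urlparse(url).hostname or "").lower()
        if host.startswith("www."):
            host = host[4:]   # assumption: www and bare domain are equal
        if host and host not in seen:
            seen.add(host)
            kept.append(url)
    return kept
```

Other subdomains such as blog.example.com are kept as separate entries in this sketch. Adjust that if you want one URL per registered domain.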
So you can go and get Scrapebox, and I’m sure you will agree that it’s the best money you are likely to spend on any one tool.
The Only Scrapebox Tutorial You Need – SEO – Matthew …
Scrapebox is like the Swiss army knife of SEO tools.
It should be in everybody’s arsenal and can be used for a huge range of functions.
A lot of people have asked me for a dedicated Scrapebox tutorial series and, well, here it is :)
What You Will Learn
What it does
How to build footprints
How to scrape in bulk
How to use it for keyword research
How to buy expired domains
How to analyse links
How to get guest posts
How to blast comments
How to build niche specific comments
Page rank sculpting
How to automate everything
How to analyse competitor backlinks
How to use the free addons
You will also get a bunch of exclusive resources and links to additional tutorials teaching you how to use it like a pro!
Download The Scrapebox Tutorial In PDF Format
This was the culmination of months of hard work from Jacob King along with contributions from Charles Floate & Chris Dyson.
So props to them – it was so good I wanted to share it here with you all!
You should check out some of their other posts:
How To Sucker Punch Google in the Gonads with Fake Branding
How To Be An SEO For $57
Newsjacking – Real Time Link Building
Ranking For Rand Fishkin
The Link Building Process In Gif Format
Scrapebox Discount Code
To close off the Scrapebox tutorial, I wanted to show you the cheapest place to buy it.
There’s an exclusive Scrapebox discount available for users of the BlackHatWorld forum.
But you can access that directly here without being a member, which will save you $30 on the usual price.
Frequently Asked Questions about scrapebox tutorial
How do you use a Scrapebox?
Scrapebox – a forbidden word in SEO. I bet everybody knows Scrapebox, more or less. In short – it’s a tool used for mass scraping, harvesting, pinging and posting tasks in order to maximize the amount of links you can gain for your website to help it rank better in Google.