I’m really enjoying this blogging thing! Every comment I am getting from my readers is a new idea that I feel rushed to put into practice.

My reader, Andrea, mentioned she parses log files to mine for keywords as well. That is an excellent idea.

I decided to put that idea into code and here is a new tool to mine for long tail keywords.

To make really good use of it, I would setup a PPC campaign in Google with a “head keyword” in broad match, bidding at the minimum possible. Make sure your ads maintain good click-through rates (over 0.5%) to avoid getting disabled. Run it for a week or two (preferably more) and you will have a good number of search referrals and “long tail keywords” that people are actually looking for. You can later create good content pages that include those keywords. In most cases, long tail keywords are really easy to rank with on-page optimization only.

I will probably write a Youmoz entry with more detailed instructions on how to take advantage of this. In this way I can get more people to try it and get really valuable feedback.

Here is the Python code: longtailminer.zip

Here is the output:

search.sympatico.msn.ca - best places to vacation in april
www.ask.com - help find a cheap vacation package anywhere
www.ask.com - new york vacation package deals
search.yahoo.com - vegas vacation packages
search.yahoo.com - what is the best beaches to stay in jamaica
search.yahoo.com - outrageous hawaii vacation packages
www.google.se - “paris in 5 days” versailles
search.sympatico.msn.ca - Vacation Package Deals
search.yahoo.com - vacationpackage
search.yahoo.com - vacation packages
search.msn.com - 10 best places for vacation
in.search.yahoo.com - vacation package
search.yahoo.com - best places to vacation in june/july
search.sympatico.msn.ca - best travel deals for june
search.msn.com - last minute caribean deals
search.yahoo.com - package vacation
www.google.com - Tripscan
search.sympatico.msn.ca - best places to vacation in June
search.sympatico.msn.ca - best places to travel in october
search.yahoo.com - vacation package
search.msn.com - caribean vacation
search.msn.com - Best Caribean vacation
www.ask.com - Cheap Vacation Package
search.sympatico.msn.ca - CANYON RANCH IN LENNOX
search.sympatico.msn.ca - find vacation packages
ca.search.yahoo.com - Hawaii all inclusive Vacation Packages
search.yahoo.com - california vacation ideas
search.yahoo.com - vacaton package
ie.search.msn.com - caribean vacation
search.yahoo.com - all inclusive package deals from New York to Cancun
search.yahoo.com - best places to explore
search.msn.com - caribean vacation island packages
search.yahoo.com - vancation package
search.yahoo.com - puerto vallarta nude resorts
www.ask.com - all inclusive vacation places
search.yahoo.com - vacation packge
search.yahoo.com - vacation package
www.ask.com - caribean deals
search.msn.com - best hotels in caribean
search.msn.com - the best caribean vacation
www.google.com - related:www.exectourtravel.com/
ca.search.yahoo.com - vacation packages

This is just scratching the surface. One improvement we can make, is to identify the landing pages to which the keywords lead, so we can make sure visitors are finding what they want.

Usage

In order to use the script you need to download Python from http://www.python.org. The script should run in Unix/Linux, Mac, and Windows but I only tested it in Linux.

1. Copy your log file to the directory were you saved the script.

2. Change the name of the log file (inside the quotes) in the line log = open(’tripscan.actual_log’) to the name of your log file.

3. In the command line type: python LongTailMiner.py and you should see the report.

Gregarious FeedFlare Share This

Popularity: 12% [?]

Related posts

Did you enjoy reading this? Subscribe to this site! Choose from either daily email updates or our RSS feed and never miss a thing!

This entry was posted on Thursday, May 31st, 2007 at 3:54 pm and is filed under , , , , , . You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback from your own site.



6 comments ↓


Heather Paquinas on 06.04.07 at 7:59 pm

Hittail essentially does this, but makes it much easier.

Permalink

Hamlet Batista on 06.05.07 at 1:20 am

Yes. You are absolutely right.

I will try to make my tools accessible to the non-technical person.

Permalink

Tsim on 06.14.07 at 3:08 am

There’s another interesting tool for researching longtails: www.103bees.com

Hamlet, the solution you proposed sounds interesting, only it’s hard to understand for a non-technical person like me ;-)
P.S. Like your blog a lot, great content !

Regards,

Tsim

Permalink

Heather Paquinas on 06.20.07 at 7:42 pm

So you run tripscan.com?

Permalink

Hamlet Batista on 06.21.07 at 2:27 am

Tsim,

Thanks for your comment. I am working to make the tools available to use directly from the blog.

Heather,

That is correct. Tripscan.com is one of my sites.

Permalink

Newcastle Hotelier on 02.12.08 at 7:30 pm

Nice little tool. With regards keeping a good CTR accross so many long tail words, you could use the mining tool then use a tool like SpeedPPC to dynaically insert each of the keywords into the add. You could even then pass them in the link from google and use PHP to generate the landing page dynamically.

Hmm, looks like I am gonna be busy today. ;)

Permalink



Leave a Comment


 
pic2
There are many blogs about SEO. Many of them have done, and continue to do, a great job with traditional ideas. Unfortunately, knowing and doing what everybody else does is not a competitive advantage.

This blog is different. It’s about learning the most advanced SEO techniques, led by one of the industry’s up and coming SEO thinkers. Here you will find advanced search engine marketing tips and techniques that give you an edge over your competitors. The ideas are totally original: a fusion of Hamlet Batista’s own experience, research and careful experimentation, along with his readers’ questions, ideas, and thought-provoking input. Come along for the ride and explore, participate and push the limits of today’s SEO.
  » Read More