Copyscape and Content Stealing

I’ve seen the “protected by Copyscape” images before, but never bothered to check it out. I always assumed it was some code-related method to prevent people from being able to copy and paste or something of that nature. You know what happens when we assume :)

http://www.copyscape.com/

– per Willie Crawford’s blog

Anyhow, their site has a nice quick check – where you type in the URL of your page, and it searches for other pages on the web that have a large portion of common text.

Definitely on my useful list now, as I just noticed I had some user with 300+ page views on a site of mine that is usually lucky to average 5 per user. It became apparent that they just went to every page systematically and were saving them to their hard drive. I noticed that when the reported pages viewed in my web stats program began looking a bit suspicious:

Default Page
pagenameA.asp
pagenameB.asp
pagenameA.asp
Default Page
c:/documents and settings/***edited***/desktop/2ev/index.html
c:/documents and settings/***edited***/desktop/2ev/index.html
c:/documents and settings/***edited***/desktop/2ev/index.html

I’m tracking them whether or not the pages are being served from my server :) Shhh, let’s hope they go ahead and publish the pages and reveal their true identity!

I hate content stealers! Sheesh, the content on that site isn’t even very good! If they spent the time just writing their own they’d be much better served. Hopefully the search engines’ content filters will ding them hard for that, should they try and publish those pages.