![]() |
![]() |
|
|
||
Because of the large number of unwanted email that webmasters are receiving, hiding email addresses from spammers' email harvesting robots is becoming commonplace. Methods vary (see below). There is no guarantee. If a browser can reveal your email address to site users or create a link with it or send it to a script, chances are the dark one's robots can harvest it. Their robots are likely to get smarter and smarter in order to circumvent ever more sophisticated obfuscation attempts. One could argue that spammers, realizing the fact that those who attempt to hide their email addresses wouldn't buy into their "marketing messages" anyway, won't make an attempt to harvest hidden email addresses. However, harvesters want valid email addresses, especially if the addresses are going on a CD for sale to spammers. Looking at it that way, email addresses in hiding are near 100% certain to be valid, with a live recipient at the other end. Thus, they could be considered prime value. To demonstrate how easy it is to find your "hidden" email addresses, I've made a CGI program that can be used to find them. Simply type in the URL of your page and the program will tell you which email addresses it found. http://willmaster.com/a/15/pl.pl?art153demo Methods of hiding email addresses include:
The anti-spam function function of Advanced Email Link Generator with Anti-Spam Encoder from http://willmaster.com/a/15/pl.pl?aelgwase uses this encoding method. "ab@ab.com" becomes:
ab@ab.com
%61%62%40%61%62%2e%63%6f%6d
a<!-- hi :) -->b@ab.com
might cause the robot to incorrectly assume "b@ab.com" is the email address. Encoding "ab@ab.com" as
ab<!-- X -->@<!---->a<!-- -->b.com
might cause the robot to miss the email address altogether.
a<!-- hi :) -->b@ab.com
The encoded email address could contain a multi-line HTML comment tag:
a<!-- hi :) -->b@<!-- any text here -->ab.com
As a link, "ab@ab.com" could be encoded as:
a<!-- hi :) -->b%40%61b.<!---->com
This would print "ab@ab.com" on a web page:
<script type="text/javascript" language="JavaScript"><!--
document.write('ab');
document.write('@');
document.write('ab.com');
//--></script>
JavaScript code could be much more convoluted. Even the use of document.write() can be hidden. But the above should work for any robots that can't read JavaScript. The demonstration to find your hidden email addresses at http://willmaster.com/a/15/pl.pl?art153demo is designed to find email addresses hidden with the first four methods listed above. If you successfully hide an email address with any of those methods, please let me know the URL of the page so I can upgrade the program. The JavaScript method hasn't yet been incorporated into the program. It will be a bit of work because of the myriad ways JavaScript can be coded, but not impossible. If you become aware of a method not addressed above, please let me know. I sincerely want to keep the program updated. It can reveal vulnerable pages to webmasters. I know of no way to extract email addresses from images. That method will probably be safe for the foreseeable future. (I've quit saying "it's impossible" because I almost always had ended up being proved wrong.) Do you have vulnerable pages? Use the form at the demonstration page and see. If you wish, you may offer the same demonstration to your site visitors at no charge. Copy the URL from your browser's address bar and offer it to your site visitors. Put the URL into a link, into a popup, into an iframe, or even into an ebook to offer readers the service of checking their pages for email harvesting vulnerability. The CGI program underlying the demonstration is not and will not be made available to the public. Otherwise, and contrary to its current purpose, it might be converted into an automated email address harvester. So I keep the program code secret and won't distribute it. By: Will Bontrager Copyright 2002 Bontrager Connection, LLC
| ||