Robots and Spiders and Bots -- Oh my
Looking through the access logs for my web site, I find that most of the accesses are from robots and spiders and bots. Some of them I know about, others I've never heard of before. Somehow I don't think my site would be very relevant to "ichiro/mobile goo;+http://help.goo.ne.jp/door/crawler.html". Several of them have broken URL parsing code that assumes that all links are relative, "Purebot/1.1; +http://www.puritysearch.net/)" is one example. "Baiduspider+(+http://www.baidu.com/search/spider.htm)" frequently fetches the top-level page from multiple IPs, and doesn't get anything else.
There are also a significant attempts to break into php and/or sql, neither of which I have installed. One of them has "whitehat" it it's URL, as if anyone would be stupid enough to believe that.
Of the over 1700 IPs that have accessed my site so far, I think several hundred are real people that looked at my web site at least once. My use of a picture in an rv.net forum post accounts for a bunch of the IPs. Several people are checking back manually, and only one has subscribed to the rss feed via google.