1
Getty Images Letter Forum / Latest image scraper – a troll by any other name
« on: January 10, 2013, 06:08:27 PM »
Today as I was looking over our server logs for the New Year to make sure my new security measures were keeping the bad bots at bay, I came across this little nugget of info in my browsers used log…
BPImageWalker/2.0 (www.bdbrandprotect.com)
Haha! a new image scraper and from the well laid out name of the browser (they rarely are this kind), I knew they were not scanning my server to help me out in any way. A little research and I found out that this Canadian based company does “brand protection” services, and one of those services is scanning of images. Here is a link to a pdf that describes their capabilities…
http://www.brandprotect.com/files/BP_Services_Unique_Capabilities.pdf
More research turned up remarks on user agent forums about how this company’s bots ignore the robots.txt file (much like PicScout’s) and that most webmasters just block their domains and IP addresses. I was able to find out the domains they like to use…
bdbrandprotect.com
brandprotect.com
brandimensions.net
brandimensions.com
Here are the IP addresses I was able to verify so far….
72.14.164.103
72.14.163.101
72.14.163.107
72.14.170.60
216.183.93.163
And these addresses have been reported as some they have used in the past (could not verify with DNS records other than they are used at the same hosting company)…
72.14.164.122
72.14.164.131
72.14.164.143
72.14.164.157
72.14.164.161
72.14.164.176
72.14.164.183
Tomorrow I will be going over the firewall logs to see if I can spot exactly what kind of traffic and what time they “accessed” my web server. Now, this may be a legitimate company doing legitimate business, but if they have ways to “bypass” the most basic of web server security settings, then in my opinion, they are no better than hackers and I would refuse to do business with such companies.
BPImageWalker/2.0 (www.bdbrandprotect.com)
Haha! a new image scraper and from the well laid out name of the browser (they rarely are this kind), I knew they were not scanning my server to help me out in any way. A little research and I found out that this Canadian based company does “brand protection” services, and one of those services is scanning of images. Here is a link to a pdf that describes their capabilities…
http://www.brandprotect.com/files/BP_Services_Unique_Capabilities.pdf
More research turned up remarks on user agent forums about how this company’s bots ignore the robots.txt file (much like PicScout’s) and that most webmasters just block their domains and IP addresses. I was able to find out the domains they like to use…
bdbrandprotect.com
brandprotect.com
brandimensions.net
brandimensions.com
Here are the IP addresses I was able to verify so far….
72.14.164.103
72.14.163.101
72.14.163.107
72.14.170.60
216.183.93.163
And these addresses have been reported as some they have used in the past (could not verify with DNS records other than they are used at the same hosting company)…
72.14.164.122
72.14.164.131
72.14.164.143
72.14.164.157
72.14.164.161
72.14.164.176
72.14.164.183
Tomorrow I will be going over the firewall logs to see if I can spot exactly what kind of traffic and what time they “accessed” my web server. Now, this may be a legitimate company doing legitimate business, but if they have ways to “bypass” the most basic of web server security settings, then in my opinion, they are no better than hackers and I would refuse to do business with such companies.