This is a public-interest archive. Personal data is pseudonymized and retained under GDPR Article 89.

Re: blogs and scrapers


> A second note - here's the list of blogs this person is scraping.
> http://gardening-blogs.emoondo.com/sitemap
> 
> to help you out with the search

Maybe it's time to start embedding an image CAPTCHA that people have to
solve before they can read past the first paragraph of a blog entry.

(A CAPTCHA is the test where you type in the numbers or letters you
see or hear, which keeps automated robots from harvesting content.)

Now, what I would do is search my logs for the scraper's IP address and
then modify my blog to serve different content to just that address.  Maybe
embed a few non-family-friendly images, a la goatse.cx[1], for giggles.
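
For anyone who wants to try the log-tracing idea, here is a rough sketch
in Python. It is only an illustration under some assumptions: the access
log is in the usual Common/Combined Log Format with the client address
first on each line, "lots of requests" is a good enough sign of a scraper,
and the function names, the threshold, and the decoy text are all made up
for this example rather than taken from any particular blog software.

```python
import re
from collections import Counter

# Assumption: each log line starts with the client address, as in the
# Common/Combined Log Format used by most web servers.
LOG_IP = re.compile(r"^(\S+) ")


def count_requests_by_ip(log_lines):
    """Count how many requests each client address made."""
    counts = Counter()
    for line in log_lines:
        match = LOG_IP.match(line)
        if match:
            counts[match.group(1)] += 1
    return counts


def suspected_scrapers(log_lines, threshold=100):
    """Return addresses with at least `threshold` requests (hypothetical cutoff)."""
    counts = count_requests_by_ip(log_lines)
    return {ip for ip, hits in counts.items() if hits >= threshold}


def body_for(ip, real_body, decoy_body, flagged_ips):
    """Pick the decoy text for flagged addresses and the real post otherwise."""
    return decoy_body if ip in flagged_ips else real_body
```

You would feed it the lines of your access log, look over the flagged
addresses by hand (search engine crawlers will show up too), and then
have the blog call something like body_for() when it builds a page.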

Chris

[1] http://en.wikipedia.org/wiki/Goatse.cx (NSFW article text)

http://www.hort.net/gallery/      4135 online plant photos and growing!
http://www.hort.net/gallery/date/2007-07-01/       The latest additions
http://www.bonvivantnursery.com/                     Bon Vivant Nursery
_______________________________________________
gardenwriters mailing list
gardenwriters@lists.ibiblio.org
http://lists.ibiblio.org/mailman/listinfo/gardenwriters

GWL has searchable archives at:
http://www.hort.net/lists/gardenwriters

Send photos for GWL to gwlphotos@hort.net to be posted
at: http://www.hort.net/lists/gwlphotos

Post gardening questions/threads to
"Gardenwriters on Gardening" <gwl-g@lists.ibiblio.org>

For GWL website and Wiki, go to
http://www.ibiblio.org/gardenwriters


