[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [tor-talk] Yelp blocking Tor users from viewing entire site
It would be interesting to know if sites like Yelp/Craiglists are more
afraid of anonymity and possible spam/trolls than crawlers.
If they cannot detect a crawler using Tor, then they cannot detect any
other crawler, like a crawler switching IPs as mentionned in another
post, using vpns or proxies, etc
So in that case it's useless to block Tor, because Tor network's size is
not really significant compared to other means that crawlers have,
probably they just choosed the easy way as well as crawlers might have
chosen the easy way too (use Tor), blocking Tor so they have solved one
But in fact they have solved nothing if they are not protected against
crawlers, and if they are protected the protection would be something
like blocking the IP or sending a captcha.
Maybe the exit nodes could implement an anti-crawler feature, even if
the crawler is switching among 1000 exit nodes I think it's feasible to
fingerprint it in the Tor network finite space, I don't know if there
are studies about this, an efficient crawler can never behave like a
human being or a normal browser.
This might sound like a kind of censorship but that's probably not the
goal of the Tor network to crawl and spam the web, the exit nodes that
would have removed the feature would just get blocked.
Le 20/01/2015 21:24, Roger Dingledine a écrit :
On Tue, Jan 20, 2015 at 03:16:43PM -0500, Greg Norcie wrote:
So today I noticed Yelp appears to be blocking Tor.
I tried using multiple identities, but get a 503 error every time.
Anyone else have this issue?
This seems really overbroad... I could understand the argument
against letting a Tor user post reviews, but to disallow simple
browsing seems like overkill.
See their two mentions in
I think Yelp blocks Tor for a similar reason to why Craigslist blocks
Tor -- their value-add is their web pages, and if their competitors
"steal" their web pages, they've got nothing else.
That kind of business model leads to jealously guarding all of their
pages from being crawled or otherwise viewed en masse -- which in turn
leads to being angry and bitter at every new Internet technology.
Unfortunately, since this business model is so fragile, I don't have
good ideas on how to make it compatible with Tor (or heck, with most
of the rest of the Internet).
Peersm : http://www.peersm.com
node-Tor : https://www.github.com/Ayms/node-Tor
GitHub : https://www.github.com/Ayms
tor-talk mailing list - email@example.com
To unsubscribe or change other settings go to