Alexa Web Information Service support for Glype Proxy
The short & sweet:
If you own a web proxy service using glype and want to stop people from trying to get to adult sites, you’ll want this file:
http://natefanaro.com/scripts/awis_lib_v02.zip
This will work with Glype 0.5.2 and I’m sure 0.5.1. Your mileage may vary with anything lower than that.
Keep reading if you want but how to install it is in the file.
I own a few proxy sites for kids to get around blocks at school or people to use at work etc. But there are two things that I totally don’t want on my network: porn (mostly of the child variety) and spam.
I had a pretty decent setup for a while with porn filtering. I’d wake up every day and pour though images, logs or whatever I could find and block them. Some was done automatically and some are done by hand. This required a lot of time and energy that was quickly fading away. The problem with me not having time for this upkeep is the amount of money that I can make on ads suffers because of it. Every time a site is blocked, they are taken back to the home page of the proxy and told not to do that. The home page has all of the ads. Some are for other proxies. They want to go somewhere else? That’s fine with me! Click on an ad, get me some change and off my back, and they get a proxy to do what they want. So upkeep on blocking porn is pretty important to me on keeping the revenue up.
I personally know that all of this upkeep is bad and would eventually have to stop. I’ve been looking around for any type of API service that I could use to automate this thing. I didn’t find anything free. But Amazon’s API with Alexa works well and is not that expensive. Hopefully the revenue earned by your proxy will cover that cost. ($0.15 per 1k requests, not bad.)
When any url is loaded, the url is sent off to AWIS. They return a bunch of information about any site but we’re just after one thing: AdultContent. If that site has adult content, it’s blocked. People get an error message and sent back to the homepage. If it’s allowed, that’s great. They go through and that’s the end of that. Now this could be VERY expensive but we’re caching the data in a MySQL table. So the second time that website is hit, the script will get the information out of the cache instead of costing you .00015 cents.
To use the script just open it up and read the instructions. You can create an account with Amazon Web Services here: http://aws.amazon.com and then sign up with Alexa Web Information Service (AWIS).
I’m assuming that your first day is going to be the most expensive since nothing is cached. Just bear with it. Eventually the cost will go down. I started on a Saturday and it’s cost me 17 cents so far. I’d recommend keeping your eye on it if you’re worried about cash.
Enjoy!