Could anybody suggest some software/algorithm for providing both content and keyword based filtering for a squid proxy running on Linux? The problem should be solved in such a way that it can differentiate sex from Essex and chat from chatterjee etc.