Convolutional Networks and Applications in Vision
Source: New York University
Intelligent tasks, such as visual perception, auditory perception, and language understanding require the construction of good internal representations of the world (or "Features"), which must be invariant to irrelevant variations of the input while, preserving relevant information. A major question for machine learning is how to learn such good features automatically. Convolutional Networks (ConvNets) are a biologically-inspired trainable architecture that can learn invariant features. Each stage in a ConvNets is composed of a filter bank, some non-linearities, and feature pooling layers. With multiple stages, a ConvNet can learn multi-level hierarchies of features. While ConvNets have been successfully deployed in many commercial applications from OCR to video surveillance, they require large amounts of labeled training samples.