I am interested to know the principle of Sound and Picture Recognition (in automation). For example, when I captured commercials from TV, how can I recognize it again in next time when I captured the same track? Any outstanding source I can reach?