Algorithmic Programming Language Identification

Motivated by the amount of code that goes unidentified on the web, the authors introduce a practical method for algorithmically identifying the programming language of source code. Their work is based on supervised learning and intelligent statistical features. They also explored, but abandoned, a grammatical approach. In testing, their implementation greatly outperforms that of an existing tool that relies on a Bayesian classifier. Code is written in Python and available under an MIT license. The purpose of algorithmic programming language detection is to determine the programming language with which a particular program or program fragment was written.

Provided by: University of Rochester Topic: Software Date Added: Nov 2011 Format: PDF

Find By Topic