Provided by: Association for Computing Machinery
Topic: Data Management
Date Added: Jan 2012
In general, a relational DBMS provides limited capabilities to perform multidimensional statistical analysis, which requires manipulating vectors and matrices. In this paper, the authors study how to extend a DBMS with basic vector and matrix operators by programming User-Defined Functions (UDFs). They carefully analyze UDF features and limitations to implement vector and matrix operations commonly used in statistics, machine learning and data mining, paying attention to DBMS, operating system and computer architecture constraints. UDFs represent a C programming interface that allows the definition of scalar and aggregate functions that can be used in SQL.