Association for Computing Machinery
Demand for access to large amounts of heterogeneous structured data is emerging as a trend among many users and applications. However, the effort involved in querying heterogeneous, distributed third-party databases can create major barriers for data consumers. At the core of this problem is the semantic gap between the way users express their information needs and the way the data is represented. This paper aims to provide a natural language interface and an associated semantic index that support a greater degree of vocabulary independence for queries over Linked Data/Semantic Web datasets, using a distributional-compositional semantics approach.