Academy & Industry Research Collaboration Center
Table of Contents (TOC) detection has drawn attention in recent years because it plays an important role in the digitization of multi-page documents. Books are typically multi-page documents, so detecting the table of contents page becomes necessary both for easy navigation and for faster retrieval of the desired information from the document. Table of contents pages follow different layouts and present the contents of the document, such as chapters, sections, and subsections, in different ways. This paper introduces a new method to detect table of contents pages using a machine learning technique with different features.