Big Data Processing with MapReduce for E-Book

Provided by: Science & Engineering Research Support soCiety (SERSC)
Topic: Data Management
Format: PDF
Evolution of IT and computer has made e-books popular day by day. In this paper, the authors are interested in searching a word in e-books. However, it is impossible to search a word in digitized e-books if they consist of image files such as JPG and PDF. Their solution to this problem is to transform the image file based e-books into text files based e-books to enable searching a word in e-books. They use EPUB, a XML-based text file, which is defined by IDPF (International Digital Publishing Forum).

Find By Topic