Provided by: International Journal of Innovative Technology and Exploring Engineering (IJITEE)
Topic: Data Management
Spreadsheets are widely used in industry: it is estimated that end-user programmers outnumber professional programmers by a wide margin. However, spreadsheets are error-prone, and numerous companies have lost money because of spreadsheet errors. One cause of spreadsheet problems is the prevalence of copy-pasting. Building on existing text-based clone detection algorithms, algorithms have been developed to detect data clones in spreadsheets: formulas whose values are copied as plain text to a different location. The results of the evaluation indicate that data clones are common and that they pose a threat to spreadsheet quality. The data are cloned by including multiple copies of the encounter histories, i.e., by duplicating them.
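The notion of a data clone described above can be illustrated with a minimal Python sketch. It is not the algorithm evaluated in the paper; it is a simplified, value-matching approximation under stated assumptions. The `Cell` type, the `find_data_clone_candidates` function, the `min_matches` threshold, and the sample sheet names are all hypothetical and introduced only for illustration. The sketch assumes each cell is already known to be either a formula (with its computed value) or a plain constant, and it flags a column as a clone candidate when several of its constants equal values produced by formulas elsewhere.

```python
from collections import defaultdict
from typing import NamedTuple, Union


class Cell(NamedTuple):
    """Hypothetical cell record; not taken from the paper."""
    sheet: str
    row: int
    col: int
    value: Union[int, float]
    is_formula: bool  # True if the value was computed by a formula


def find_data_clone_candidates(cells, min_matches=2):
    """Return columns whose plain constants repeat formula results.

    The result maps (sheet, col) to the matching constant cells, sorted
    by row. Columns with fewer than `min_matches` matching constants are
    dropped, since a single equal value is likely a coincidence.
    Values are compared with exact equality (an assumption of this sketch).
    """
    # Index every formula cell by the value it computes.
    formulas_by_value = defaultdict(list)
    for cell in cells:
        if cell.is_formula:
            formulas_by_value[cell.value].append(cell)

    # Collect constants whose value also appears as a formula result.
    matches_by_column = defaultdict(list)
    for cell in cells:
        if not cell.is_formula and cell.value in formulas_by_value:
            matches_by_column[(cell.sheet, cell.col)].append(cell)

    return {
        column: sorted(matched, key=lambda c: c.row)
        for column, matched in matches_by_column.items()
        if len(matched) >= min_matches
    }


# Illustrative data: three formula results on "Calc" pasted as values on "Report".
cells = [
    Cell("Calc", 0, 0, 10.5, True),
    Cell("Calc", 1, 0, 20.25, True),
    Cell("Calc", 2, 0, 30.75, True),
    Cell("Report", 0, 1, 10.5, False),
    Cell("Report", 1, 1, 20.25, False),
    Cell("Report", 2, 1, 30.75, False),
    Cell("Report", 3, 1, 99, False),    # unrelated constant
    Cell("Report", 5, 3, 10.5, False),  # lone coincidental match
]

clones = find_data_clone_candidates(cells)
```

With this sample data, only column 1 of the hypothetical "Report" sheet is reported, because the single coincidental match in column 3 falls below the threshold. A practical detector would also need to tolerate rounding differences and to check that matching cells are adjacent, which this sketch deliberately leaves out.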