International Journal of Advanced Research in Computer Science and Software Engineering (IJARCSSE)
Relational data has many variants like SQL server, Oracle, Mysql etc. Each relational database system has primary and foreign key concept by which the matching of records can be done. There are lot of services like OLAP have features to identify the duplicity of records. There are few models addressed and surveyed on the duplicate detection of hierarchical data like eXtensible Markup Language (XML). An algorithm XMLDup can be addressed a Bayesian network to determine the XML elements probability. The heterogeneous data duplication method can also be used for XMLDup to develop and guarantee hierarchical duplication using XML mining.