Provided by: International Journal of Advanced Technology in Engineering and Science (IJATES)
Topic: Data Management
The problems start during data acquisition, when the bulk data requires to make decisions, currently in an ad hoc manner, about what data to keep and what to discard and how to store what the users keep reliably with the right metadata. Many data today is not natively in structured format, for e.g.: blogs and tweets are weakly structured pieces of text, while images and video are structured for storage and display, but not for semantic content and search, so transforming such content into a structured format for later study is a major test.