Analysis and design of algorithms to optimize performance to unstructured data from cloud environment