DESIGN AND DEVELOPMENT OF METHODOLOGY FOR SEMANTIC DOCUMENT CLUSTERING OF GENOMIC AND PROTEIN SEQUENCES