Multi-layered approach for tracking generic multiple objects and extraction of video objects for performance analysis