EFFICIENT PAIR-WISE SIMILARITY COMPUTATION USING APACHE SPARK