Parallel algorithms for enabling fast and scalable analysis of high-throughput sequencing datasets