MHASS: Microbiome HiFi Amplicon Sequencing Simulator
Abstract
Summary: Microbiome HiFi Amplicon Sequence Simulator (MHASS) creates realistic synthetic PacBio HiFi amplicon sequencing datasets for microbiome studies, by integrating genome-aware abundance modeling, realistic dual-barcoding strategies, and empirically derived pass-number distributions from actual sequencing runs. MHASS generates datasets tailored for rigorous benchmarking and validation of long-read microbiome analysis workflows, including ASV clustering and taxonomic assignment. Availability and Implementation: Implemented in Python with automated dependency management, the source code for MHASS is freely available at https://github.com/rhowardstone/MHASS along with installation instructions. Contact: rye.howard-stone@uconn.edu or ion.mandoiu@uconn.edu Supplementary information: Supplementary data are available online at https://github.com/rhowardstone/MHASS_evaluation.