AbstractEnvironmental DNA (eDNA) metabarcoding has become widely applied to gauge biodiversity in a noninvasive and cost-efficient manner. The detection of species using eDNA metabarcoding is, however, imperfect owing to various factors that can cause false negatives in the inherent multi-stage workflow.Imperfect detection in the multi-stage workflow of eDNA metabarcoding also raises an issue of study design: namely, how available resources should be allocated among the different stages to optimize survey efficiency.Here, we propose a variant of the multispecies site occupancy model for eDNA metabar-coding studies where samples are collected at multiple sites within a region of interest. This model describes the variation in sequence reads, the unique output of the high-throughput sequencers, in terms of the hierarchical workflow of eDNA metabarcoding and interspecific heterogeneity, allowing the decomposition of the sources of variation in the detectability of species throughout the different stages of the workflow. We also introduced a Bayesian decision analysis framework to identify the study design that optimizes the efficiency of species detection with a limited budget.An application of the model to freshwater fish communities in the Lake Kasumigaura watershed, in Japan, highlighted a remarkable inhomogeneity in the detectability of species, indicating a potential risk of the biased detection of specific species. Species with lower site occupancy probabilities tended to be difficult to detect as they had lower capture probabilities and lower dominance of the sequences. The expected abundance of sequence reads was predicted to vary by up to 23.5 times between species.An analysis of the study design suggested that ensuring multiple within-site replications of the environmental samples is preferred in order to achieve better species detection efficiency, provided that a throughput of tens of thousands of sequence reads was secured.The proposed framework makes the application of eDNA metabarcoding more error-tolerant, allowing ecologists to monitor ecological communities more efficiently.