Too much data, too little time
Sometimes, a sequencing run generates a lot of data. Say we’re only interested in getting a quick preview of data quality without having to analyze our entire dataset.
Use the fastp README to find a parameter that allows you to only process the first 10,000 reads of the FASTQ files HG004_R1.fastq.gz and HG004_R2.fastq.gz.
fastp on only the first 10,000 reads