Hello again Bao. I have another question .. how does SEED incorporate read abundance when determining a representative sequence? For instance, let's say I have a cluster with 10 different sequences. One of these sequences is present at 1,000 reads, while the other nine sequences are present only once in the data. Do all ten sequences count 'equally' in terms of determining the representative sequence? Or is the determination of the representative weighted by abundance (in which case we'd expect the 1,000-read sequence to be the representative in my example).
Thanks again!
Hello again Bao. I have another question .. how does SEED incorporate read abundance when determining a representative sequence? For instance, let's say I have a cluster with 10 different sequences. One of these sequences is present at 1,000 reads, while the other nine sequences are present only once in the data. Do all ten sequences count 'equally' in terms of determining the representative sequence? Or is the determination of the representative weighted by abundance (in which case we'd expect the 1,000-read sequence to be the representative in my example).
Thanks again!