http://en.wikipedia.org/wiki/S%C3%B8rensen_similarity_index#cite_note-4The Sørensen index, also known as Sørensen’s similarity coefficient, is a statistic used for comparing the similarity of two samples. where A and B are the species numbers in samples A and B, respectively, and C is the number of species shared by the two samples.
The Sørensen index is identical to Dice's coefficient which is always in [0, 1] range. Sørensen index used as a distance measure, 1 ? QS, is identical to Hellinger distance and Bray–Curtis dissimilarity.
The Sørensen coefficient is mainly useful for ecological community data (e.g. Looman & Campbell, 1960[3]). Justification for its use is primarily empirical rather than theoretical (although it can be justified theoretically as the intersection of two fuzzy sets[4]). As compared to Euclidean distance, Sørensen distance retains sensitivity in more heterogeneous data sets and gives less weight to outliers
This function assumes you pass in a and b as sets.
The sorensen index extended to abundance instead of incidence of species is called the Czekanowski index.
Comments top
No comments for sorensen-index. Log in to add a comment.