Starting with a known fact: Sorenson-Dice Index is the same as F1 Score. A fact I wasn’t aware of, given that I learned about the two in different occasions and for different applications.
F1 Score is usually taught in Information Retrieval courses as a metric to evaluate retrieval systems. They never said that it’s nothing but a reformulation of Sorenson-Dice Index but in terms of recall and precision. I’ll go through the proof quickly.
For any two sets and , the Sorenson-Dice Index is defined as
On the other hand , where is recall and is precision. Now we need to prove that the previous formula could be reduced to Sorenson-Dice Index.
First we need to rewrite the equation in terms of sets. For any two sets and , we consider that is the set of relevant documents, and is the set of retrieved documents. That way we end up with the following equations: , and .
Which is the same as The Sorenson-Dice Index.