If there are multiple matches above the threshold, they are returned all, sorted by score, up to the results count limit (candidates count). It is up to the application to decide what to do with multiple users matching with score above threshold. If you have too many false positives, it might help to raise the similarity threshold.