Hi, how can we reproduce the F1 score reported in the paper for the nested NER task? And why are the results so low, as shown below, when I run the evaluation following the steps described in the GitHub repository?
--- NER ---
type     precision    recall    f1-score    support
ORG           0.00      0.00        0.00        508
PER          20.42      2.28        4.11       1270
WEA           0.00      0.00        0.00         41
VEH           0.00      0.00        0.00         12
GPE           4.00      0.19        0.35        540
FAC           0.00      0.00        0.00         67
LOC           0.00      0.00        0.00         76
micro        17.96      1.19        2.24       2514
macro         3.49      0.35        0.64       2514
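
As a sanity check on how I am reading this report, the macro row seems to be the plain unweighted mean of the seven per-type rows. Below is a minimal sketch under that assumption; the `per_type` dict and the `macro_average` helper are my own names for illustration, not anything from the repository.

```python
# Per-type (precision, recall, f1) values copied from the report above.
per_type = {
    "ORG": (0.00, 0.00, 0.00),
    "PER": (20.42, 2.28, 4.11),
    "WEA": (0.00, 0.00, 0.00),
    "VEH": (0.00, 0.00, 0.00),
    "GPE": (4.00, 0.19, 0.35),
    "FAC": (0.00, 0.00, 0.00),
    "LOC": (0.00, 0.00, 0.00),
}


def macro_average(rows):
    """Unweighted mean of each metric column across entity types.

    Assumption: every type counts equally, regardless of its support.
    """
    n = len(rows)
    columns = zip(*rows.values())  # (precisions, recalls, f1s)
    return tuple(round(sum(col) / n, 2) for col in columns)


macro = macro_average(per_type)
print(macro)  # (3.49, 0.35, 0.64), matching the macro row above
```

Since this reproduces the macro row, the summary lines look consistent with the per-type rows, so the low scores seem to come from the predictions themselves (five of the seven types have no correct predictions at all) rather than from how the averages are computed.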