EVALITA 2007 The Named Entity Recognition Task Manuela
- Slides: 16
EVALITA 2007 The Named Entity Recognition Task Manuela Speranza, FBK-irst
Outline • Named Entity Recognition at EVALITA 2007 – Introduction to the task – Participants • Evaluation – Dataset – Metrics • Results – Ranking – Discussion • Conclusion EVALITA 2007 Workshop Rome, September 10, 2007
Introduction to the NER Task • Task: Recognize Named Entities in Italian newspaper articles • Four types of Named Entities: – – Geo-Political Entities (GPE): e. g. Italy Location Entities (LOC): e. g. Tevere Organization Entities (ORG): e. g. FIAT Person Entities (PER): e. g. Napolitano • Based on the ACE Entity Recognition and Normalization Task • Adaptations from ACE: – limit the task to the recognition of Named Entities – adapt it to Italian EVALITA 2007 Workshop Rome, September 10, 2007
Participants • In the NER Task we had six participants: – – – FBK-irst, Trento (FBKirst_Zanoli_NER) LDC, University of Pennsylvania (LDC_Walker_NER) University of Alicante (Uni. Ali_Kozareva_NER) University of Dortmund (Uni. Dort_Jungermann_NER) University of Duisburg-Essen (Uni. Du. E_Roessler_NER) Yahoo, Barcelona (Yahoo_Ciaramita_NER) • Only one Italian institution, while two from Spain and two from Germany • One participant from the USA EVALITA 2007 Workshop Rome, September 10, 2007
Evaluation Dataset: I-CAB (i) • 525 news stories from the Italian local newspaper “L’Adige” • 7 -8 September 2004 • 7 -8 October 2004 • 4 days • 5 categories • Two sections • • • News Stories Cultural News Economic News Sports News Local News • training (335 news stories) • test (190 news stories) Number of words = 182. 500 Average number of words per file = 348 EVALITA 2007 Workshop Rome, September 10, 2007
Evaluation Dataset: I-CAB (ii) Training # News stories Test Total 335 190 525 7, 227 4, 002 11, 229 # Words 113, 634 68, 930 182, 564 # Tokens 132, 587 79, 889 212, 476 # GPE 1, 740 1, 073 2, 813 # LOC 240 122 362 # ORG 2, 518 1, 140 3, 658 # PER 2, 936 1, 641 4, 577 # Sentences EVALITA 2007 Workshop Rome, September 10, 2007
Evaluation of Results • Scorer: CONLL Shared Task 2002 • Metrics: Precision (Pr. ), Recall (Re. ), and F-Measure (FB 1) • Official ranking is based on FB 1 EVALITA 2007 Workshop Rome, September 10, 2007
Official Ranking FB 1 Participant Over. FB 1 Over. Prec. Over. Rec. GPE LOC ORG PER 1 FBKirst_Zanoli_r 2 82. 14 83. 41% 80. 91% 85. 54 73. 04 64. 27 92. 12 2 FBKirst_Zanoli_r 1 81. 28 82. 97% 79. 65% 85. 52 73. 04 64. 06 90. 40 3 Uni. Du. E_Roessler_r 1 72. 27 71. 62% 72. 94% 78. 39 53. 92 49. 89 84. 42 4 Uni. Du. E_Roessler_r 2 71. 93 73. 28% 70. 62% 78. 75 54. 73 49. 01 83. 64 5 Yahoo_Ciaramita_r 1 68. 99 71. 28% 66. 85% 75. 38 52. 83 49. 08 78. 89 6 Yahoo_Ciaramita_r 2 68. 15 70. 44% 66. 00% 75. 08 52. 31 46. 85 78. 36 7 Uni. Dort_Jungermann_r 2 67. 90 70. 93% 65. 12% 73. 18 46. 07 45. 85 79. 78 8 Uni. Dort_Jungermann_r 1 67. 79 70. 93% 64. 91% 73. 18 46. 07 45. 74 79. 58 9 Uni. Ali_Kozareva 66. 59 62. 73% 70. 95% 72. 60 47. 26 47. 81 78. 66 10 LDC_Walker_r 1 63. 10 83. 05% 50. 88% 65. 25 52. 94 40. 70 75. 39 11 LDC_Walker_r 2 62. 70 82. 12% 50. 70% 65. 13 50. 56 36. 26 76. 44 - BASELINE 41. 11 42. 44% 39. 86% 69. 67 27. 63 40. 32 25. 48 - BASELINE -u 36. 85 40. 29% 33. 95% 57. 64 26. 32 39. 43 25. 55 Rank
Official Ranking FB 1 Participant Over. FB 1 Over. Pre. Over. Rec. GPE LOC ORG PER 1 FBKirst_Zanoli_r 2 82. 14 83. 41% 80. 91% 85. 54 73. 04 64. 27 92. 12 2 FBKirst_Zanoli_r 1 81. 28 82. 97% 79. 65% 85. 52 73. 04 64. 06 90. 40 3 Uni. Du. E_Roessler_r 1 72. 27 71. 62% 72. 94% 78. 39 53. 92 49. 89 84. 42 4 Uni. Du. E_Roessler_r 2 71. 93 73. 28% 70. 62% 78. 75 54. 73 49. 01 83. 64 5 Yahoo_Ciaramita_r 1 68. 99 71. 28% 66. 85% 75. 38 52. 83 49. 08 78. 89 6 Yahoo_Ciaramita_r 2 68. 15 70. 44% 66. 00% 75. 08 52. 31 46. 85 78. 36 7 Uni. Dort_Jungermann_r 2 67. 90 70. 93% 65. 12% 73. 18 46. 07 45. 85 79. 78 8 Uni. Dort_Jungermann_r 1 67. 79 70. 93% 64. 91% 73. 18 46. 07 45. 74 79. 58 9 Uni. Ali_Kozareva 66. 59 62. 73% 70. 95% 72. 60 47. 26 47. 81 78. 66 10 LDC_Walker_r 1 63. 10 83. 05% 50. 88% 65. 25 52. 94 40. 70 75. 39 11 LDC_Walker_r 2 62. 70 82. 12% 50. 70% 65. 13 50. 56 36. 26 76. 44 - BASELINE 41. 11 42. 44% 39. 86% 69. 67 27. 63 40. 32 25. 48 - BASELINE -u 36. 85 40. 29% 33. 95% 57. 64 26. 32 39. 43 25. 55 Rank
Discussion FB 1 Participant Over. FB 1 Over. Prec. Over. Rec. GPE LOC ORG PER 1 FBKirst_Zanoli_r 2 82. 14 83. 41% 80. 91% 85. 54 73. 04 64. 27 92. 12 2 FBKirst_Zanoli_r 1 81. 28 82. 97% 79. 65% 85. 52 73. 04 64. 06 90. 40 3 Uni. Du. E_Roessler_r 1 72. 27 71. 62% 72. 94% 78. 39 53. 92 49. 89 84. 42 4 Uni. Du. E_Roessler_r 2 71. 93 73. 28% 70. 62% 78. 75 54. 73 49. 01 83. 64 5 Yahoo_Ciaramita_r 1 68. 99 71. 28% 66. 85% 75. 38 52. 83 49. 08 78. 89 6 Yahoo_Ciaramita_r 2 68. 15 70. 44% 66. 00% 75. 08 52. 31 46. 85 78. 36 7 Uni. Dort_Jungermann_r 2 67. 90 70. 93% 65. 12% 73. 18 46. 07 45. 85 79. 78 8 Uni. Dort_Jungermann_r 1 67. 79 70. 93% 64. 91% 73. 18 46. 07 45. 74 79. 58 9 Uni. Ali_Kozareva 66. 59 62. 73% 70. 95% 72. 60 47. 26 47. 81 78. 66 10 LDC_Walker_r 1 63. 10 83. 05% 50. 88% 65. 25 52. 94 40. 70 75. 39 11 LDC_Walker_r 2 62. 70 82. 12% 50. 70% 65. 13 50. 56 36. 26 76. 44 - BASELINE 41. 11 42. 44% 39. 86% 69. 67 27. 63 40. 32 25. 48 - BASELINE -u 36. 85 40. 29% 33. 95% 57. 64 26. 32 39. 43 25. 55 Rank
Discussion Rank Participant Over. FB 1 Over. Prec. FB 1 Over. Rec. GPE LOC ORG PER 1 FBKirst_Zanoli_r 2 82. 14 83. 41% 80. 91% 85. 54 73. 04 64. 27 92. 12 2 FBKirst_Zanoli_r 1 81. 28 82. 97% 79. 65% 85. 52 73. 04 64. 06 90. 40 3 Uni. Du. E_Roessler_r 1 72. 27 71. 62% 72. 94% 78. 39 53. 92 49. 89 84. 42 4 Uni. Du. E_Roessler_r 2 71. 93 73. 28% 70. 62% 78. 75 54. 73 49. 01 83. 64 5 Yahoo_Ciaramita_r 1 68. 99 71. 28% 66. 85% 75. 38 52. 83 49. 08 78. 89 6 Yahoo_Ciaramita_r 2 68. 15 70. 44% 66. 00% 75. 08 52. 31 46. 85 78. 36 7 Uni. Dort_Jungermann_r 2 67. 90 70. 93% 65. 12% 73. 18 46. 07 45. 85 79. 78 8 Uni. Dort_Jungermann_r 1 67. 79 70. 93% 64. 91% 73. 18 46. 07 45. 74 79. 58 9 Uni. Ali_Kozareva 66. 59 62. 73% 70. 95% 72. 60 47. 26 47. 81 78. 66 10 LDC_Walker_r 1 63. 10 83. 05% 50. 88% 65. 25 52. 94 40. 70 75. 39 11 LDC_Walker_r 2 62. 70 82. 12% 50. 70% 65. 13 50. 56 36. 26 76. 44 - BASELINE 41. 11 42. 44% 39. 86% 69. 67 27. 63 40. 32 25. 48 - BASELINE -u 36. 85 40. 29% 33. 95% 57. 64 26. 32 39. 43 25. 55
Discussion FB 1 Participant Over. FB 1 Over. Prec. Over. Rec. GPE LOC ORG PER 1 FBKirst_Zanoli_r 2 82. 14 83. 41% 80. 91% 85. 54 73. 04 64. 27 92. 12 2 FBKirst_Zanoli_r 1 81. 28 82. 97% 79. 65% 85. 52 73. 04 64. 06 90. 40 3 Uni. Du. E_Roessler_r 1 72. 27 71. 62% 72. 94% 78. 39 53. 92 49. 89 84. 42 4 Uni. Du. E_Roessler_r 2 71. 93 73. 28% 70. 62% 78. 75 54. 73 49. 01 83. 64 5 Yahoo_Ciaramita_r 1 68. 99 71. 28% 66. 85% 75. 38 52. 83 49. 08 78. 89 6 Yahoo_Ciaramita_r 2 68. 15 70. 44% 66. 00% 75. 08 52. 31 46. 85 78. 36 7 Uni. Dort_Jungermann_r 2 67. 90 70. 93% 65. 12% 73. 18 46. 07 45. 85 79. 78 8 Uni. Dort_Jungermann_r 1 67. 79 70. 93% 64. 91% 73. 18 46. 07 45. 74 79. 58 9 Uni. Ali_Kozareva 66. 59 62. 73% 70. 95% 72. 60 47. 26 47. 81 78. 66 10 LDC_Walker_r 1 63. 10 83. 05% 50. 88% 65. 25 52. 94 40. 70 75. 39 11 LDC_Walker_r 2 62. 70 82. 12% 50. 70% 65. 13 50. 56 36. 26 76. 44 - BASELINE 41. 11 42. 44% 39. 86% 69. 67 27. 63 40. 32 25. 48 - BASELINE -u 36. 85 40. 29% 33. 95% 57. 64 26. 32 39. 43 25. 55 Rank
Discussion FB 1 Participant Over. FB 1 Over. Prec. Over. Rec. GPE LOC ORG PER 1 FBKirst_Zanoli_r 2 82. 14 83. 41% 80. 91% 85. 54 73. 04 64. 27 92. 12 2 FBKirst_Zanoli_r 1 81. 28 82. 97% 79. 65% 85. 52 73. 04 64. 06 90. 40 3 Uni. Du. E_Roessler_r 1 72. 27 71. 62% 72. 94% 78. 39 53. 92 49. 89 84. 42 4 Uni. Du. E_Roessler_r 2 71. 93 73. 28% 70. 62% 78. 75 54. 73 49. 01 83. 64 5 Yahoo_Ciaramita_r 1 68. 99 71. 28% 66. 85% 75. 38 52. 83 49. 08 78. 89 6 Yahoo_Ciaramita_r 2 68. 15 70. 44% 66. 00% 75. 08 52. 31 46. 85 78. 36 7 Uni. Dort_Jungermann_r 2 67. 90 70. 93% 65. 12% 73. 18 46. 07 45. 85 79. 78 8 Uni. Dort_Jungermann_r 1 67. 79 70. 93% 64. 91% 73. 18 46. 07 45. 74 79. 58 9 Uni. Ali_Kozareva 66. 59 62. 73% 70. 95% 72. 60 47. 26 47. 81 78. 66 10 LDC_Walker_r 1 63. 10 83. 05% 50. 88% 65. 25 52. 94 40. 70 75. 39 11 LDC_Walker_r 2 62. 70 82. 12% 50. 70% 65. 13 50. 56 36. 26 76. 44 - BASELINE 41. 11 42. 44% 39. 86% 69. 67 27. 63 40. 32 25. 48 - BASELINE -u 36. 85 40. 29% 33. 95% 57. 64 26. 32 39. 43 25. 55 Rank
Conclusions • Good interest from the community: – 14 initial registrations – 6 participants (though only one Italian Institution) • Relatively high rate of abandonment (8/14, 60%) • Good performance – best system at CONLL: 88. 8% for English, 72. 4% for German – best system at EVALITA: 82. 1% EVALITA 2007 Workshop Rome, September 10, 2007
Thanks to all who participated EVALITA 2007 Workshop Rome, September 10, 2007
References • • ACE. http: //www. nist. gov/speech/tests/ace/index. htm CONLL. http: //www. cnts. ua. ac. be/conll 2002/ner/ L’Adige. http: //www. ladige. it/ Linguistic Data Consortium (LDC). Automatic Content Extraction English Annotation Guidelines for Entities, version 5. 6. 1 2005. 23. http: //projects. ldc. upenn. edu/ ace/docs/English-Entities-Guidelines_v 5. 6. 1. pdf Magnini, Cappelli, Pianta, Speranza, Bartalesi Lenzi, Sprugnoli, Romano, Girardi, Negri. Annotazione di contenuti concettuali in un corpus italiano: I-CAB. In Proceedings of SILFI 2006, X Congresso Internazionale della Società di Linguistica e Filologia Italiana, Firenze 14 -17 giugno 2006. Magnini, Pianta, Speranza, Bartalesi Lenzi, Sprugnoli. Italian Content Annotation Bank (I-CAB): Named Entities, Technical report, ITC-irst, 2007. http: //evalita. itc. it/tasks/I-CAB-Report-Named-Entities. pdf ONTOTEXT. http: //ontotext. itc. it/ EVALITA 2007 Workshop Rome, September 10, 2007
- Named entity recognition in tweets an experimental study
- Public interest entity
- Cardinality ratio constraint
- Contoh strong entity dan weak entity
- Public interest entity
- Unsupervised models for named entity classification
- Tiered task bias task
- Manuela quintero xxx
- Mensch roboter interaktion
- Manuela caruso pediatra
- Manuela jukic
- Manuela hurtado pedraza
- Id de manuela
- Color 05112004
- Startup arena
- Manuela vecchi
- Casd manuela beltran