RTE-2 submitted runs and results
Notice: Proper scientific methodology requires that testing should be blind. Therefore, if you plan to further use the RTE-2 test set for evaluation, it is
advisable that you will not perform any analysis of this data set, including the
detailed information provided in these runs.
|
|
# |
First Author (Group) |
Run |
Accuracy |
Average Precision |
|
1 |
Adams (Dallas) |
0.6262 |
0.6282 |
|
|
2 |
Bos (Rome & Leeds) |
0.6162 |
0.6689 |
|
|
0.6062 |
0.6042 |
|||
|
3 |
Burchardt (Saarland) |
0.5900 |
|
|
|
0.5775 |
|
|||
|
4 |
Clarke (Sussex) |
0.5275 |
0.5254 |
|
|
0.5475 |
0.5260 |
|||
|
5 |
de Marneffe (Stanford) |
0.5763 |
0.6131 |
|
|
0.6050 |
0.5800 |
|||
|
6 |
Delmonte (Venice) |
run1* |
0.5563 |
0.5685 |
|
7 |
Ferr?ndez (Alicante) |
0.5563 |
0.6089 |
|
|
0.5475 |
0.5743 |
|||
|
8 |
Herrera (UNED) |
0.5975 |
0.5663 |
|
|
0.5887 |
|
|||
|
9 |
Hickl (LCC) |
0.7538 |
0.8082 |
|
|
10 |
Inkpen (Ottawa) |
0.5800 |
0.5751 |
|
|
0.5825 |
0.5816 |
|||
|
11 |
Katrenko (Amsterdam) |
0.5900 |
|
|
|
0.5713 |
|
|||
|
12 |
Kouylekov (ITC-irst & Trento) |
0.5725 |
0.5249 |
|
|
0.6050 |
0.5046 |
|||
|
13 |
Kozareva (Alicante) |
0.5487 |
0.5589 |
|
|
0.5500 |
0.5485 |
|||
|
14 |
Litkowski (CL Research) |
0.5813 |
|
|
|
0.5663 |
|
|||
|
15 |
Marsi (Tilburg & Twente) |
0.6050 |
|
|
|
16 |
Newman (Dublin) |
0.5250 |
0.5052 |
|
|
0.5437 |
0.5103 |
|||
|
17 |
Nicholson (Melbourne) |
0.5288 |
0.5464 |
|
|
0.5088 |
0.5053 |
|||
|
18 |
Nielsen (Colorado) |
run1* |
0.6025 |
0.6396 |
|
run2* |
0.6112 |
0.6379 |
||
|
19 |
Rus (Memphis) |
0.5900 |
0.6047 |
|
|
0.5837 |
0.5785 |
|||
|
20 |
Schilder (Thomson & Minnesota) |
0.5437 |
|
|
|
0.5550 |
|
|||
|
21 |
Tatu (LCC) |
0.7375 |
0.7133 |
|
|
22 |
Vanderwende (Microsoft Research & Stanford) |
0.6025 |
0.6181 |
|
|
0.5850 |
0.6170 |
|||
|
23 |
Zanzotto (Milan & Rome) |
0.6388 |
0.6441 |
|
|
0.6250 |
0.6317 |
* Resubmitted after publication of the official results. Resubmission was
allowed only in case of a bug fix, so that the
updated results are the correct output of the system described in the RTE-2
proceedings.