|
|
| Line 4: |
Line 4: |
|
| |
|
| == Corpus == | | == Corpus == |
| *[http://www.unlweb.net.br/resources/geneva2012/corpus_eng.txt Reference corpus in English (500 sentences)], to be manually translated to the target languages, in order to be used as the input for IAN | | *[[Corpus500]] |
| *[http://www.unlweb.net.br/resources/geneva2012/corpus_unl.txt Reference corpus in UNL (500 graphs)], to be used as the input for EUGENE
| |
| *Reference corpus according to the complexity of the graphs (the same as above, but split in different files)
| |
| {| border="1" cellpadding="2" align=center
| |
| |+Corpus
| |
| !Order
| |
| !Description
| |
| !Analysis (English original)
| |
| !Generation (UNL)
| |
| !Word list (English original)
| |
| |-
| |
| |1
| |
| |Temporary entries
| |
| |[http://www.unlweb.net.br/resources/geneva2012/temp_org.txt temp_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/temp_unl.txt temp_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/temp_dic.txt temp_dic.txt]
| |
| |-
| |
| |2
| |
| |Entries with no attribute or relation
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute0_org.txt attribute0_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute0_unl.txt attribute0_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute0_dic.txt attribute0_dic.txt]
| |
| |-
| |
| |3
| |
| |one-attribute entries
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute1_org.txt attribute1_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute1_unl.txt attribute1_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute1_dic.txt attribute1_dic.txt]
| |
| |-
| |
| |4
| |
| |two-attribute entries
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute2_org.txt attribute2_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute2_unl.txt attribute2_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute2_dic.txt attribute2_dic.txt]
| |
| |-
| |
| |5
| |
| |three-attribute entries
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute3_org.txt attribute3_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute3_unl.txt attribute3_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/attribute3_dic.txt attribute3_dic.txt]
| |
| |-
| |
| |6
| |
| |one-relation entries
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation1_org.txt relation1_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation1_unl.txt relation1_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation1_dic.txt relation1_dic.txt]
| |
| |-
| |
| |7
| |
| |two-relation entries
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation2_org.txt relation2_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation2_unl.txt relation2_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation2_dic.txt relation2_dic.txt]
| |
| |-
| |
| |8
| |
| |three-relation entries
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation3_org.txt relation3_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation3_unl.txt relation3_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation3_dic.txt relation3_dic.txt]
| |
| |-
| |
| |9
| |
| |four-relation entries
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation4_org.txt relation4_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation4_unl.txt relation4_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation4_dic.txt relation4_dic.txt]
| |
| |-
| |
| |10
| |
| |five-relation entries
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation5_org.txt relation5_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation5_unl.txt relation5_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation5_dic.txt relation5_dic.txt]
| |
| |-
| |
| |11
| |
| |six-relation entries
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation6_org.txt relation6_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation6_unl.txt relation6_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relation6_dic.txt relation6_dic.txt]
| |
| |-
| |
| |12
| |
| |numbers and numerals
| |
| |[http://www.unlweb.net.br/resources/geneva2012/numbers.txt numbers_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/numbers.txt numbers_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/numbers.txt numbers_dic.txt]
| |
| |-
| |
| |13
| |
| |expressions of time
| |
| |[http://www.unlweb.net.br/resources/geneva2012/time.txt time_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/time.txt time_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/time.txt time_dic.txt]
| |
| |-
| |
| |14
| |
| |relative clauses
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relatives.txt relatives_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relatives.txt relatives_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/relatives.txt relatives_dic.txt]
| |
| |-
| |
| |15
| |
| |special issues
| |
| |[http://www.unlweb.net.br/resources/geneva2012/problems.txt problems_org.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/problems.txt problems_unl.txt]
| |
| |[http://www.unlweb.net.br/resources/geneva2012/problems.txt problems_dic.txt]
| |
| |}
| |
|
| |
|
| == Deliverables == | | == Deliverables == |
Goals
- To build the basic modules of a NL-UNL (analysis) grammar
- To build the basic modules of a UNL-UNL (generation) grammar
Corpus
Deliverables
- DEADLINE
- 15 JUL 2012
- ANALYSIS (IAN)
- The manual translated version of the 500 sentences of the reference corpus (corpus_LID.txt)
- The analysis dictionary used to analyze those 500 sentences (ana_dic_LID.txt)
- The analysis grammar used to analyze those 500 sentences (ana_gra_LID.txt)
- The analysis disambiguation grammar, if any, used to analyze those 500 sentences (ana_dis_LID.txt)
- The UNL output for those 500 sentences generated from the dictionary and grammars above (ana_out_LID.txt)
- GENERATION (EUGENE)
- The generation dictionary used to generate the reference corpus onto natural language (gen_dic_LID.txt)
- The generation grammar, including inflectional paradigms, used to generate the reference corpus onto natural language (gen_gra_LID.txt)
- The generation disambiguation grammar, if any, used to generate the reference corpus onto natural language (gen_dis_LID.txt)
- The natural language output generated from the dictionary and grammars above (gen_out_LID.txt)
- LID is to be replaced by the ISO639-2 two-character code of the language (en = English, el = Greek, etc.). Accordingly, the Greek corpus is to be provided as corpus_el.txt; the Greek analysis dictionary, as ana_dic_el.txt; and so on. All the files must be shared with "martins" in EUGENE and IAN.
Presentations
English examples and templates
- ANALYSIS (IAN)
- GENERATION (EUGENE)
- English generation dictionary (to be provided soon)
- English generation grammar (to be provided soon)
- English disambiguation grammar (for generation) (to be provided soon)
Participants
- Carolin ARNOLD (German)
- Ewa CZAJKOWSKA (Polish)
- Grega MILHARCIC (Slovenian)
- Luisa GOUVEIA (Portuguese)
- Martin LUTS (Estonian)
- Mihaela ILIOAIA (Romanian)
- Ofelia HOVHANNISYAN (Armenian)
- Olga VARTZIOTI (Greek)
- Polina LENKOVA (Russian)
- Ronaldo MARTINS (UNL)
- Sameh ALANSARY (Arabic)
- Sara STYMNE (Swedish)
- Yordanka STANCHEVA (Bulgarian)
Schedule
- Feb 06th, 2012 - Monday
- 09:00-10:00 Introduction
- 10:00-12:00 I – Corpus
- 14:00-17:00 II – UNL-NL dictionary
- Feb 07th, 2012 - Tuesday
- 09:00-12:00 III – Morphology (inflectional paradigms)
- 14:00-17:00 IV – NL dictionary
- Feb 08th, 2012- Wednesday
- 09:00-12:00 V – UNL-NL grammar (I)
- 14:00-17:00 V – UNL-NL grammar (II)
- Feb 09th, 2012 - Thursday
- 09:00-12:00 VI – NL-UNL grammar (I)
- 14:00-17:00 VI – NL-UNL grammar (II)
- Feb 10th, 2012 - Friday
- 09:00-12:00 Evaluation
- 14:00-17:00 Discussion