Grammar Specs: Difference between revisions
From UNLwiki
Jump to navigationJump to search
imported>Martins No edit summary |
imported>Martins No edit summary |
||
| Line 1: | Line 1: | ||
# | |||
== Basic symbols == | |||
{| border="1" cellpadding="2" align=center | |||
|+Basic symbols used in UNL grammar rules | |||
!Symbol | |||
!Definition | |||
!Example | |||
|- | |||
|align=center|<nowiki>^</nowiki> | |||
|not | |||
|^a = not a | |||
|- | |||
|align=center|{ | } | |||
|or | |||
|<nowiki>{a|b}</nowiki> = a or b | |||
|- | |||
|align=center|% | |||
|index for nodes, attributes and values | |||
|%x (see [[#Indexes|below]]) | |||
|- | |||
|align=center|# | |||
|index for sub-NLWs | |||
|#01 (see [[#Indexes|below]]) | |||
|- | |||
|align=center|= | |||
|attribute-value assignment | |||
|POS=NOU | |||
|- | |||
|align=center|! | |||
|rule trigger | |||
|!PLR | |||
|- | |||
|align=center|& | |||
|merge operator | |||
|%x&%y | |||
|- | |||
|align=center|? | |||
|dictionary lookup operator | |||
|?[a] | |||
|- | |||
|align=center|“ “ | |||
|string | |||
|"went" | |||
|- | |||
|align=center|[ ] | |||
|natural language entry (headword) | |||
|[go] | |||
|- | |||
|align=center|[[ ]] | |||
|UW | |||
|[[to go(icl>to move)]] | |||
|- | |||
|align=center|( ) | |||
|node | |||
|(a) | |||
|- | |||
|align=center|// | |||
|regular expression | |||
|/a{2,3}/ = aa,aaa | |||
|} | |||
== Basic Concepts == | |||
;The differences between "", [] and [[]] | |||
:Double quotes are always used to represent strings: "a" will match only the string "a" | |||
:Simple square brackets are always used to represent natural language entries (headwords) in the dictionary: [a] will match the node associated to the entry [a] retrieved from the dictionary, no matter its current realization, which may be affected by other rules (the original [a] may have been replaced, for instance, by "b", but will still be indexed to the entry [a]) | |||
:Double square brackets are always used to represent UWs: <nowiki>[[a]]</nowiki> will match the node associated to the UW <nowiki>[[a]]</nowiki> | |||
;Predefined values (assigned by default) | |||
:SCOPE - Scope | |||
:SHEAD - Sentence head (the beginning of a sentence) | |||
:STAIL - Sentence tail (the end of a sentence) | |||
:CHEAD - Scope head (the beginning of a scope) | |||
:CTAIL - Scope tail (the end of a scope) | |||
:TEMP - Temporary entry (entry not found in the dictionary) | |||
:DIGIT - Any sequence of digits (i.e.: 0,1,2,3,4,5,6,7,8,9) | |||
Revision as of 18:29, 16 August 2013
Basic symbols
| Symbol | Definition | Example |
|---|---|---|
| ^ | not | ^a = not a |
| { | } | or | {a|b} = a or b |
| % | index for nodes, attributes and values | %x (see below) |
| # | index for sub-NLWs | #01 (see below) |
| = | attribute-value assignment | POS=NOU |
| ! | rule trigger | !PLR |
| & | merge operator | %x&%y |
| ? | dictionary lookup operator | ?[a] |
| “ “ | string | "went" |
| [ ] | natural language entry (headword) | [go] |
| [[ ]] | UW | [[to go(icl>to move)]] |
| ( ) | node | (a) |
| // | regular expression | /a{2,3}/ = aa,aaa |
Basic Concepts
- The differences between "", [] and [[]]
- Double quotes are always used to represent strings: "a" will match only the string "a"
- Simple square brackets are always used to represent natural language entries (headwords) in the dictionary: [a] will match the node associated to the entry [a] retrieved from the dictionary, no matter its current realization, which may be affected by other rules (the original [a] may have been replaced, for instance, by "b", but will still be indexed to the entry [a])
- Double square brackets are always used to represent UWs: [[a]] will match the node associated to the UW [[a]]
- Predefined values (assigned by default)
- SCOPE - Scope
- SHEAD - Sentence head (the beginning of a sentence)
- STAIL - Sentence tail (the end of a sentence)
- CHEAD - Scope head (the beginning of a scope)
- CTAIL - Scope tail (the end of a scope)
- TEMP - Temporary entry (entry not found in the dictionary)
- DIGIT - Any sequence of digits (i.e.: 0,1,2,3,4,5,6,7,8,9)