I have the following code written in ANTLRWorks 1.4
grammar hmm;
s : (put_a_in_b)|(put_out_a)|(drop_kick)|(drop_a)|(put_on_a);
put_a_in_b : (PUT_SYN)(ID)(IN_SYN)(ID);
put_out_a : (PUT2_SYN)(OUT_SYN)(ID) | (E1)(ID);
drop_kick : ('drop')('kick')(ID);
drop_a : (DROP_SYN)(ID);
put_on_a : (E2)(ID);
PUT_SYN : 'put' | 'place' | 'drop';
PUT2_SYN : 'put' | 'douse';
IN_SYN : 'in' | 'into' | 'inside' | 'within';
OUT_SYN : 'out';
E1 : 'extinguish'|'douse';
DROP_SYN : 'drop' | 'throw' | 'relinquish';
WS : ( ' ' | '\t' | '\r' | '\n' ) {$channel=HIDDEN;};
ID : ('a'..'z'|'A'..'Z'|'_') ('a'..'z'|'A'..'Z'|'0'..'9'|'_')*;
E2 : 'put on'|'don'|'wear';
COMMENT
: '//' ~('\n'|'\r')* '\r'? '\n' {$channel=HIDDEN;}
| '/*' ( options {greedy=false;} : . )* '*/' {$channel=HIDDEN;}
;
When I run it with the input:
drop object
I get a MismatchedTokenException(5 != 15).
And with the input :
put o1 in o2
I get a NoViableAltException.
Though it runs fine with
place o2 in o2
I'm new to this, but it seems like there's ambiguities? Or maybe my usage of ANTLR is incorrect?
You've put 'drop'
and 'put'
in two different lexer-rules:
PUT_SYN : 'put' | 'place' | 'drop'; // drop & put
PUT2_SYN : 'put' | 'douse'; // put
...
DROP_SYN : 'drop' | 'throw' | 'relinquish'; // drop
When put
is encountered by the lexer, PUT_SYN
will always be the rule that matches it, so 'put'
could (or should) be removed from the PUT2_SYN
rule.
So, your problem with parsing the string drop object
: the parser will try to match drop_a : (DROP_SYN)(ID);
but the "drop"
will be matched in the lexer rule PUT_SYN
.
EDIT
Those synonym-lists can be better made into parser rules (instead of lexer-rules). Here's a small demo:
grammar TextAdventure;
parse
: command (EndCommand command)* EOF
;
command
: put_syn_1 OtherWord in_syn OtherWord
| put_syn_2 out_syn_1 OtherWord
| out_syn_2 OtherWord
| Drop Kick OtherWord
| drop_syn OtherWord
;
drop_syn
: Drop
| Throw
| Relinquish
;
in_syn
: In
| Into
| Inside
| Within
;
put_syn_1
: Put
| Place
| Drop
;
put_syn_2
: Put
| Douse
;
out_syn_1
: Out
;
out_syn_2
: Extinguish
| Douse
;
Space : (' ' | '\t' | '\r' | '\n'){$channel=HIDDEN;};
EndCommand : ';';
Put : 'put';
Place : 'place';
Drop : 'drop';
Douse : 'douse';
In : 'in';
Into : 'into';
Inside : 'inside';
Within : 'within';
Out : 'out';
Extinguish : 'extinguish';
Throw : 'throw';
Relinquish : 'relinquish';
Kick : 'kick';
OtherWord : ('a'..'z' | 'A'..'Z')+;
When interpreting the following source:
drop object ; put yourself in myshoes ; place it in avase
you'll see ANTLRWorks generate the following parse-tree:
来源:https://stackoverflow.com/questions/3799233/what-is-wrong-with-this-grammar-antlrworks-1-4