I want to parse few date format using following ANTLR4 grammar.
grammar Variables;
//varTable : tableNameFormat dateFormat? ;
//tableNameFormat: (ID SEPERATO
For such a task regex is much better solution. But if you have it as a study project, here it is...
It is important to realize that order of lexer rules is crucial. Input will be tested by these rules and the first applicable will be used. The rules should be written from the most specific to avoid conflicts. For example, if you have grammar with variable names and some keywords, keywords should be first otherwise they will be marked as variables.
There are many ways you can solve this, but the best would be one lexer rule named DATE : NUM NUM NUM NUM '-' NUM NUM '-' NUM NUM; Month and Day rules as you have them wont work, as they are ambigous. How can lexer tell if two numbers input is month or day?
In my case it works. I am getting a correct parsetree with the input: 2016-01-01
grammar date;
dateFormat : year UNDERSCORE month UNDERSCORE today
| year
;
year : DIGIT DIGIT DIGIT DIGIT
;
month : DIGIT DIGIT
;
today : DIGIT DIGIT
;
UNDERSCORE: ('_' | '-' );
DIGIT : [0-9] ;
But I would use for month
something like (0 [1-9] | 1 [0-2])
because there are only 12 months.
I never worked on Antlr before, but when I looked in GitHub if someone already did which I want. Found this library.
here is a library to parse the date from String.
https://github.com/masasdani/nangka
add this project as a dependency of your project
<dependency>
<groupId>com.masasdani</groupId>
<artifactId>nangka</artifactId>
<version>0.0.6</version>
</dependency>
Sample usage :
String exprEn = "a month later, 20-11-90";
Nangka nangka = new Nangka();
DateUnit dateUnit = nangka.parse(exprEn);
for(Date date : dateUnit.getRelatedDates()){
System.out.println(date);
}
Hope this helps someone like me who is searching.