C#/.NET Lexer Generators

為{幸葍}努か 提交于 2019-12-18 12:31:11

问题


I'm looking for a decent lexical scanner generator for C#/.NET -- something that supports Unicode character categories, and generates somewhat readable & efficient code. Anyone know of one?


EDIT: I need support for Unicode categories, not just Unicode characters. There are currently 1421 characters in just the Lu (Letter, Uppercase) category alone, and I need to match many different categories very specifically, and would rather not hand-write the character sets necessary for it.

Also, actual code is a must -- this rules out things that generate a binary file that is then used with a driver (i.e. GOLD)


EDIT: ANTLR does not support Unicode categories yet. There is an open issue for it, though, so it might fit my needs someday.


回答1:


GPLEX seems to support your requirements.




回答2:


The two solutions that come to mind are ANTLR and Gold. ANTLR has a GUI based grammar designer, and an excellent sample project in C# can be found here.




回答3:


I agree with @David Robbins, ANTLR is probably your best bet. However, the generated ANTLR code does need a seperate runtime library in order to use the generated code because there are some string parsing and other library commonalities that the generated code relies on. ANTLR generates a lexer AND a parser.

On a side note: ANTLR is great...I wrote a 400+ line grammar to generate over 10k or C# code to efficiently parse a language. This included built in error checking for every possible thing that could go wrong in the parsing of the language. Try to do that by hand, and you'll never keep up with the bugs.




回答4:


I just found this

http://www.seclab.tuwien.ac.at/projects/cuplex/lex.htm

It says that it's configurable enough to support unicode ;-).

Herber



来源:https://stackoverflow.com/questions/172189/c-net-lexer-generators

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!