I am looking into the same concept, and found this promising article by Joel Pobar,
Create a Language Compiler for the .NET Framework - not sure where this has gone
Create a Language Compiler for the .NET Framework - pdf copy of the original doc
he discusses a high level concept of a compiler and proceeds to invent his own langauge for the .Net framework. Although its aimed at the .Net Framework, many of the concepts should be able to be reproduced. The Article covers:
- Langauge definition
- Scanner
- Parser (the bit im mainly interested in)
- Targeting the .Net Framework The
- Code Generator
there are other topics, but you get the just.
Its aimed to people starting out, written in C# (not quite Java)
HTH
bones