employer/tame - tame - Mike Gerwitz's Forge

employer

tame

Author	SHA1	Message	Date
Mike Gerwitz	3f8e397e57	tamer: obj::xmlo::reader: Parse preproc:sym/preproc:from Ideally this would just be an attribute, but I guess I never got around to making that change in the compiler and I don't want a detour right now. DEV-10863	2022-03-30 12:06:38 -04:00
Mike Gerwitz	1e278cbe26	tamer: obj::xmlo::reader: preproc:symtable/preproc:sym parsing This integrates much of the work done so far to parse into a `XmloEvent::SymDecl`. The attribute parsing _is_ verbose, and I do intend to abstract it away later on, but I'm going to wait on that for now. The new reader should be finishing up soon, which is really exciting, since I started working on this months ago (before having to take a break on TAMER); I'm anticipating strong performance gains in the reader, and this is a test that will tell us how the compiler will perform moving forward with the abstractions that I've spent so much time on. DEV-10863	2022-03-30 09:09:48 -04:00
Mike Gerwitz	f42288f3a2	tamer: obj::xmlo::reader: Begin symbol table parsing This wasn't the simplest thing to start with, but I wanted to explore something with a higher level of complexity. There is some boilerplate to observe here, including: 1. The state stitching (as I guess I'm calling it now) of SymtableState with XmloReaderState is all boilerplate and requires no lookahead, presenting an abstraction opportunity that I was holding off on previously (attr parsing for XIRF requires lookahead). 2. This is simply collecting attributes into a struct. This can be abstracted away in the future. 3. Creating stub parsers to verify that generics are stitched rather than being tightly coupled with another state is boilerplate that maybe can be abstracted away after a pattern is observed in future tests. DEV-10863	2022-03-29 11:14:47 -04:00
Mike Gerwitz	b4a7591357	tamer: obj::xmlo::reader: Begin conversion to ParseState This begins to transition XmloReader into a ParseState. Unlike previous changes where ParseStates were composed into a single ParseState, this is instead a lowering operation that will take the output of one Parser and provide it to another. The mess in ld::poc (...which still needs to be refactored and removed) shows the concept, which will be abstracted away. This won't actually get to the ASG in order to test that that this works with the wip-xmlo-xir-reader flag on (development hasn't gotten that far yet), but since it type-checks, it should conceptually work. Wiring lowering operations together is something that I've been dreading for months, but my approach of only abstracting after-the-fact has helped to guide a sane approach for this. For some definition of "sane". It's also worth noting that AsgBuilder will too become a ParseState implemented as another lowering operation, so: XIR -> XIRF -> XMLO -> ASG These steps will all be streaming, with iteration happening only at the topmost level. For this reason, it's important that ASG not be responsible for doing that pull, and further we should propagate Parsed::Incomplete rather than filtering it out and looping an indeterminate number of times outside of the toplevel. One final note: the choice of 64 for the maximum depth is entirely arbitrary and should be more than generous; it'll be finalized at some point in the future once I actually evaluate what maximum depth is reasonable based on how the system is used, with some added growing room. DEV-10863	2022-03-22 14:06:52 -04:00
Mike Gerwitz	14638a612f	tamer: {xir::=>}parse: Move parser out of XIR The parsing framework originally created for XIR is now more general and useful to other things. We'll see how this evolves. This needs additional documentation, but I'd like to see how it changes as I implement XmloReader and then some of the source readers first. DEV-10863	2022-03-18 16:24:53 -04:00
Mike Gerwitz	0360226caa	tamer: xir::parse: Generalize input token type This adds a `Token` type to `ParseState`. Everything uses `xir::Token` currently, but `XmloReader` will use `xir::flat::Object`. Now that this has been generalized beyond XIR, the parser ought to be hoisted up a level. DEV-10863	2022-03-18 15:26:05 -04:00
Mike Gerwitz	5af698d15c	tamer: xir::{tree::=>}parse: Move module It's a bit odd that I've done next to nothing with TAMER for the past week or so, and decided to do this one small thing before I go on break for the holidays, but I felt compelled to do _something_. Besides, this gets me in a better spot for the inevitable mental planning and writing I'll be doing over the holidays. This move was natural, given what this has evolved into---it has nothing to do with the concept of a "tree", and the modules imports emphasized that fact given the level of inappropriate nesting.	2021-12-23 13:17:18 -05:00
Mike Gerwitz	61f7a12975	tamer: xir::tree: Integrate AttrParserState into Stack Note that AttrParse{r=>}State needs renaming, and Stack will get a better name down the line too. This commit message is accurate, but confusing. This performs the long-awaited task of trying to observe, concretely, how to combine two automata. This has the effect of stitching together the state machines, such that the union of the two is equivalent to the original monolith. The next step will be to abstract this away. There are some important things to note here. First, this introduces a new "dead" state concept, where here a dead state is defined as an _accepting_ state that has no state transitions for the given input token. This is more strict than a dead state as defined in, for example, the Dragon Book, where backtracking may occur. The reason I chose for a Dead state to be accepting is simple: it represents a lookahead situation. It says, "I don't know what this token is, but I've done my job, so it may be useful in a parent context". The "I've done my job" part is only applicable in an accepting state. If the parser is _not_ in an accepting state, then an unknown token is simply an error; we should _not_ try to backtrack or anything of the sort, because we want only a single token of lookahead. The reason this was done is because it's otherwise difficult to compose the two parsers without requiring that AttrEnd exist in every XIR stream; this has always been an awkward delimiter that was introduced to make the parser LL(0), but I tried to compromise by saying that it was optional. Of course, I knew that decision caused awkward inconsistencies, I had just hoped that those inconsistencies wouldn't manifest in practical issues. Well, now it did, and the benefits of AttrEnd that we had in the previous construction do not exist in this one. Consequently, it makes more sense to simply go from LL(0) to LL(1), which makes AttrEnd unnecessary, and a future commit will remove it entirely. All of this information will be documented, but I want to get further in the implementation first to make sure I don't change course again and therefore waste my time on docs. DEV-11268	2021-12-16 09:44:02 -05:00
Mike Gerwitz	29fdf5428c	tamer: xir::tree: {Parse=>Stack}Error Prepare to adopt parse::ParseError, which will contain StackError. DEV-11268	2021-12-13 15:27:20 -05:00
Mike Gerwitz	ba4c32383f	tamer: obj::xmlo::reader: Parse root package node attributes Well, parse to the extent that it was being parsed before, anyway. The core of this change demonstrates how well TAMER's abstractions work well together. (As long as you have an e.g. LSP to help you make sense of all of the inference, I suppose.) Token::Open(QN_LV_PACKAGE \| QN_PACKAGE, _) => { return Ok(XmloEvent::Package( attr_parser_from(&mut self.reader) .try_collect_ok()??, )); } This finally makes use of `attr_parser_from` and `try_collect_ok`. All of the types are inferred---from the iterator transformations, to the error conversions, to the destination PackageAttrs type. DEV-10863	2021-11-18 00:59:10 -05:00
Mike Gerwitz	7367e20c01	tamer: obj::xmlo: Extract error types into own module	2021-11-16 15:47:52 -05:00

11 Commits (3f8e397e57acd72483a82216698fee2fc1601779)