employer/tame - tame - Mike Gerwitz's Forge

employer

tame

Author	SHA1	Message	Date
Mike Gerwitz	343f5b34b3	tamer: asg::air: Template support for dangling expressions The intent was to have a very simple implementation of `hold_dangling` and have everything work. But, I had a nasty surprise when the system tests caught bug caused by some interesting depth interactions as it relates to `xmli` and auto-closing. I added an extra test/example in `asg::graph::visit::test` to illustrate the situation; it was difficult to derive from the traces, but trivially obvious once I wrote it out as an example. With that, templates can now aggregate tokens for dangling expressions. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	5c60c5fd15	tamer: asg::air::tpl: Parse template body expressions And finally we have tokens aggregated onto the ASG in the context of a template. I expected to arrive here much more quickly, but there was a lot of necessary refactoring. There's a lot more that could be done, but I need to continue; I had wanted this done a week ago. It is worth noting, though, that this finally achieves something I had been wondering about since the inception of this project---how I'd represent templates on the graph. I think this worked out rather nicely. It wasn't even until a few months ago that I decided to use AIR instead of NIR for that purpose (NIR wouldn't have worked). And note how I didn't have to touch the program derivation at all---the system test just works with the AIR change, because of the consistent construction of the graph. Beautiful. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	431df6cecb	tamer: asg::air::expr: Dead states for AirBind This hoists the errors back into `AirAggregate`; I need dead states for the `AirTplAggregate` parser so that it will know when to (and not to) interpret tokens in the context of the template itself. In a previous commit message, I had pondered whether it may be possible to eliminate the dead state transition, and yet here I've used it with both of the sub-parsers now. So it seems like the better option in the future may be to narrow the type further---to say precisely _what_ types of tokens may yield a dead state transition; otherwise you lose the match information from the parser that yielded it. A stubbornly persistent problem in Rust, this magical and hidden match knowledge. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	1770949b9a	tamer: asg::air::expr: Move Dangling expression handling into RootStrategy And with this, hopefully we are now finally prepared for dangling expressions in templates. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	231296d003	tamer: asg::air::expr: Introduce RootStrategy This sets us up to be able to determine how `Dangling` expressions will be rooted into templates. This new strategy isn't yet handling `Dangling`; I wanted to get this committed first so that the `Dangling` refactoring is more clear. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	fc1d55c4c5	tamer: asg::air::expr: Generic target ObjectKind Expressions were previously tied to packages. This prepares for using a `Tpl` as a container for expressions. This does not yet handle the situation of auto-rooting dangling expressions within the container. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	8cb781ccca	tamer: asg::air::expr::ExprStack: {SPair=>ObjectIndex} reachable evidence This result in less useful debug output, but it'll be needed for using a (possibly-anonymous) template as evidence. This evidence is simply for debugging, and to require some sort of value during development to help obviate when maybe something is being done incorrectly (if no obvious value exists). DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	c1d04f1cf4	tamer: asg::air: Extract template parsing into `tpl` Same as the previous commit. These commits have significantly reduced the cognitive burden of working on this subsystem. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	4fd8e9ea40	tamer: asg::air: Extract expression parsing into `expr` This is more of the same refactoring that has been happening. This extraction also helps emphasize the relationship between imported objects, and isolates the growing number of test cases. This parser will only grow. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	f307f2d70b	tamer: asg::air: Extract template parsing into own parser Just as was done with the expression parser, which this will utilize. This initializes it, but doesn't yet make use of it (`AirExprAggregate`). Refactoring was definitely needed; decomposing this is quite a bit of work, in no small part because of the complexity. This helps significantly. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	d99a8efbaf	tamer: asg::air::ir: {ExprRef=>RefIdent} This generalizes the IR, and relates the duals: identifying and referencing. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	e2714ce73f	tamer: asg::air::ir::sum_ir: impl Token for IR sum type This is necessary for the commit that follows. Maybe it wasn't worth separating this into its own commit. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	b6d0569b99	tamer: asg::air: Expression parser This delegates expression parsing to `AirExprAggregate`, in an effort to both begin to simplify the understanding and maintenance of `AirAggregate`; and allow for parser composition for template parsing. This utilizes the prior changes for token sum types to precisely define the subset of AIR tokens supported by the expression parser. This differs from prior approaches which delegated until a dead state, relying on runtime information to determine if a parser has finished. This allows us to determine that statically. I do want to be able to eliminate the dead state from the parser so we can get rid of the `unreachable!`, but I need to move on; that's something I had tried to do in the past too, which ended up adding a bit of complexity, and I'll have to consider my options in the future, including whether the dead state transition can be entirely eliminated in favor of the combination of these sum types and recovery; the parsing framework decisions were made while recovery was still an open question, at least in practice. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	dfeef4ec25	tamer: asg::air::ir::sum_ir: Support arbitrary sum types See the provided documentation. This allows for precisely defining sum types over all tokens accepted by parsers; see a following commit. DEV-13708	2023-03-10 14:27:59 -05:00
Mike Gerwitz	34b64fd619	tamer: asg::air: AIR as a sum IR This introduces a new macro `sum_ir!` to help with a long-standing problem of not being able to easily narrow types in Rust without a whole lot of boilerplate. This patch includes a bit of documentation, so see that for more information. This was not a welcome change---I jumped down this rabbit hole trying to decompose `AirAggregate` so that I can share portions of parsing with the current parser and a template parser. I can now proceed with that. This is not the only implementation that I had tried. I previously inverted the approach, as I've been doing manually for some time: manually create types to hold the sets of variants, and then create a sum type to hold those types. That works, but it resulted in a mess for systems that have to use the IR, since now you have two enums to contend with. I didn't find that to be appropriate, because we shouldn't complicate the external API for implementation details. The enum for IRs is supposed to be like a bytecode---a list of operations that can be performed with the IR. They can be grouped if it makes sense for a public API, but in my case, I only wanted subsets for the sake of delegating responsibilities to smaller subsystems, while retaining the context that `match` provides via its exhaustiveness checking but does not expose as something concrete (which is deeply frustrating!). Anyway, here we are; this'll be refined over time, hopefully, and portions of it can be generalized for removing boilerplate from other IRs. Another thing to note is that this syntax is really a compromise---I had to move on, and I was spending too much time trying to get creative with `macro_rules!`. It isn't the best, and it doesn't seem very Rust-like in some places and is therefore not necessarily all that intuitive. This can be refined further in the future. But the end result, all things considered, isn't too bad. DEV-13708	2023-03-10 14:27:58 -05:00
Mike Gerwitz	d42a46d2b8	tamer: NIR->xmli template definition setup This sets the stage for template parsing, and finally decides how we're going to represent templates on the ASG. This is going to start simple, since my original plans for improving how templates are handled (conceptually) is going to have to wait. This is the last difficult object type to figure out, with respect to graph representation and derivation, so I wanted to get it out of the way. DEV-13708	2023-03-10 14:27:58 -05:00
Mike Gerwitz	08278bc867	tamer: asg::air::Air::{ExprIdent=>BindIdent}: Rename I wasn't initially sure whether I'd want separate tokens for different types of identifying operations, but now that I see that it is clear from the current state of the parser, there's no need. This matches the name of the token in NIR. DEV-13708	2023-03-10 14:27:58 -05:00
Mike Gerwitz	4afc8c22e6	tamer: asg::air: Merge Pkg closing span The `Pkg` span will now properly reflect the entire definition of the package including the opening and closing tags. This was found while I was working on a graph traversal. DEV-13597	2023-03-10 14:27:57 -05:00
Mike Gerwitz	39e98210be	tamer: asg::graph::object::ident::ObjectIndex::<Ident>::bind_definition: Replace ident span I noticed this while working on a graph traversal. The unit test used the same span for both the reference _and_ the binding, so I didn't notice. -_- The problem with this, though, is that we do not have a separate span representing the source location of the identifier reference. The reason is that we decided to re-use an existing node rather than creating another one, which would add another inconvenient layer of indirection (and complexity). So, I may have to add (optional?) spans to edges. DEV-13708	2023-03-10 14:27:57 -05:00
Mike Gerwitz	89700aa949	tamer: asg::graph::object::ObjectRel::is_cross_edge: New trait method This introduces the concept of ontological cross edges. The term "cross edge" is most often seen in the context of graph traversals, e.g. the trees formed by a depth-first search. This, however, refers to the trees that are inherent in the ontology of the graph. For example, an `ExprRef` will produce a cross edge to the referenced `Ident`, that that is a different tree than the current expression. (Well, I suppose technically it _could_ be a back edge, but then that'd be a cycle which would fail the process once we get to preventing it. So let's ignore that for now.) DEV-13708	2023-03-10 14:27:57 -05:00
Mike Gerwitz	2d3b27ac01	tamer: asg: Root package definition This causes a package definition to be rooted (so that it can be easily accessed for a graph walk). This keeps consistent with the new `ObjectIndex`-based API by introducing a unit `Root` `ObjectKind` and the boilerplate that goes with it. This boilerplate, now glaringly obvious, will be refactored at some point, since its repetition is onerous and distracting. DEV-13159	2023-02-01 10:34:17 -05:00
Mike Gerwitz	f753a23bad	tamer: asg: Introduce edge from Package to Ident Included in this diff are the corresponding changes to the graph to support the change. Adding the edge was easy, but we also need a way to get the package for an identifier. The easiest way to do that is to modify the edge weight to include not just the target node type, but also the source. DEV-13159	2023-02-01 10:34:17 -05:00
Mike Gerwitz	39d093525c	tamer: nir, asg: Introduce package to ASG This does not yet create edges from identifiers to the package; just getting this introduced was quite a bit of work, so I want to get this committed. Note that this also includes a change to NIR so that `Close` contains the entity so that we can pattern-match for AIR transformations rather than retaining yet another stack with checks that are already going to be done by AIR. This makes NIR stand less on its own from a self-validation point, but that's okay, given that it's the language that the user entered and, conceptually, they could enter invalid NIR the same as they enter invalid XML (e.g. from a REPL). In _practice_, of course, NIR is lowered from XML and the schema is enforced during that lowering and so the validation does exist as part of that parsing. These concessions speak more to the verbosity of the language (Rust) than anything. DEV-13159	2023-02-01 10:34:16 -05:00
Mike Gerwitz	39ebb74583	tamer: asg: Expression identifier references This adds support for identifier references, adding `Ident` as a valid edge type for `Expr`. There is nothing in the system yet to enforce ontology through levels of indirection; that will come later on. I'm testing these changes with a very minimal NIR parse, which I'll commit shortly. DEV-13597	2023-01-26 14:45:17 -05:00
Mike Gerwitz	8735c2fca3	tamer: asg::graph: Static- and runtime-enforced multi-kind edge ontolgoy This allows for edges to be multiple types, and gives us two important benefits: (a) Compiler-verified correctness to ensure that we don't generate graphs that do not adhere to the ontology; and (b) Runtime verification of types, so that bugs are still memory safe. There is a lot more information in the documentation within the patch. This took a lot of iterating to get something that was tolerable. There's quite a bit of boilerplate here, and maybe that'll be abstracted away better in the future as the graph grows. In particular, it was challenging to determine how I wanted to actually go about narrowing and looking up edges. Initially I had hoped to represent the subsets as `ObjectKind`s as well so that you could use them anywhere `ObjectKind` was expected, but that proved to be far too difficult because I cannot return a reference to a subset of `Object` (the value would be owned on generation). And while in a language like C maybe I'd pad structures and cast between them safely, since they _do_ overlap, I can't confidently do that here since Rust's discriminant and layout are not under my control. I tried playing around with `std::mem::Discriminant` as well, but `discriminant` (the function) requires a _value_, meaning I couldn't get the discriminant of a static `Object` variant without some dummy value; wasn't worth it over `ObjectRelTy.` We further can't assign values to enum variants unless they hold no data. Rust a decade from now may be different and will be interesting to look back on this struggle. DEV-13597	2023-01-26 14:45:14 -05:00
Mike Gerwitz	ee30600f67	tamer: asg::air::Air: {Expr=>Expr} Makes grouping and code completion easier when they're prefixed. DEV-13597	2023-01-23 11:48:28 -05:00
Mike Gerwitz	954b5a2795	Copyright year and name update Ryan Specialty Group (RSG) rebranded to Ryan Specialty after its IPO.	2023-01-20 23:37:30 -05:00
Mike Gerwitz	1be0f2fe70	tamer: asg::object: Move into graph module The ASG delegates certain operations to Objects so that they may enforce their own invariants and ontology. It is therefore important that only objects have access to certain methods on `Asg`, otherwise those invariants could be circumvented. It should be noted that the nesting of this module is such that AIR should _not_ have privileged access to the ASG---it too must utilize objects to ensure those invariants are enforced in a single place. DEV-13597	2023-01-20 23:37:30 -05:00
Mike Gerwitz	4e3a81d7f5	tamer: asg: Bind transparent ident This provides the initial implementation allowing an identifier to be defined (bound to an object and made transparent). I'm not yet entirely sure whether I'll stick with the "transparent" and "opaque" terminology when there's also "declare" and "define", but a `Missing` state is a type of declaration and so the distinction does still seem to be important. There is still work to be done on `ObjectIndex::<Ident>::bind_definition`, which will follow. I'm going to be balancing work to provide type-level guarantees, since I don't have the time to go as far as I'd like. DEV-13597	2023-01-20 23:37:29 -05:00
Mike Gerwitz	378fe3db66	tamer: asg::Asg::lookup: SymbolId=>SPair This seems to have been an oversight from when I recently introduced SPairs to ASG; I noticed it while working on another change and receiving back a `DUMMY_SPAN`. DEV-13597	2023-01-20 23:37:29 -05:00
Mike Gerwitz	f1cf35f499	tamer: asg: Add expression edges This introduces a number of abstractions, whose concepts are not fully documented yet since I want to see how it evolves in practice first. This introduces the concept of edge ontology (similar to a schema) using the type system. Even though we are not able to determine what the graph will look like statically---since that's determined by data fed to us at runtime---we _can_ ensure that the code _producing_ the graph from those data will produce a graph that adheres to its ontology. Because of the typed `ObjectIndex`, we're also able to implement operations that are specific to the type of object that we're operating on. Though, since the type is not (yet?) stored on the edge itself, it is possible to walk the graph without looking at node weights (the `ObjectContainer`) and therefore avoid panics for invalid type assumptions, which is bad, but I don't think that'll happen in practice, since we'll want to be resolving nodes at some point. But I'll addres that more in the future. Another thing to note is that walking edges is only done in tests right now, and so there's no filtering or anything; once there are nodes (if there are nodes) that allow for different outgoing edge types, we'll almost certainly want filtering as well, rather than panicing. We'll also want to be able to query for any object type, but filter only to what's permitted by the ontology. DEV-13160	2023-01-20 23:37:29 -05:00
Mike Gerwitz	8786ee74fa	tamer: asg::air: Expression building error cases This addresses the two outstanding `todo!` match arms representing errors in lowering expressions into the graph. As noted in the comments, these errors are unlikely to be hit when using TAME in the traditional way, since e.g. XIR and NIR are going to catch the equivalent problems within their own contexts (unbalanced tags and a valid expression grammar respectively). _But_, the IR does need to stand on its own, and I further hope that some tooling maybe can interact more directly with AIR in the future. DEV-13160	2023-01-20 23:37:29 -05:00
Mike Gerwitz	40c941d348	tamer: asg::air::AirAggregate: Initial impl of nested exprs This introduces a number of concepts together, again to demonstrate that they were derived. This introduces support for nested expressions, extending the previous work. It also supports error recovery for dangling expressions. The parser states are a mess; there is a lot of duplicate code here that needs refactoring, but I wanted to commit this first at a known-good state so that the diff will demonstrate the need for the change that will follow; the opportunities for abstraction are plainly visible. The immutable stack introduced here could be generalized, if needed, in the future. Another important note is that Rust optimizes away the `memcpy`s for the stack that was introduced here. The initial Parser Context was introduced because of `ArrayVec` inhibiting that elision, but Vec never had that problem. In the future, I may choose to go back and remove ArrayVec, but I had wanted to keep memory allocation out of the picture as much as possible to make the disassembly and call graph easier to reason about and to have confidence that optimizations were being performed as intended. With that said---it _should_ be eliding in tamec, since we're not doing anything meaningful yet with the graph. It does also elide in tameld, but it's possible that Rust recognizes that those code paths are never taken because tameld does nothing with expressions. So I'll have to monitor this as I progress and adjust accordingly; it's possible a future commit will call BS on everything I just said. Of course, the counter-point to that is that Rust is optimizing them away anyway, but Vec _does_ still require allocation; I was hoping to keep such allocation at the fringes. But another counter-point is that it _still_ is allocated at the fringe, when the context is initialized for the parser as part of the lowering pipeline. But I didn't know how that would all come together back then. ...alright, enough rambling. DEV-13160	2023-01-20 23:37:29 -05:00
Mike Gerwitz	0863536149	tamer: asg::Asg::get: Narrow object type This uses `ObjectIndex` to automatically narrow the type to what is expected. Given that `ObjectIndex` is supposed to mean that there must be an object with that index, perhaps the next step is to remove the `Option` from `get` as well. DEV-13160	2022-12-22 16:32:21 -05:00
Mike Gerwitz	646633883f	tamer: Initial concept for AIR/ASG Expr This begins to place expressions on the graph---something that I've been thinking about for a couple of years now, so it's interesting to finally be doing it. This is going to evolve; I want to get some things committed so that it's clear how I'm moving forward. The ASG makes things a bit awkward for a number of reasons: 1. I'm dealing with older code where I had a different model of doing things; 2. It's mutable, rather than the mostly-functional lowering pipeline; 3. We're dealing with an aggregate ever-evolving blob of data (the graph) rather than a stream of tokens; and 4. We don't have as many type guarantees. I've shown with the lowering pipeline that I'm able to take a mutable reference and convert it into something that's both functional and performant, where I remove it from its container (an `Option`), create a new version of it, and place it back. Rust is able to optimize away the memcpys and such and just directly manipulate the underlying value, which is often a register with all of the inlining. _But_ this is a different scenario now. The lowering pipeline has a narrow context. The graph has to keep hitting memory. So we'll see how this goes. But it's most important to get this working and measure how it performs; I'm not trying to prematurely optimize. My attempts right now are for the way that I wish to develop. Speaking to #4 above, it also sucks that I'm not able to type the relationships between nodes on the graph. Rather, it's not that I _can't_, but a project to created a typed graph library is beyond the scope of this work and would take far too much time. I'll leave that to a personal, non-work project. Instead, I'm going to have to narrow the type any time the graph is accessed. And while that sucks, I'm going to do my best to encapsulate those details to make it as seamless as possible API-wise. The performance hit of performing the narrowing I'm hoping will be very small relative to all the business logic going on (a single cache miss is bound to be far more expensive than many narrowings which are just integer comparisons and branching)...but we'll see. Introducing branching sucks, but branch prediction is pretty damn good in modern CPUs. DEV-13160	2022-12-22 14:33:28 -05:00
Mike Gerwitz	0b2e563cdb	tamer: asg: Associate spans with identifiers and introduce diagnostics This ASG implementation is a refactored form of original code from the proof-of-concept linker, which was well before the span and diagnostic implementations, and well before I knew for certain how I was going to solve that problem. This was quite the pain in the ass, but introduces spans to the AIR tokens and graph so that we always have useful diagnostic information. With that said, there are some important things to note: 1. Linker spans will originate from the `xmlo` files until we persist spans to those object files during `tamec`'s compilation. But it's better than nothing. 2. Some additional refactoring is still needed for consistency, e.g. use of `SPair`. 3. This is just a preliminary introduction. More refactoring will come as tamec is continued. DEV-13041	2022-12-16 14:44:38 -05:00
Mike Gerwitz	56d1ecf0a3	tamer: Air{Token=>} Consistency with `Nir` et al. DEV-13430	2022-12-13 14:36:38 -05:00
Mike Gerwitz	d55b3add77	tamer: asg::air::test: Extract into own file Just minor preparatory work. DEV-13160	2022-12-13 13:57:04 -05:00

38 Commits (2233c69bbf45eb36fa561866102b64bb3e03d638)