employer/tame - tame - Mike Gerwitz's Forge

employer

tame

Author	SHA1	Message	Date
Mike Gerwitz	ba38a3c1ba	tamer: src::asg::air: Pool identifiers into global environment This, finally, introduces identifier pooling in the global environment, represented by `Root`. All package-level identifiers will be scoped as such, which at the moment means anything that's not within a template. As mentioned in recent commits, this does require additional cleanup to finalize, and some more test will make additional rationale more clear. It's also worth noting the intent of storing the `ObjectIndex<Root>`---not only does it mean that the active root can be derived solely from the current parsing state, but it also means that in the future we can contribute to any, potentially multiple, roots. I had previously used Neo4J to effectively diff two dependency graphs between versions in the current XSLT-based TAMER; I'd like to be able to do that with TAMER in the future, which is an important concept when considering automated data migration, as well as querying for the effects of changes. More to come. I'm hoping this is finally nearing a conclusion and I can finally tie everything together with package imports. `AirIdent` will be introduced into the mix soon now too, now that this commit is able to root them. DEV-13162	2023-05-16 23:28:47 -04:00
Mike Gerwitz	9fb2169a06	tamer: asg::air: Begin to introduce explicit scope testing There's a lot of documentation on this in the commit itself, but this stems from a) frustration with trying to understand how the system needs to operate with all of the objects involved; and b) recognizing that if I'm having difficulty, then others reading the system later on (including myself) and possibly looking to improve upon it are going to have a whole lot of trouble. Identifier scope is something I've been mulling over for years, and more formally for the past couple of months. This finally begins to formalize that, out of frustration with package imports. But it will be a weight lifted off of me as well, with issues of scope always looming. This demonstrates a declarative means of testing for scope by scanning the entire graph in tests to determine where an identifier has been scoped. Since no such scoping has been implemented yet, the tests demonstrate how they will look, but otherwise just test for current behavior. There is more existing behavior to check, and further there will be _references_ to check, as they'll also leave a trail of scope indexing behind as part of the resolution process. See the documentation introduced by this commit for more information on that part of this commit. Introducing the graph scanning, with the ASG's static assurances, required more lowering of dynamic types into the static types required by the API. This was itself a confusing challenge that, while not all that bad in retrospect, was something that I initially had some trouble with. The documentation includes clarifying remarks that hopefully make it all understandable. DEV-13162	2023-05-12 14:07:29 -04:00
Mike Gerwitz	9b53a5e176	tamer: asg::graph::visit::topo: Cut cycles This commit includes plenty of documentation, so you should look there. It's desirable to describe the sorting that TAME performs as a topological sort, since that's the end result we want. This uses the ontology to determine what to do to the graph when a cycle is encountered. So technically we're sorting a graph with cycles, but you can equivalently view this as first transforming the graph to cut all cycles and then sorting it. For the sake of trivia, the term "cut" is used for two reasons: (1) it's an intuitive visualization, and (2) the term "cut" has precedence in logic programming (e.g. Prolog), where it (`!`) is used to prevent backtracking. We're also preventing backtracking, via a back edge, which would produce a cycle. DEV-13162	2023-04-28 14:33:48 -04:00
Mike Gerwitz	f183600c3a	tamer: asg: Move Ident-specific methods off of Asg Historically, the ASG was better described as a "dependency graph", containing only identifiers (which are simply called "symbols" in the XSLT-based compiler). Consequently, it was appropriate for the graph to have operations specific to identifiers. (Indeed, that's the only type of object the graph supported.) Much has changed since then. This cleans things up, and makes parenting identifiers to root an _explicit_ operation. This will make it easier to move forward with handling of scope, and importing identifiers into packages, and removing `Source`, and so on. DEV-13162	2023-04-19 12:40:35 -04:00
Mike Gerwitz	9cb6195046	tamer: asg: Add basic Doc support (for @desc) This introduces a new `Doc` object that can be owned by `Expr` (only atm) and contain what it describes as a concise independent clause. This construction is not enforced, and is only really obvious today via the Summary Pages. There's a lot of latent and unrealized potential in TAME's documentation philosophy that was never realized, so this will certainly evolve over time. But for now, the primary purpose was to get `@desc` working on things like classifications so that `xmli` output can compile for certain packages. DEV-13708	2023-04-12 11:59:48 -04:00
Mike Gerwitz	c0e5b1d750	tamer: asg::air: Template application within expressions This recognizes template application within expressions. Since expressions can occur within templates, this can occur arbitrarily deeply. And with that, we have the core of the template system represented on the graph. Of course, there are some glaring scoping issues to be resolved, but those aren't unique to template application. DEV-13708	2023-04-05 15:49:25 -04:00
Mike Gerwitz	a738a05461	tamer: asg::graph::object::rel: Hash impls for ObjectIndexTo{,Tree} All ObjectIndex-like objects hash using only the underlying identifier, which ultimately boils down to a `NodeIndex` (petgraph), which is just a u32. And so in that sense, the only purpose we have for hashing it is to (a) reduce the space required to store mappings, and (b) compose with other `Hash`es. DEV-13708	2023-04-05 15:46:42 -04:00
Mike Gerwitz	3660c15d5a	tamer: asg::graph::rel::ObjectIndexTreeRelTo: New trait and related This creates another trait and struct `ObjectIndexToTree` that assert a stronger invariant than `ObjectIndexRelTo`---that not only does it uphold the invariants of `ObjectIndexRelTo`, but also that it represents a _tree_ edge, which indicates _ownership_ rather than just a reference. This will be used to statically infer what can serve as a scope boundary for upcoming changes. Specifically, anything that can own an `Ident` introduces a new level of scope. DEV-13708	2023-04-04 14:33:34 -04:00
Mike Gerwitz	f1495f8cf4	tamer: asg::graph::object: Move `lookup_local_linear` to `ObjectIndexRelTo` This allows this method to be used on anything that is able to relate to an identifier, which is needed for the changes being made for the template system. This linear lookup is actually going away (as hinted at by preceding commits); this is extracted as part of a larger change and I wanted to get it committed to make it easier to follow upcoming changes. DEV-13708	2023-04-03 16:14:31 -04:00
Mike Gerwitz	a5b4eda369	tamer: asg::air::AirAggregate: Remove Pkg context from child parser states This is more of the same of the previous commit, but in a more digestable chunk. We now have child states that are able to be constructed using a simple `From`, which is important to making `AirAggregate` a `SuperState`. This also makes `AirStack` act like a prototype chain for `ObjectIndex`es, creating environments where context shadows. The linear search should only have to check the last two frames (e.g. an Expr has a parent Pkg or Tpl context which will have a `rooting_oi` value), and this is only done during a rooting operation. DEV-13708	2023-03-29 12:58:35 -04:00
Mike Gerwitz	2ae33a1dfa	tamer: asg::graph::object: ObjectIndexTo and ObjectIndexRelTo The graph's ontology is defined in the direction of the edge: from OA to OB. This is enforced by the type system to ensure that no code path is able to generate an invalid graph. But that also makes it very difficult to work with a generic source to a specific target. This introduces a `ObjectIndexRelTo` trait that says whether `Self` is able to be related to some `ObjectKind` `OB`, implements it for `ObjectIndex where ObjectRelTo<OB>`, and introduces a new semi-opaque type `ObjectIndexTo` that allows for the source `ObjectIndex` to be generic. This then redefines some existing graph primitives in terms of `ObjectIndexRelTo`, in particular creating edges, so that `ObjectIndex` can be used as today, and the new `ObjectIndexTo` can be used in the same way with the same API, without violating the graph ontology. This will be used by `AirAggregate` to create dynamic targets for rooting and splicing/expansion. DEV-13708	2023-03-29 12:58:35 -04:00
Mike Gerwitz	893da0ed20	tamer: asg: Dynamically determined cross edges Previous to this commit, ontological cross edges were declared statically. But this doesn't fare well with the decided implementation for template application. The documentation details it, but we have Tpl->Ident which could mean "I define this Ident once expanded", or it could mean "this is a reference to a template I will be applying". The former is a tree edge, the latter is a cross edge, and that determination can only be made by inspecting edge data at runtime. It could have been resolved by introducing new Object types, but that is a lot of work for little benefit, especially given that only (right now) the visitor uses this information. DEV-13708	2023-03-29 12:58:34 -04:00
Mike Gerwitz	be81878dd7	tamer: src::asg: Scaffolding for metasyntactic variables Also known as metavariables or template parameters. This is a bit of a tortured excursion, trying to figure out how I want to best represent this. I have a number of pages of hand-written notes that I'd like to distill over time, but the rendered graph ontology (via `asg-ontviz`) demonstrates the broad idea. `AirTpl::TplApply` highlights some remaining questions. What I had _wanted_ to do is to separate the concepts of application and expansion, and support partial application and such. But it's going to be too much work for now, when it isn't needed---partial application can be worked around by simply creating new templates and duplicating params, as we do today, although that sucks and is a maintenance issue. But I'd rather address that head-on in the future. So it's looking like Option B is going to be the approach for now, with templates being closed (as in, no free metavariables) and expanded at the same time. This simplifies the parser and error conditions significantly and makes it easier to utilize anonymous templates, since it'll still be the active context. My intent is to get at least the graph construction sorted out---not the actual expansion and binding yet---enough that I can use templates to represent parts of NIR that do not have proper graph representations or desugaring yet, so that I can spit them back out again in the `xmli` file and incrementally handle them. That was an option I had considered some months ago, but didn't want to entertain it at the time because I wasn't sure what doing so would look like; while it was an attractive approach since it pushes existing primitives into the template system (something I've wanted to do for years), I didn't want to potentially tank performance or compromise the design for it after I had spent so much effort on all of this so far. But my efforts have yielded a system that significantly exceeds my initial performance expectations, with a decent abstractions, and so this seems viable. DEV-13708	2023-03-15 16:40:07 -04:00
Mike Gerwitz	dd2232b58b	tamer: asg::graph: object_gen and object_rel macros The previous commit demonstrated the amount of boilerplate necessary for introducing new `ObjectKind`s; this abstracts away a lot of that boilerplate, and allows for declarative relationship definition for the ASG's ontology. DEV-13708	2023-03-10 14:27:58 -05:00
Mike Gerwitz	454b91dfce	tamer: asg::graph::object: New Tpl object There's quite a bit of boilerplate here that'll eventually need factoring out. But it's also clear that it is somewhat onerous to add new object types. Note that a good chunk of this burden is _intentional_, via exhaustiveness checks---adding a new type of object is an exceptional occurrence (well, in principle, but we haven't added them all yet, so it'll be more common initially), and we'd rather be safe to ensure that everything is properly considering how that new type of object interacts with it. Let's not confuse coupling with safety---the latter causes a burden because of the former, not because of itself; it provides a service to us. But, nonetheless, we'll want to reduce this burden somewhat since there are a number more to add. DEV-13708	2023-03-10 14:27:58 -05:00
Mike Gerwitz	3587d032c3	tamer: asg::graph::object::rel::DynObjectRel: Store source data This is generic over the source, just as the target, defaulting just the same to `ObjectIndex`. This allows us to use only the edge information provided rather than having to perform another lookup on the graph and then assert that we found the correct edge. In this case, we're dealing with an `Ident->Expr` edge, of which there is only one, but in other cases, there may be many such edges, and it wouldn't be possible to know _which_ was referred to without also keeping context of the previous edge in the walk. So, in addition to avoiding more indirection and being more immune to logic bugs, this also allows us to avoid states in `AsgTreeToXirf` for the purpose of tracking previous edges in the current path. And it means that the tree walk can seed further traversals in conjunction with it, if that is so needed for deriving sources. More cleanup will be needed, but this does well to set us up for moving forward; I was too uncomfortable with having to do the separate lookup. This is also a more intuitive API. But it does have the awkward effect that now I don't need the pair---I just need the `Object`---but I'm not going to remove it because I suspect I may need it in the future. We'll see. The TODO references the fact that I'm using a convenient `resolve_oi_pairs` instead of resolving only the target first and then the source only in the code path that needs it. I'll want to verify that Rust will properly optimize to avoid the source resolution in branches that do not need it. DEV-13708	2023-03-10 14:27:58 -05:00
Mike Gerwitz	cb5d54b2db	tamer: asg::graph::object: Generic Object inner type This makes the inner `Object` type generic (but defaulting to the same inner types as before) so that it can be used as a sum type for various types where `ObjectKind`-based narrowing is required. In this case, it's used to narrow `ObjectIndex` alongside the inner `ObjectKind` so that the two are definitely in sync. This not only results in cleaner code and a more intuitive API that's approachable to people less familiar with the system, but it also helps to eliminate logic bugs that might result form manually narrowing (as was done before this change). DEV-13708	2023-03-10 14:27:58 -05:00
Mike Gerwitz	79cc61f996	tamer: xir::flat::XirfToXir: New lowering operation This parser does exactly what it says it does. Its implementation is simple, but I added a test anyway just to prove that it works, and the test seems more complicated than the implementation itself, given the types involved. DEV-13708	2023-03-10 14:27:57 -05:00
Mike Gerwitz	a5a5a99dbd	tamer: asg::graph::visit::TreeWalkRel: New token type This introduces a `Token` in place of the original tuple for `TreePreOrderDfs` so that it can be used as input to a parser that will lower into XIRF. This requires that various things be describable (using `Display`), which this also adds. This is an example of where the parsing framework itself enforces system observability by ensuring that every part of the system can describe its state. DEV-13708	2023-03-10 14:27:57 -05:00
Mike Gerwitz	7f3ce44481	tamer: asg::graph: Formalize dynamic relationships (edges) The `TreePreOrderDfs` iterator needed to expose additional edge context to the caller (specifically, the `Span`). This was getting a bit messy, so this consolodates everything into a new `DynObjectRel`, which also emphasizes that it is in need of narrowing. Packing everything up like that also allows us to return more information to the caller without complicating the API, since the caller does not need to be concerned with all of those values individually. Depth is kept separate, since that is a property of the traversal and is not stored on the graph. (Rather, it _is_ a property of the graph, but it's not calculated until traversal. But, depth will also vary for a given node because of cross edges, and so we cannot store any concrete depth on the graph for a given node. Not even a canonical one, because once we start doing inlining and common subexpression elimination, there will be shared edges that are _not_ cross edges (the node is conceptually part of _both_ trees). Okay, enough of this rambling parenthetical.) DEV-13708	2023-03-10 14:27:57 -05:00
Mike Gerwitz	2b2776f4e1	tamer: asg::graph::object::rel: Extract object relationships DEV-13708	2023-03-10 14:27:57 -05:00

21 Commits (ba38a3c1ba4bd0c87fea98f8ca63f689ecaa7e9c)