employer/tame - tame - Mike Gerwitz's Forge

employer

tame

Author	SHA1	Message	Date
Mike Gerwitz	8d92667388	tamer: Integrate xir::reader as a parser in the lowering pipeline This allows `XmlXirReader` to be used in a `Lower` operation, just as everything else, bringing me one step closer to a pipeline that can be concisely represented; this is finally beginning to unify in a clear way, though it is still a bit of a mess. This causes `XmlXirReader` to _act_ like a `parse::Parser` in that it yields a `ParsedResult`, but it does not use `parse::Parser` itself; that was the _original_ plan: convert it into a `ParseState` where `XmlXirReader` became a context, and force `Parser` to yield by feeding it a stream of tokens with `repeat`, but that ended up performing poorly relative to this change. I did some investigation, which I might write about in the future, but for now, this solution works just fine. DEV-7145	2022-06-02 10:30:44 -04:00
Mike Gerwitz	f8c28655dc	tamer: parse: Split into multiple modules This abstraction has grown quite a bit, and it's time to start formalizing it a bit. This split doesn't change any behavior, but it does start to make it easier to reason about by clearly stating the broad components and how they interact with one-another. This doesn't yet move the tests; those will come next, but they are very few. The reason I gave previously for this was because (a) they're tested indirectly via the systems that utilize them and (b) because the abstraction was not yet settled on the process was already very expensive. No test coverage was lost---it's only that failures were potentially harder to debug on test failures, but in practice not even this was true, because the deeply expressive types all but ensured that, if it compiles, it will function in a way that is expected. Unit tests and documentation for this system will be added once I'm sure that this abstraction is in a proper state. DEV-7145	2022-06-01 11:32:58 -04:00
Mike Gerwitz	63aa452197	tamer: parse: Move parse::lower into Lower This also modifies `poc` such that `Lower` is invoked as an associated function rather than a method to emphasize the pattern that is forming, so that it can be later abstracted away. DEV-11864	2022-06-01 11:15:43 -04:00
Mike Gerwitz	f40f8bbafc	tamer: parse: Rename {lower__while_ok=>lower_} The `while_ok` can just be implied with a lowering operation, and that reduces the name complexity so that we can maybe introduce even more specialized methods without resulting in a huge sentence as a name. DEV-11864	2022-05-27 14:10:55 -04:00
Mike Gerwitz	b084e23497	tamer: Refactor asg_builder into obj::xmlo::lower and asg::air This finally uses `parse` all the way up to aggregation into the ASG, as can be seen by the mess in `poc`. This will be further simplified---I just need to get this committed so that I can mentally get it off my plate. I've been separating this commit into smaller commits, but there's a point where it's just not worth the effort anymore. I don't like making large changes such as this one. There is still work to do here. First, it's worth re-mentioning that `poc` means "proof-of-concept", and represents things that still need a proper home/abstraction. Secondly, `poc` is retrieving the context of two parsers---`LowerContext` and `Asg`. The latter is desirable, since it's the final aggregation point, but the former needs to be eliminated; in particular, packages need to be worked into the ASG so that `found` can be removed. Recursively loading `xmlo` files still happens in `poc`, but the compiler will need this as well. Once packages are on the ASG, along with their state, that responsibility can be generalized as well. That will then simplify lowering even further, to the point where hopefully everything has the same shape (once final aggregation has an abstraction), after which we can then create a final abstraction to concisely stitch everything together. Right now, Rust isn't able to infer `S` for `Lower<S, LS>`, which is unfortunate, but we'll be able to help it along with a more explicit abstraction. DEV-11864	2022-05-27 13:51:29 -04:00
Mike Gerwitz	eafb3b2a1b	tamer: Add Display impl for each ParseState for generic ParseErrors This is intended to describe, to the user, the state that the parser is in. This will be used to convey additional information for general parser errors, but it should also probably be integrated into parsers' individual errors as well when appropriate. This is something I expected to add at some point, but I wanted to add them because, when dealing with lowering errors, it can be difficult to tell what parser the error originated from. DEV-11864	2022-05-25 15:26:02 -04:00
Mike Gerwitz	9edc32dd3b	tamer: parse::LowerIter: Generic inner TripIter iterator This commit is preparing to compose LowerIter directly. DEV-11864	2022-05-24 10:27:14 -04:00
Mike Gerwitz	f218c452b9	tamer: iter::trip: Flatten Result The `*_iter_while_ok` functions now compose like monads, flattening `Result` at each step and drastically simplifying handling of error types. This also removes the bunch of `?`s at the end of the expression, and allows me to use `?` within the callback itself. I had originally not used `Result` as the return type of the callback because I was not entirely sure how I was going to use them, but it's now clear that I _always_ use `Result` as the return type, and so there's no use in trying to be too accommodating; it can always change in the future. This is desirable not just for cleanup, but because trying to refactor `asg_builder` into a pair of `Parser`s is really messy to chain without flattening, especially given some state that has to leak temporarily to the caller. More on that in a future commit. DEV-11864	2022-05-20 16:08:16 -04:00
Mike Gerwitz	958a707e02	tamer: asg: Hoist Root from Ident into Object This was always the intent, but I didn't have a higher-level object yet. This removes all the awkwardness that existed with working the root in as an identifier. DEV-11864	2022-05-19 12:48:43 -04:00
Mike Gerwitz	6252758730	tamer: asg::Object: Introduce Object::Ident This wraps `Ident` in a new `Object` variant and modifies `Asg` so that its nodes are of type `Object`. This unfortunately requires runtime type checking. Whether or not that's worth alleviating in the future depends on a lot of different things, since it'll require my own graph implementation, and I have to focus on other things right now. Maybe it'll be worth it in the future. Note that this also gets rid of some doc examples that simply aren't worth maintaining as the API evolves. DEV-11864	2022-05-19 12:33:59 -04:00
Mike Gerwitz	f75f1b605e	tamer: num: Header typo correction	2022-05-19 12:02:38 -04:00
Mike Gerwitz	ebf1de5a60	tamer: asg::Ident{Object=>}: Rename I think this may have been renamed _from_ `Ident` some time ago, but I'm too lazy to check. In any case, the name is redundant. DEV-11864	2022-05-19 11:17:04 -04:00
Mike Gerwitz	7d76cb53f6	tamer: asg: Move SymAttrs conversion into asg_builder This is a lowering operation and does not belong here. What a tangled mess this all was (see recent commits); no wonder it was so confusing. DEV-11864	2022-05-19 11:07:15 -04:00
Mike Gerwitz	eae194abc6	tamer: asg::object: Merge into asg::ident Everything in this file relates to identifiers, and I'm about to introduce a higher-level object, one of which may be an identifier. DEV-11864	2022-05-19 11:05:20 -04:00
Mike Gerwitz	92dba0a28c	tamer: obj::xmlo::asg_builder::IdentKindError: Merge into AsgBuilderError Now that these are in the same module, there's no need for them to be separate from one-another. DEV-11864	2022-05-19 10:56:07 -04:00
Mike Gerwitz	07d2ec1ffb	tamer: Move Dim and {Sym=>}Dtype into num module A previous commit mentioned that there's not a place for `Dim`, and duplicated it between `asg` and `xmlo`. Well, `Dtype` is also needed in both, and so here's a home for now. `Dtype` has always been an inappropriate detail for the system and will one day be removed entirely in favor of higher-level types; the machine representation is up to the compiler to decide. DEV-11864	2022-05-19 10:39:21 -04:00
Mike Gerwitz	b2a79e930b	tamer: Move SymAttrs lowering into asg_builder asg_builder is about to be replaced, but in the process of simplifying the destination IR (the ASG), I'm moving things into the proper place. This never belonged here---it belongs with the actual lowering operation. Previously, this was not reasoned about in terms of a lowering operation, and was written when I was first introducing myself to Rust and trying to get a proof-of-concept linker working. DEV-11864	2022-05-19 10:28:17 -04:00
Mike Gerwitz	8948452b71	tamer: asg::ident::Dim: Narrow type This matches xmlo::Dim, and could be the same thing, if we can find a home for it in the future; it's not worth creating such a home right now when I'm not yet sure what else ought to live there; the duplication may be fine. The conversion from xmlo needs to be moved, and `Dim` is going to be used for more than just identifiers (expressions will have type inference performed). DEV-11864	2022-05-19 09:32:43 -04:00
Mike Gerwitz	263cb68380	tamer: parse: Persistent context This allows retrieving and providing a context to a `Parser`. This is intended for use with an aggregating parser, in particular to construct the ASG and return it. This is a component of a change that replaces `asg_builder` with a `Parser`-based lowering into the ASG, but there are still changes that need to be made to simplify things and complete its integration. DEV-11864	2022-05-18 16:15:09 -04:00
Mike Gerwitz	001499d921	tamer: parse::ParseError: Remove Eq trait bound Just as in other commits, since it's an unnecessary limitation. DEV-11864	2022-05-18 16:06:22 -04:00
Mike Gerwitz	3e277270a7	tamer: asg: Track roots on graph Previously, since the graph contained only identifiers, discovered roots were stored in a separate vector and exposed to the caller. This not only leaked details, but added complexity; this was left over from the refactoring of the proof-of-concept linker some time ago. This moves the root management into the ASG itself, mostly, with one item being left over for now in the asg_builder (eligibility classifications). There are two roots that were added automatically: - __yield - __worksheet The former has been removed and is now expected to be explicitly mapped in the return map, which is now enforced with an extern in `core/base`. This is still special, in the sense that it is explicitly referenced by the generated code, but there's nothing inherently special about it and I'll continue to generalize it into oblivion in the future, such that the final yield is just a convention. `__worksheet` is the only symbol of type `IdentKind::Worksheet`, and so that was generalized just as the meta and map entries were. The goal in the future will be to have this more under the control of the source language, and to consolodate individual roots under packages, so that the _actual_ roots are few. As far as the actual ASG goes: this introduces a single root node that is used as the sole reference for reachability analysis and topological sorting. The edges of that root node replace the vector that was removed. DEV-11864	2022-05-17 10:42:05 -04:00
Mike Gerwitz	34eb994a0d	tamer: asg::Asg::set_fragment: {ObjectRef=>SymbolId} In the actual implementation (outside of tests), this is always looking up before adding the symbol. This will simplify the API, while still retaining errors, since the identifier will fail the state transition if the identifier did not exist before attempting to set a fragment. So while this is slower in microbenchmarks, this has no effect on real-world performance. Further, I'm refactoring toward a streaming ASG aggregation, which is a lot easier if we do not need to perform lookups in a separate step from the ASG's primitives. DEV-11864	2022-05-16 13:14:27 -04:00
Mike Gerwitz	c49d87976d	tamer: parse::Token: Remove Eq trait bound `PartialEq` remains, and is all that is needed. See previous commit regarding the removal of this same bound from `Context`. This can be re-added if it ends up actually being necessary. But Tokens are ephemeral and used only in lowering pipelines, using pattern matching. DEV-11864	2022-05-16 10:05:14 -04:00
Mike Gerwitz	d87006391e	tamer: asg::object: Remove IdentObjectState, IdentObjectData These traits are no longer necessary now that I'm using concrete types; they just add unnecessary noise and confusion as I attempt to further refactor. Don't abstract prematurely. DEV-11864	2022-05-12 16:31:36 -04:00
Mike Gerwitz	3748762d31	tamer: asg::graph::Asg: Remove type parameter O This removes the generic on the Asg (which was formerly BaseAsg), hard-coding `IdentObject`, which will further evolve. This makes the IR an actual concrete IR rather than an abstract data structure. These tests bring me back a bit, since they were written as I was still becoming familiar with Rust. DEV-11864	2022-05-12 15:46:17 -04:00
Mike Gerwitz	f2c5443176	tamer: asg: Remove generic Asg, rename {Base=>}Asg This is the beginning of an incremental refactoring to remove generics, to simplify the ASG. When I initially wrote the linker, I wasn't sure what direction I was going in, but I was also negatively influenced by more traditional approaches to both design and unit testing. If we're going to call the ASG an IR, then it needs to be one---if the core of the IR is generic, then it's more like an abstract data structure than anything. We can abstract around the IR to slice it up into components that are a little easier to reason about and understand how responsibilities are segregated. DEV-11864	2022-05-11 16:47:13 -04:00
Mike Gerwitz	0493e68cb3	tamer: parse::ParseState::Context: Add missing comment DEV-11864	2022-05-10 11:06:22 -04:00
Mike Gerwitz	0ef0d2b553	tamer: parse::ParseState:Error: Relax Eq trait bound This is unnecessarily restrictive, since we do not require anything further than `PartialEq` for the situations where we care about equality (tests). DEV-11864	2022-05-06 15:28:47 -04:00
Mike Gerwitz	9f990e19e9	tamer: parse::ParseState::Context: Remove Default trait bound This is too restrictive, especially for parsers that fold into something, like the ASG, which may exist prior to invoking the parser. This moves the trait bound to the functions that actually need it. Those obviously cannot be used if the Context does not implement `Default`, but I'll provide alternative conveniences. DEV-11864	2022-05-05 15:55:04 -04:00
Mike Gerwitz	ba9f429ee7	tamer: obj::xmlo::{XmloEvent=>XmloToken} The original "event" name was based on quick-xml's `Event`. This terminology shift is more closely matched with the new parsing system. DEV-11864	2022-05-05 12:25:59 -04:00
Mike Gerwitz	0281dfdf0d	tamer: Remove wip-frontends feature flag We want the new system to be used so that we can start catching any problems that may arise. Further changes will be flagged as necessary. DEV-10936	2022-05-04 09:37:10 -04:00
Mike Gerwitz	1ad2fb1dc8	Copyright year update 2022 RSG (Ryan Specialty Group) recently announced a rename to Ryan Specialty (no "Group"), but I'm not sure if the legal name has been changed yet or not, so I'll wait on that.	2022-05-03 14:14:29 -04:00
Mike Gerwitz	34fcd19cd0	tamer: obj::xmlo::reader: Replace todo! with error These are no longer TODOs---they represent invalid tokens. I'm going to put effort into providing further context with the diagnostic system [right now] because these are internal errors caused by either miscompilation or an incomplete reader. DEV-10936	2022-05-03 09:19:47 -04:00
Mike Gerwitz	5875477efa	tamer: xir::Token: Remove span from Display This was missed when removing it from other Display impls when the new diagnostic system was introduced. Raw `Span`s display byte offsets and the context, which is no longer desirable as part of an error message. DEV-10936	2022-05-03 09:09:55 -04:00
Mike Gerwitz	a2e6e37ed1	tamer: Bump nightly Rust version 1.{57=>62} This removes a couple of feature flags that are no longer necessary.	2022-05-02 11:05:32 -04:00
Mike Gerwitz	7248ef77e4	tamer: diagnose::resolve{r=>}: Rename Consistent with naming of other modules, which prefers to not needlessly transform words. DEV-12151	2022-05-02 09:49:22 -04:00
Mike Gerwitz	75b966c577	tamer: diagnose: Additional documentation I had waited to provide more documentation until I was sure that the abstraction was not going to change significantly; there was a lot of refactoring in prior commits. DEV-12151	2022-05-02 09:44:53 -04:00
Mike Gerwitz	fc1dad8483	tamer: diagnose::report::Section: Further refactor resolved constructor This speaks for itself. DEV-12151	2022-04-29 15:54:38 -04:00
Mike Gerwitz	ba0ceddd2d	tamer: diagnose::report::Section: Constructor refactoring This moves construction out of `From` and into separate associated functions, which can be further simplified in a bit. We also need unit tests for this, since this still relies on integration tests due to the cost of the aggressive and tight refactoring iterations. DEV-12151	2022-04-29 13:10:04 -04:00
Mike Gerwitz	3e04217741	tamer: diagnose::report::Section::maybe_squash_into: Remove syslabel TODO Previously, when adjacent duplicate spans were both resolved, if one failed, the other certainly would, which would result in duplicate labels each squash. Elided spans do not have syslabels, and so this is no longer a concern. DEV-12151	2022-04-29 13:07:51 -04:00
Mike Gerwitz	2ae6df38e7	tamer: diagnose::report: Restore source line preview for invalid UTF-8 This was removed in a previous commit while working on simplifying the implementation, with the hope of returning to it once things were in a better place. They are, so let's bring it back. DEV-12151	2022-04-29 12:41:56 -04:00
Mike Gerwitz	f8dda12fae	tamer: diagnose::report: Remove TODOs that are no longer applicable These relate to the most recent commits. DEV-12151	2022-04-29 12:34:48 -04:00
Mike Gerwitz	2ce0dbdd84	tamer: diagnose::report::SpanLabel: Remove in favor of separate Level and Label `SpanLabel` was created during a very early refactoring of this system, and I've just been fighting with it sense. This removes it, and simplifies some things in the process. It also makes clear that `Level` is never optional and removes the awkward `Level::default` that was there previously; the default is now the lowest level, which will always be able to be escalated. DEV-12151	2022-04-29 12:13:11 -04:00
Mike Gerwitz	9a5a2c4f3f	tamer: diagnose::report: Avoid re-resolving adjacent identical spans This does what the original proof-of-concept implementation did---skip a span that was just processed, since it'll be squashed into the previous anyway. These duplicate spans originate from the diagnostic system when producing supplemental help information. DEV-12151	2022-04-29 11:57:50 -04:00
Mike Gerwitz	a533244473	tamer: diagnose::report::VisualReporter::render: Avoid mspan collection This used to be necessary when `Report` stored references to heap-allocated strings, but `Report` now owns those values itself. DEV-12151	2022-04-29 09:53:22 -04:00
Mike Gerwitz	b0a5265ad3	tamer: diagnose::report::test: Extract into separate file Tests are large and will be getting larger. The source will also grow as it's better documented and cleaned up. It's getting more difficult to navigate efficiently and concurrently modify implementation and tests, and parsing via LSP is getting slower with certain types of changes. DEV-12151	2022-04-29 09:23:06 -04:00
Mike Gerwitz	5c0e224d3c	tamer: diagnose::report: Line numbers in gutter Alright, starting to settle on an abstraction now, and things are coming together. This gives us line numbers in the previously-empty gutter, and widens the gutter to accommodate. Gutters are normalized across sections. Sections are not yet collapsed for sequential line numbers in the same context. Exciting! Here's an example, on an xmlo file: error: expected closing tag for `preproc:symtable` --> /home/.../foo.xmlo:16:4 \| 16 \| <preproc:symtable xmlns:map="http://www.w3.org/2005/xpath-functions/map"> \| ----------------- note: element `preproc:symtable` is opened here --> /home/.../foo.xmlo:11326:4 \| 11326 \| </preproc:wrong> \| ^^^^^^^^^^^^^^^^ error: expected `</preproc:symtable>` DEV-12151	2022-04-28 23:53:38 -04:00
Mike Gerwitz	5744e08984	tamer: diagnostic::report: Hoist gutter output into Section The `Section` itself is now responsible for outputting the gutter, which puts us in a position to be able to apply consistent formatting without having to propagate width data to every line variant.	2022-04-28 22:59:13 -04:00
Mike Gerwitz	4e03a367a5	tamer: diagnose::report::SourceLine: Separate variants for each line Now `SourceLine` _does_ actually correspond to a line of output, which will allow for better formatting (e.g. collapsing padding) and, importantly, proper management of gutters. Note that the seemingly unnecessary `SectionSourceLine` allows for a subtle consistent formatting for all variants' gutters in `SectionLine`, which will allow us to hoist that rendering out in the next commit. The other option was to include a trailing space for padding and marks, but that is not only sloppy and undesirable, but asking for confusion, especially in editors (like mine) that trim trailing whitespace. DEV-12151	2022-04-28 22:49:35 -04:00
Mike Gerwitz	fd1c6430a8	tamer: diagnose::report::SectionSourceLine: {Option<Column>=>Column} If a column isn't present, it degrades to displaying labels like footnotes anyway, so this simplifies the system rather than catering to a rare case. With that said, this does lose functionality, since it does not render the source line at all, even though we _could_ do so. I may re-introduce that rendering after some further refactoring, specifically for gutters. DEV-12151	2022-04-28 22:23:58 -04:00

1 2 3 4 5 ...

457 Commits (8d926673883bf61ef339cc2d91d45be270ffcb2e)