added Format trait to support different BNF syntaxes, and a basic implementation of ABNF to demonstrate #154

Carlyle-Foster · 2025-01-02T06:40:52Z

i'm curious too hear peoples thoughts about this approach, other approaches i considered are:

functional programming
by this i mean making all the basic parsing functions higher-order functions
so you'd pass the format as a normal parameter and it 'binds' the parameter, returning a closure that has it
built in and thus can be composed with nom combinators

i rejected it because
it seems massively complicated to me and the use of closures would change the performance characteristics(EDIT: sic, we already use closures but they're monomorphized at compile-time, that's the whole idea behind nom)

global variables
pretty self-explanatory, you have a static format with a setter and getter to hide how it's implemented,
could be through a RwLock, i even tried an unsafe static mut because i wasn't sure that generating
multiple BNF's of different formats in parallel was really desirable to support

i rejected it because
aside from my obvious unease about globals, it turns out that it breaks the whole test suite in
unpredictable ways because reason tests all "see" the same global variable, having the tests that use
ABNF set it back to the default afterwards and using the RwLock implementation helped a bit,
but i think i'd have to rewrite all the tests to make sure this new "environment" is set correctly and i' don't see
why i should when other options set the bar much higher

code duplication
the idea is i just copy and paste the entirety of parsers.rs into a new module and just change the low-level
prod_lhs() and nonterminal, this was what i tried at first, you can see it working in the second commit
here, this option is already pretty good because there's not much code to duplicate and it's not very complex

i only rejected code duplication because i think the Format trait is a better way of generating the same code
but with the duplication done automatically so it's always in sync, at the minor cost of always requiring a type parameter,
even up to the interface, i wanted to have it be an argument in the API at first but now that would be misleading as to how
it works, no data is actually transferred at runtime

…tion

…ters ala parse_from::<ABNF>(input)

coveralls · 2025-01-02T07:01:26Z

coverage: 98.263% (-0.2%) from 98.471%
when pulling 64bea7e on Carlyle-Foster:main
into c6fb531 on shnewto:main.

Carlyle-Foster · 2025-01-02T13:35:25Z

i added support for comments! there's no dedicated tests but i added some random comments to the example example grammars in tests/fixtures so it is still tested, it's just not obvious

this closes #37 i think

…n tests/grammar.rs

Carlyle-Foster · 2025-01-02T14:27:40Z

added automatic detection of grammar in Grammar::FromStr(), as suggested in #17

CrockAgile · 2025-01-03T04:19:44Z

wow this looks great! I will be sure to give it a solid review soon 👍

Carlyle-Foster · 2025-01-03T17:22:05Z

@CrockAgile somewhat off-topic, but are the docs for benching out-of-date? it seems you've moved from criterion to divan but the docs don't even mention divan and criterion.rs is still there, when you run cargo bench like it says both get run but they seem to have the same tests so it takes twice as long as it should(more than that actually because criterion seems to be slower)

… increase compiletimes more than you'd expect(maybe?)

…e auto-detector

Carlyle-Foster · 2025-01-03T19:14:42Z

ideally every format should have it's own feature flag, but i'll have to genericise the test suite before i can do that

Carlyle-Foster · 2025-01-03T20:04:35Z

do we support unicode? the types imply it but i'm not certain that we aren't assuming it's ASCII somewhere, i would hope not since we're using &str

CrockAgile · 2025-01-05T00:42:58Z

First, thanks for the contribution! I don't see anything immediately blocking merging this. I particularly liked your explanation of the alternatives considered, and why the Format trait would be preferred.

I've gone thru the PR and I agree with your comments that there is work left to do, but I don't want to let perfection stand in the way of progress. So I am going to merge this, and we can iterate to address:

ideally every format should have it's own feature flag, but i'll have to genericise the test suite before i can do that
more property tests to flex the ABNF parser
anything else you think would be a good next step

@CrockAgile somewhat off-topic, but are the docs for benching out-of-date? it seems you've moved from criterion to divan but the docs don't even mention divan and criterion.rs is still there, when you run cargo bench like it says both get run but they seem to have the same tests so it takes twice as long as it should(more than that actually because criterion seems to be slower)

yes the benchmarking docs are a bit out of date! when divan was added, it was still relatively new. it may be time to reevaluate which benchmark suite best matches the crate's needs

do we support unicode? the types imply it but i'm not certain that we aren't assuming it's ASCII somewhere, i would hope not since we're using &str

yes unicode should be supported! I will make an issue to add track adding more tests and explicit documentation for this

Carlyle-Foster added 4 commits January 1, 2025 14:42

small stuff

2ec3f59

added support for some ABNF features, with a good bit of code duplica…

c9963ce

…tion

refactored formats into traits that are passed around via type parame…

1981c76

…ters ala parse_from::<ABNF>(input)

applied rustfmt

33120b0

applied clippy

a2881a3

Carlyle-Foster force-pushed the main branch from 912375f to a2881a3 Compare January 2, 2025 07:51

added support for comments yay

d3cf685

added autodetecion of format in Grammar::FromStr(), tested slightly i…

caa7fcd

…n tests/grammar.rs

shnewto requested a review from CrockAgile January 2, 2025 15:25

Carlyle-Foster added 2 commits January 3, 2025 09:55

added adefault feature flag for ABNF since the monomorphization could…

5bc4516

… increase compiletimes more than you'd expect(maybe?)

fixed a bug where comments at the start of a grammar would confuse th…

2326fba

…e auto-detector

made ABNF nonterminals less... all-consuming

64bea7e

CrockAgile approved these changes Jan 5, 2025

View reviewed changes

CrockAgile merged commit 2479db7 into shnewto:main Jan 5, 2025
9 checks passed

This was referenced Jan 5, 2025

ABNF review pass #155

Merged

unicode support #156

Open

update benchmark docs to mention divan #157

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added Format trait to support different BNF syntaxes, and a basic implementation of ABNF to demonstrate #154

added Format trait to support different BNF syntaxes, and a basic implementation of ABNF to demonstrate #154

Carlyle-Foster commented Jan 2, 2025 •

edited

Loading

coveralls commented Jan 2, 2025 •

edited

Loading

Carlyle-Foster commented Jan 2, 2025

Carlyle-Foster commented Jan 2, 2025

CrockAgile commented Jan 3, 2025

Carlyle-Foster commented Jan 3, 2025 •

edited

Loading

Carlyle-Foster commented Jan 3, 2025 •

edited

Loading

Carlyle-Foster commented Jan 3, 2025

CrockAgile commented Jan 5, 2025 •

edited

Loading

added Format trait to support different BNF syntaxes, and a basic implementation of ABNF to demonstrate #154

added Format trait to support different BNF syntaxes, and a basic implementation of ABNF to demonstrate #154

Conversation

Carlyle-Foster commented Jan 2, 2025 • edited Loading

coveralls commented Jan 2, 2025 • edited Loading

Carlyle-Foster commented Jan 2, 2025

Carlyle-Foster commented Jan 2, 2025

CrockAgile commented Jan 3, 2025

Carlyle-Foster commented Jan 3, 2025 • edited Loading

Carlyle-Foster commented Jan 3, 2025 • edited Loading

Carlyle-Foster commented Jan 3, 2025

CrockAgile commented Jan 5, 2025 • edited Loading

Carlyle-Foster commented Jan 2, 2025 •

edited

Loading

coveralls commented Jan 2, 2025 •

edited

Loading

Carlyle-Foster commented Jan 3, 2025 •

edited

Loading

Carlyle-Foster commented Jan 3, 2025 •

edited

Loading

CrockAgile commented Jan 5, 2025 •

edited

Loading