bison

mirror of https://git.savannah.gnu.org/git/bison.git synced 2026-07-23 10:20:32 +00:00

Author	SHA1	Message	Date
Akim Demaille	3da17724ad	doc: updates * NEWS, TODO: here.	2020-09-02 21:37:23 +02:00
Akim Demaille	2f8a874215	portability: we use termios.h and sys/ioctl.h Reported by Maarten De Braekeleer. https://lists.gnu.org/r/bison-patches/2020-07/msg00079.html * bootstrap.conf (gnulib_modules): Add termios and sys_ioctl.	2020-08-02 08:36:49 +02:00
Akim Demaille	f47a1bd622	todo: updates for D	2020-07-30 07:14:57 +02:00
Akim Demaille	dc72b3566d	bistromathic: demonstrate caret-diagnostics * examples/c/bistromathic/parse.y (user_context): We need the current line. (yyreport_syntax_error): Quote the guilty line, with squiggles. * examples/c/bistromathic/bistromathic.test: Adjust.	2020-07-11 18:06:45 +02:00
Akim Demaille	156e548341	cex: give more details about -Wcex and -rcex * data/bison-default.css: Cobalt does not seem to be supported. * doc/bison.texi (Counterexamples): A new section. (Understanding): Show the counterexamples as it shows in the report: with its items. (Bison Options): Document -Wcex and -rcex.	2020-07-04 11:43:35 +02:00
Akim Demaille	84ef175287	news, todo: update	2020-07-01 07:05:41 +02:00
Akim Demaille	330552ea49	yacc.c: push: don't clear the parser state when accepting/rejecting Currently when a push parser finishes its parsing (i.e., it did not return YYPUSH_MORE), it also clears its state. It is therefore impossible to see if it had parse errors. In the context of autocompletion, because error recovery might have fired, the parser is actually already in a different state. For instance on `(1 + + <TAB>` in the bistromathic, because there's a `exp: "(" error ")"` recovery rule, `1 + +` tokens have already been popped, replaced by `error`, and autocompletions think we are ready for the closing ")". So here, we would like to see if there was a syntax error, yet `yynerrs` was cleared. In the case of a successful parse, we still have a problem: if error recovery succeeded, we won't know it, since, again, `yynerrs` is cleared. It seems much more natural to leave the parser state available for analysis when there is a failure. To reuse the parser, we should either: 1. provide an explicit means to reinitialize a parser state for future parses. 2. automatically reset the parser state when it is used in a new parse. Option 2 requires checking whether we need to reinitialize the parser each time we call `yypush_parse`, i.e., each time we give a new token. This seems expensive compared to Option 1, but benchmarks revealed no difference. Option 1 is incompatible with the documentation ("After `yypush_parse` returns a status other than `YYPUSH_MORE`, the parser instance `yyps` may be reused for a new parse."). So Option 2 wins, reusing the private `yynew` member to record that a parse was finished, and therefore that the state must be reset in the next call to `yypull_parse`. While at it, this implementation now reuses the previously enlarged stacks from one parse to another.
* data/skeletons/yacc.c (yypstate_new): Set up the stacks in their initial configurations (setting their bottom to the stack array), and use yypstate_clear to reset them (moving their top to their bottom). (yypstate_delete): Adjust. (yypush_parse): At the beginning, clear yypstate if needed, and at the end, record when yypstate needs to be cleared. * examples/c/bistromathic/parse.y (expected_tokens): Do not propose autocompletion when there are parse errors. * examples/c/bistromathic/bistromathic.test: Check that case.	2020-06-29 19:36:41 +02:00
Akim Demaille	688b3404a2	doc: tidy the text files * etc/README: Rename/reformat as... * etc/README.md: this. And ship it.	2020-06-29 19:10:05 +02:00
Akim Demaille	feb0bb0a59	style: rename endtoken as eoftoken * src/symtab.h, src/symtab.c (endtoken): Rename as... (eoftoken): this. Adjust dependencies.	2020-06-27 17:31:59 +02:00
Akim Demaille	0895858d8e	style: use 'nonterminal' consistently * doc/bison.texi: Formatting changes. * src/gram.h, src/gram.c (nvars): Rename as... (nnterms): this. Adjust dependencies. (section): New. Use it. Replace "non terminal" and "non-terminal" by "nonterminal".	2020-06-27 11:39:32 +02:00
Akim Demaille	c4b1a2b68f	doc: use dot/'•' rather than point/'.' AFAICT, "dotted rule" is a more frequent synonym of "item" than "pointed rule". So let's migrate to using "dot" only. * doc/bison.texi: Use dot/'•' rather than point/'.'. * src/print-xml.c (print_core): Use dot rather than point. This is not backward compatible, but AFAICT, we don't have actual user of the XML output (but ourselves). So... * data/xslt/xml2dot.xsl, data/xslt/xml2text.xsl, * data/xslt/xml2xhtml.xsl, tests/report.at: ... adjust.	2020-06-23 07:37:29 +02:00
Akim Demaille	b65bd16e45	cex: display all the S/R conflicts, not just one per (state, rule) Before this commit, on %% exp : "if" exp "then" exp | "if" exp "then" exp "else" exp | exp "+" exp | "num" we used to not display the third counterexample below: Shift/reduce conflict on token "+": Example exp "+" exp . "+" exp First derivation exp ::=[ exp ::=[ exp "+" exp . ] "+" exp ] Second derivation exp ::=[ exp "+" exp ::=[ exp . "+" exp ] ] Shift/reduce conflict on token "else": Example "if" exp "then" "if" exp "then" exp . "else" exp First derivation exp ::=[ "if" exp "then" exp ::=[ "if" exp "then" exp . ] "else" exp ] Second derivation exp ::=[ "if" exp "then" exp ::=[ "if" exp "then" exp . "else" exp ] ] Shift/reduce conflict on token "+": Example "if" exp "then" exp . "+" exp First derivation exp ::=[ exp ::=[ "if" exp "then" exp . ] "+" exp ] Second derivation exp ::=[ "if" exp "then" exp ::=[ exp . "+" exp ] ] Shift/reduce conflict on token "+": Example "if" exp "then" exp "else" exp . "+" exp First derivation exp ::=[ exp ::=[ "if" exp "then" exp "else" exp . ] "+" exp ] Second derivation exp ::=[ "if" exp "then" exp "else" exp ::=[ exp . "+" exp ] ] * src/counterexample.c (counterexample_report_state): Don't stop at the first conflict. * tests/conflicts.at, tests/counterexample.at, tests/diagnostics.at, * tests/report.at: Adjust.	2020-06-23 06:56:04 +02:00
Akim Demaille	3dd8f2305a	cex: use the bullet in HTML * data/xslt/xml2xhtml.xsl: here.	2020-06-22 07:02:29 +02:00
Akim Demaille	efb65daa36	c++: get rid of global_tokens_and_yystype This was a hack to make it easier for people to migrate from yacc.c to lalr1.cc and from glr.c to glr.cc: when set, YYSTYPE and YYLTYPE were `#defined`. It was never documented (just mentioned in NEWS for Bison 2.2, 2006-05-19), but was used to simplify the test suite. Stop that: adjust the test suite to the skeletons, not the converse. In C++ use yy::parser::semantic_type, yy::parser::location_type, and yy::parser::token::MY_TOKEN, instead of YYSTYPE, YYLTYPE and MY_TOKEN. * data/skeletons/glr.cc, data/skeletons/lalr1.cc: Remove its support. * tests/actions.at, tests/c++.at, tests/calc.at: Adjust.	2020-06-16 08:14:42 +02:00
Akim Demaille	e077bf1ebc	cex: don't assume the terminal supports "•" Use of print_unicode_char suggested by Bruno Haible. https://lists.gnu.org/r/bug-gettext/2020-06/msg00012.html * src/gram.h (print_dot_fallback, print_dot): New. * src/gram.c, src/derivation.c: Use it. * tests/counterexample.at, tests/report.at: Adjust the test suite. * .travis.yml, README-hacking.md: Adjust.	2020-06-16 07:58:40 +02:00
Akim Demaille	c35e829a76	cex: also include in the report on --report=counterexamples And let --report=all include the counterexamples. * src/getargs.h, src/getargs.c (report_cex): New. * src/main.c: Compute counterexamples when -rcex is specified. * src/print.c: Include the counterexamples when -rcex is specified. * tests/conflicts.at, tests/existing.at, tests/local.at: Adjust.	2020-06-16 07:30:46 +02:00
Akim Demaille	d4f854e5b2	cex: also include the counterexamples in the report The report is the best place to show the details about counterexamples, since we have the state right under the nose. For instance: State 7 1 exp: exp . "⊕" exp 2 | exp . "+" exp 2 | exp "+" exp . [$end, "+", "⊕"] 3 | exp . "+" exp 3 | exp "+" exp . [$end, "+", "⊕"] "⊕" shift, and go to state 6 $end reduce using rule 2 (exp) $end [reduce using rule 3 (exp)] "+" reduce using rule 2 (exp) "+" [reduce using rule 3 (exp)] "⊕" [reduce using rule 2 (exp)] "⊕" [reduce using rule 3 (exp)] $default reduce using rule 2 (exp) Conflict between rule 2 and token "+" resolved as reduce (%left "+"). Shift/reduce conflict on token "⊕": 2 exp: exp "+" exp . 1 exp: exp . "⊕" exp Example exp "+" exp • "⊕" exp First derivation exp ::=[ exp ::=[ exp "+" exp • ] "⊕" exp ] Example exp "+" exp • "⊕" exp Second derivation exp ::=[ exp "+" exp ::=[ exp • "⊕" exp ] ] Reduce/reduce conflict on tokens $end, "+", "⊕": 2 exp: exp "+" exp . 3 exp: exp "+" exp . Example exp "+" exp • First derivation exp ::=[ exp "+" exp • ] Example exp "+" exp • Second derivation exp ::=[ exp "+" exp • ] Shift/reduce conflict on token "⊕": 3 exp: exp "+" exp . 1 exp: exp . "⊕" exp Example exp "+" exp • "⊕" exp First derivation exp ::=[ exp ::=[ exp "+" exp • ] "⊕" exp ] Example exp "+" exp • "⊕" exp Second derivation exp ::=[ exp "+" exp ::=[ exp • "⊕" exp ] ] * src/conflicts.h, src/conflicts.c (has_conflicts): New. * src/counterexample.h, src/counterexample.c (print_counterexample): Add a `prefix` argument. (counterexample_report_shift_reduce) (counterexample_report_reduce_reduce): Show the items when there's a prefix. * src/state-item.h, src/state-item.c (print_state_item): Add a `prefix` argument. * src/derivation.h, src/derivation.c (derivation_print) (derivation_print_leaves): Add a prefix argument. * src/print.c (print_state): When -Wcex is enabled, show the conflicts. * tests/report.at: Adjust.	2020-06-16 07:30:26 +02:00
Akim Demaille	c662b23735	Merge 'maint' * upstream/maint: maint: post-release administrivia version 3.6.4 glr.cc: don't leak glr.c/glr.cc scaffolding to the user Some fixes were needed to adjust to recent changes in glr.cc and glr.c. * data/skeletons/glr.cc: Stop messing with the user's epilogue to insert glr.cc code. We need that code to be inserted _before_ the user's epilogue, not after. So define b4_glr_cc_pre_epilogue. * data/skeletons/glr.c: Use it.	2020-06-16 07:16:00 +02:00
Akim Demaille	3f4ffea6f2	glr.cc: don't leak glr.c/glr.cc scaffolding to the user Until we have a decent reimplementation of glr.cc, we have to use tricks to shoehorn C++ symbols to the C engine of glr.c. Some of them are done via #define. Unfortunately in Bison 3.6 some of these were done in the header file, which broke valid user code. Reported by Egor Pugin. https://lists.gnu.org/r/bug-bison/2020-06/msg00003.html * data/skeletons/glr.cc: Stop playing tricks with b4_pre_epilogue. (b4_glr_cc_setup, b4_glr_cc_cleanup): New. Much cleaner way to install glr.cc's scaffolding around glr.c. * data/skeletons/glr.c: Adjust to use them.	2020-06-15 20:18:47 +02:00
Akim Demaille	a53c6026cd	api.header.include: document it, and fix its default value While defining api.header.include worked as expected, its default value was incorrectly defined. As a result, by default, the generated parsers still duplicated the content of the generated header instead of including it. * data/skeletons/yacc.c (api.header.include): Fix its default value. * tests/output.at: Check it. * doc/bison.texi (%define Summary): Document api.header.include. While at it, move the definition of api.namespace at the proper place.	2020-06-09 08:09:26 +02:00
Akim Demaille	e7aff57122	style: rename user_token_number as code This should have been done in 3.6, but I wanted to avoid introducing conflicts into Vincent's work on counterexamples. It turns out it's completely orthogonal. * data/README.md, data/skeletons/bison.m4, data/skeletons/c++.m4, * data/skeletons/c.m4, data/skeletons/glr.c, data/skeletons/java.m4, * data/skeletons/lalr1.d, data/skeletons/lalr1.java, * data/skeletons/variant.hh, data/skeletons/yacc.c, src/conflicts.c, * src/derives.c, src/gram.c, src/gram.h, src/output.c, * src/parse-gram.c, src/parse-gram.y, src/print-xml.c, src/print.c, * src/reader.c, src/symtab.c, src/symtab.h, tests/input.at, * tests/types.at: s/user_token_number/code/g. Plus minor changes.	2020-05-23 08:43:58 +02:00
Akim Demaille	da5317cc9d	cex: isolate missing API from gl_list * src/counterexample.c (list_get_end): New. Use it. Reduce scopes.	2020-05-22 07:52:27 +02:00
Akim Demaille	8ef0b12eb7	Merge branch 'maint' * upstream/maint: maint: post-release administrivia version 3.6.2 tests: improve update-test CI: add GCC 10 and Clang 10 fix: do not emit nested comments todo: update examples: use markdown hyperlinks tests: don't use == to compare const char *... gnulib: update	2020-05-17 09:16:51 +02:00
Akim Demaille	4619b32dc0	examples: don't promote unchecked function calls * etc/bench.pl.in, examples/c/bistromathic/parse.y, * examples/c/calc/calc.y, examples/c/pushcalc/calc.y: Check scanf's return value. * doc/bison.texi: Likewise, but only for the second example, to avoid cluttering the very simple case.	2020-05-16 14:39:57 +02:00
Akim Demaille	6a28e6d412	todo: update	2020-05-15 07:18:15 +02:00
Akim Demaille	2b63c54f5a	style: minor fixes * examples/c/README.md: here.	2020-05-10 08:03:30 +02:00
Akim Demaille	2ab4058de0	style: minor fixes * examples/c/README.md: here.	2020-05-09 16:43:59 +02:00
Akim Demaille	7727693711	todo: update	2020-05-04 07:37:40 +02:00
Akim Demaille	13a1537dba	java: demonstrate push parsers * data/skeletons/lalr1.java (Location): Make it a static class. (Lexer.yylex, Lexer.getLVal, Lexer.getStartPos, Lexer.getEndPos): These are not needed in push parsers. * examples/java/calc/Calc.y: Demonstrate push parsers in the Java. * doc/bison.texi: Push parsers have been supported for a long time, remove incorrect statements stating the opposite.	2020-05-03 11:28:36 +02:00
Akim Demaille	dbd8fd71ba	todo: more	2020-05-02 08:18:20 +02:00
Akim Demaille	0c0e778bd1	news: make it more consistent * NEWS: Use the same pattern for titles.	2020-05-01 10:36:05 +02:00
Akim Demaille	99efa35369	doc: document YYEOF, YYUNDEF and YYerror * doc/bison.texi (Special Tokens): New. * examples/c/bistromathic/parse.y: Formatting changes.	2020-04-29 08:23:55 +02:00
Akim Demaille	3b05de2d05	yacc.c: install backward compatibility for YYERRCODE Some people have been using that symbol. Some even have #defined it themselves. https://lists.gnu.org/r/bison-patches/2020-04/msg00138.html Let's provide backward compatibility, having it point to YYUNDEF, so that an error message is generated. * data/skeletons/yacc.c (YYERRCODE): New, at the exact same location it was defined before.	2020-04-28 08:26:49 +02:00
Akim Demaille	902a235ad3	style: c++: s/type/kind/ where appropriate These are internal details. `type_get ()` is still there to ensure backward compatibility, `kind ()` being the modern way. * data/skeletons/c++.m4 (by_type, by_type::type): Rename as... (by_kind, by_kind::kind_): this. Adjust dependencies.	2020-04-28 08:16:05 +02:00
Akim Demaille	11027558c8	java: clean up the definition of token kinds From public interface Lexer { /* Token kinds. */ /* Token number, to be returned by the scanner. */ static final int YYEOF = 0; /* Token number, to be returned by the scanner. */ static final int YYERRCODE = 256; /* Token number, to be returned by the scanner. */ static final int YYUNDEF = 257; /* Token number, to be returned by the scanner. */ static final int BANG = 258; ... /* Deprecated, use b4_symbol(0, id) instead. */ public static final int EOF = YYEOF; to public interface Lexer { /* Token kinds. */ /* Token "end of file", to be returned by the scanner. */ static final int YYEOF = 0; /* Token error, to be returned by the scanner. */ static final int YYerror = 256; /* Token "invalid token", to be returned by the scanner. */ static final int YYUNDEF = 257; /* Token "!", to be returned by the scanner. */ static final int BANG = 258; ... /* Deprecated, use YYEOF instead. */ public static final int EOF = YYEOF; * data/skeletons/java.m4 (b4_token_enum): Display the symbol's tag in comment. * data/skeletons/lalr1.java: Address overquotation issue. * examples/java/calc/Calc.y, examples/java/simple/Calc.y: Use YYEOF, not EOF.	2020-04-28 07:56:00 +02:00
Akim Demaille	cd4e799da4	error: rename the error token from YYERRCODE to YYerror See https://lists.gnu.org/r/bison-patches/2020-04/msg00162.html. * data/skeletons/bison.m4, data/skeletons/c.m4, data/skeletons/glr.cc, * data/skeletons/lalr1.java, doc/bison.texi, * examples/c/bistromathic/parse.y, src/scan-gram.l, src/symtab.c (YYERRCODE): Rename as... (YYerror): this. Adjust dependencies.	2020-04-28 07:54:07 +02:00
Akim Demaille	e6d928c4e8	todo: update	2020-04-26 19:55:52 +02:00
Akim Demaille	401e7c5c36	todo: update for YYERRCODE	2020-04-24 19:03:12 +02:00
Akim Demaille	5ab0086157	tokens: clean up the translation of special symbols * src/output.c (prepare_symbol_names): Don't play tricks with the symbols, it's quite too late. (has_translations): Move to... * src/symtab.c: here. (symbols_pack): Use it to enable translation for special symbols.	2020-04-19 15:40:12 +02:00
Akim Demaille	d6ae95fb50	c++: give public access to the symbol kind symbol_type::token () was removed: it returned the token kind of a symbol. To do that, one needs to convert from the symbol kind to the token kind, which requires a table. This broke some users' unit tests for scanners, see https://lists.gnu.org/r/bug-bison/2020-01/msg00001.html https://lists.gnu.org/r/bug-bison/2020-03/msg00020.html https://lists.gnu.org/r/help-bison/2020-04/msg00005.html Instead of making this possible again, let's check the symbol's kind instead. So give proper access to a symbol's kind. That feature existed, undocumented, as 'type_get()'. Let's rename this as 'kind()'. * data/skeletons/c++.m4, data/skeletons/glr.cc, * data/skeletons/lalr1.cc (type_get): Rename as... (kind): This. (type_get): Install a backward compatibility alias. * doc/bison.texi (Complete Symbols): Document symbol_type and symbol_type::kind.	2020-04-18 08:03:59 +02:00
Akim Demaille	e86b14069d	doc: token_kind_type in C++ * data/skeletons/c++.m4: Define the old names in terms of the new ones, instead of the converse. * doc/bison.texi (C++ Parser Interface): Be more extensive about token_kind_type.	2020-04-17 08:53:37 +02:00
Akim Demaille	5d983253f7	doc: updates for 3.6 * doc/bison.texi: More s/token type/token kind/. * NEWS: Update.	2020-04-16 08:44:36 +02:00
Akim Demaille	758172a8b9	doc: spell check * doc/bison.texi, NEWS, README-hacking.md: here. And elsewhere.	2020-04-13 18:50:05 +02:00
Akim Demaille	dab08da605	java: promote YYEOF rather than Lexer.EOF * doc/bison.texi: here. * data/skeletons/lalr1.java: Use YYEOF.	2020-04-13 17:08:53 +02:00
Akim Demaille	258c2c967f	doc: java: SymbolKind, etc. Why didn't I think about this before??? symbolName should be a method of SymbolKind. * data/skeletons/lalr1.java (YYParser::yysymbolName): Move as... * data/skeletons/java.m4 (SymbolKind::getName): this. Make the table a static final table, not a local variable. Adjust dependencies. * doc/bison.texi (Java Parser Interface): Document i18n. (Java Parser Context Interface): Document SymbolKind. * examples/java/calc/Calc.y, tests/local.at: Adjust.	2020-04-13 16:54:48 +02:00
Akim Demaille	71e3f6d4da	d: put YYEMPTY in the TokenKind * data/skeletons/d.m4, data/skeletons/lalr1.d (b4_token_enums): Rename YYTokenType as TokenKind. Define YYEMPTY. * examples/d/calc.y, tests/calc.at, tests/scanner.at: Adjust.	2020-04-13 16:49:54 +02:00
Akim Demaille	5e2e9af56d	doc: use "code", not "number", for token (and symbol) kinds "Number" is too much about arithmetic. "Code" conveys better the "enum" nature of token kinds. And of symbol kinds. * doc/bison.texi: Here.	2020-04-12 19:24:44 +02:00
Akim Demaille	7a226860ef	doc: promote yytoken_kind_t, not yytokentype * data/skeletons/c.m4 (yytoken_kind_t): New. * data/skeletons/c++.m4, data/skeletons/lalr1.cc (yysymbol_kind_type): New. * examples/c/lexcalc/parse.y, examples/c/reccalc/parse.y, * tests/regression.at: Use them. * doc/bison.texi: Replace "enum yytokentype" by "yytoken_kind_t". (api.token.raw): Explain that it forces "yytoken_kind_t" to coincide with "yysymbol_kind_t". (Calling Convention): Mention YYEOF. (Table of Symbols): Add entries for "yytoken_kind_t" and "yysymbol_kind_t". (Glossary): Add entries for "Kind", "Token kind" and "Symbol kind".	2020-04-12 19:24:12 +02:00
Akim Demaille	72c9fa4510	skeletons: use "end of file" instead of "$end" The name "$end" is nice in the report, in particular it avoids that pointed-rules (aka items) be too long. It also helps keeping them "standard". But it is bad in error messages, we should report "end of file" (or maybe "end of input", this is debatable). So, unless the user already defined the alias for the error token herself, make it "end of file". It should even be translated if the user already translated some tokens, so that there is now no strong reason to redefine the $end token. * src/output.c (prepare_symbol_names): Issue "end of file" instead of "$end". * data/skeletons/lalr1.java (yytnamerr_): Remove the renaming hack. * build-aux/update-test: Accept files with names containing a "+", such as c++.at. * tests/actions.at, tests/c++.at, tests/conflicts.at, * tests/glr-regression.at, tests/regression.at, tests/skeletons.at: Adjust.	2020-04-12 13:56:44 +02:00
Akim Demaille	a555b41990	diagnostics: replace "user token number" by "token code" Yet, don't change the structure identifier to avoid introducing conflicts in Vincent Imbimbo's PR (which, amusingly enough, is about conflicts). * src/symtab.c: here. * tests/diagnostics.at, tests/input.at: Adjust.	2020-04-12 13:56:44 +02:00
