bison

mirror of https://git.savannah.gnu.org/git/bison.git synced 2026-06-08 08:42:35 +00:00

Author	SHA1	Message	Date
Akim Demaille	c22902e360	tables: fix handling for useless tokens In some rare conditions, the generated parser can be wrong when there are useless tokens. Reported by Balázs Scheidler. https://github.com/akimd/bison/issues/72 Balázs managed to prove that the bug was introduced in commit `af1c6f973a` Author: Theophile Ranquet <ranquet@lrde.epita.fr> Date: Tue Nov 13 10:38:49 2012 +0000 tables: use bitsets for a performance boost Suggested by Yuri at <http://lists.gnu.org/archive/html/bison-patches/2012-01/msg00000.html>. The improvement is marginal for most grammars, but notable for large grammars (e.g., PosgreSQL's postgre.y), and very large for the sample.y grammar submitted by Yuri in http://lists.gnu.org/archive/html/bison-patches/2012-01/msg00012.html. Measured with --trace=time -fsyntax-only. parser action tables postgre.y sample.y Before 0,129 (44%) 37,095 (99%) After 0,117 (42%) 5,046 (93%) * src/tables.c (pos): Replace this set of integer coded as an unsorted array of integers with... (pos_set): this bitset. which was implemented long ago, but that I installed only recently (March 2019), first published in v3.3.90. That patch introduces a bitset to represent a set of integers. It managed negative integers by using a (fixed) base (the smallest integer to represent). It avoided negative accesses into the bitset by ignoring integers smaller than the base, under the asumption that these cases correspond to useless tokens that are ignored anyway. While it turns out to be true for all the test cases in the test suite (!), Balázs' use case demonstrates that it is not always the case. So we need to be able to accept negative integers that are smaller than the current base. "Amusingly" enough, the aforementioned patch was visibly unsure about itself: /* Store PLACE into POS_SET. PLACE might not belong to the set of possible values for instance with useless tokens. It would be more satisfying to eliminate the need for this 'if'. / This commit needs several improvements in the future: - support from bitset for bit assignment and shifts - amortized resizing of pos_set - test cases src/tables.c (pos_set_base, pos_set_dump, pos_set_set, pos_set_test): New. Use them instead of using bitset_set and bitset_test directly.	2021-01-24 08:28:45 +01:00
Vincent Imbimbo	2c294c1325	cex: fix state-item pruning There were several bugs in pruning that would leave the state-item graph in an inconsistent state which could cause crashes later on: - Pruning now happens in one pass instead of two. - Disabled state-items no longer prune the state-items they transition to if that state-item has other states that transition to it. - State-items that transition to disabled state-items are always pruned even if they have productions. Reported by Michal Bartkowiak <michal.bartkowiak@nokia.com> https://lists.gnu.org/r/bug-bison/2021-01/msg00000.html and Zartaj Majeed https://github.com/akimd/bison/issues/71 * src/state-item.c (prune_forward, prune_backward): Fuse into... (prune_state_item): this. Adjust callers.	2021-01-24 08:24:56 +01:00
Akim Demaille	4109d56aa9	news: update	2021-01-23 15:02:49 +01:00
Akim Demaille	003ca0498d	package: bump copyrights to 2021 Run 'make update-copyright'.	2021-01-23 15:02:49 +01:00
Paul Eggert	8358090292	c: port to HP-UX 11.23 Problem reported by Albert Chin in: https://lists.gnu.org/r/bug-bison/2021-01/msg00029.html * data/skeletons/c.m4 (b4_c99_int_type_define): Work around HP-UX bug.	2021-01-21 11:45:34 -08:00
Akim Demaille	0364dbacbf	maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update.	2020-11-14 12:26:42 +01:00
Akim Demaille	7a11a9308c	version 3.7.4 * NEWS: Record release date.	2020-11-14 12:04:26 +01:00
Akim Demaille	d8cc6b073e	c++: shorten the assertions that check whether tokens are correct Before: YY_ASSERT (tok == token::YYEOF \|\| tok == token::YYerror \|\| tok == token::YYUNDEF \|\| tok == 120 \|\| tok == 49 \|\| tok == 50 \|\| tok == 51 \|\| tok == 52 \|\| tok == 53 \|\| tok == 54 \|\| tok == 55 \|\| tok == 56 \|\| tok == 57 \|\| tok == 97 \|\| tok == 98); After: YY_ASSERT (tok == token::YYEOF \|\| (token::YYerror <= tok && tok <= token::YYUNDEF) \|\| tok == 120 \|\| (49 <= tok && tok <= 57) \|\| (97 <= tok && tok <= 98)); Clauses are now also wrapped on several lines. This is nicer to read and diff, but also avoids pushing Visual C++ to its arbitrary limits (640K and lines of 16380 bytes ought to be enough for anybody, otherwise make an C2026 error). The useless parens are there for the dummy warnings about precedence (in the future, will we also have to put parens in `1+23`?). data/skeletons/variant.hh (_b4_filter_tokens, b4_tok_in, b4_tok_in): New. (_b4_token_constructor_define): Use them.	2020-11-13 06:17:52 +01:00
Akim Demaille	8b424b865e	lalr1.cc: YY_ASSERT should use api.prefix Working on the previous commit I realized that YY_ASSERT was used in the generated headers, so must follow api.prefix to avoid clashes when multiple C++ parser with variants are used. Actually many more macros should obey api.prefix (YY_CPLUSPLUS, YY_COPY, etc.). There was no complaint so far, so it's not urgent enough for 3.7.4, but it should be addressed in 3.8. * data/skeletons/variant.hh (b4_assert): New. Use it. * tests/local.at (AT_YYLEX_RETURN): Fix. * tests/headers.at: Make sure variant-based C++ parsers are checked too. This test did find that YY_ASSERT escaped renaming (before the fix in this commit).	2020-11-13 06:17:52 +01:00
Akim Demaille	f4431ea115	c++: don't use YY_ASSERT at all if parse.assert is disabled In some extreme situations (about 800 tokens), we generate a single-line assertion long enough for Visual C++ to discard the end of the line, thus falling into parse ends for the missing `);`. On a shorter example: YY_ASSERT (tok == token::TOK_YYEOF \|\| tok == token::TOK_YYerror \|\| tok == token::TOK_YYUNDEF \|\| tok == token::TOK_ASSIGN \|\| tok == token::TOK_MINUS \|\| tok == token::TOK_PLUS \|\| tok == token::TOK_STAR \|\| tok == token::TOK_SLASH \|\| tok == token::TOK_LPAREN \|\| tok == token::TOK_RPAREN); Whether NDEBUG is used or not is irrelevant, the parser dies anyway. Reported by Jot Dot <jotdot@shaw.ca>. https://lists.gnu.org/r/bug-bison/2020-11/msg00002.html We should avoid emitting lines so long. We probably should also use a range-based assertion (with extraneous parens to pacify fascist compilers): YY_ASSERT ((token::TOK_YYEOF <= tok && tok <= token::TOK_YYUNDEF) \|\| (token::TOK_ASSIGN <= tok && ...) But anyway, we should simply not emit this assertion at all when not asked for. * data/skeletons/variant.hh: Do not define, nor use, YY_ASSERT when it is not enabled.	2020-11-13 06:17:52 +01:00
Akim Demaille	21c147b6e5	yacc.c: provide the Bison version as an integral macro Suggested by Balazs Scheidler. https://github.com/akimd/bison/issues/55 * src/muscle-tab.c (muscle_init): Move/rename `b4_version` to/as... * src/output.c (prepare): `b4_version_string`. Also define `b4_version`. * data/skeletons/bison.m4, data/skeletons/c.m4, data/skeletons/d.m4, * data/skeletons/java.m4: Adjust. * doc/bison.texi: Document it.	2020-11-11 09:08:57 +01:00
Todd C. Miller	c47bb87f9f	yacc.c: fix #definition of YYEMPTY When generating a C parser, YYEMPTY is present in enum yytokentype but there is no corresponding #define like there is for the other values. There is a special case for YYEMPTY in b4_token_enums but no corresponding case in b4_token_defines. * data/skeletons/c.m4 (b4_token_defines): Do define YYEMPTY.	2020-11-11 08:47:21 +01:00
Akim Demaille	a15879c623	maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update.	2020-10-13 07:23:28 +02:00
Akim Demaille	5d501ee728	version 3.7.3 * NEWS: Record release date.	2020-10-13 07:01:24 +02:00
Akim Demaille	bc5e4541da	build: don't link bison against libreadline Reported by Paul Smith <psmith@gnu.org>. https://lists.gnu.org/r/bug-bison/2020-10/msg00001.html * src/local.mk (src_bison_LDADD): here.	2020-10-13 06:57:33 +02:00
Akim Demaille	dcdd119f69	maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update.	2020-09-05 18:31:25 +02:00
Akim Demaille	a0bc06b703	version 3.7.2 * NEWS: Record release date.	2020-09-05 18:06:16 +02:00
Akim Demaille	f7b642cff7	build: fix incorrect dependencies Commit `af000bab11` ("doc: work around Texinfo 6.7 bug"), published in 3.4.91, added a dependency on the "all" target. This is a super bad idea, since "make all" will run this target before "all", which builds bison. It turns out that this new dependency actually needed bison to be built. So all the regular process (i) build $(BUILT_SOURCES) and then (ii) build bison, was wrecked since some of the $(BUILT_SOURCES) depended on bison... It was "easy" to see in the logs of "make V=1" because we were building bison files (such as src/files.o) before displaying the banner for "all-recursive". With this fix, we finally get again the proper sequence: rm -f examples/c/reccalc/scan.stamp examples/c/reccalc/scan.stamp.tmp /opt/local/libexec/gnubin/mkdir -p examples/c/reccalc touch examples/c/reccalc/scan.stamp.tmp flex -oexamples/c/reccalc/scan.c --header=examples/c/reccalc/scan.h ./examples/c/reccalc/scan.l mv examples/c/reccalc/scan.stamp.tmp examples/c/reccalc/scan.stamp rm -f lib/fcntl.h-t lib/fcntl.h && \ { echo '/* DO NOT EDIT! GENERATED AUTOMATICALLY! /'; \ ... } > lib/fcntl.h-t && \ mv lib/fcntl.h-t lib/fcntl.h ... mv -f lib/alloca.h-t lib/alloca.h make all-recursive Reported by Mingli Yu <mingli.yu@windriver.com>. https://github.com/akimd/bison/issues/31 https://lists.gnu.org/r/bison-patches/2020-05/msg00055.html Reported by Claudio Calvelli <bugb@w42.org>. https://lists.gnu.org/r/bug-bison/2020-09/msg00001.html https://bugs.gentoo.org/716516 doc/local.mk (all): Rename as... (all-local): this. So that we don't compete with BUILT_SOURCES.	2020-09-05 17:42:20 +02:00
Akim Demaille	3da17724ad	doc: updates * NEWS, TODO: here.	2020-09-02 21:37:23 +02:00
Akim Demaille	a1b7fef045	c: always use YYMALLOC/YYFREE Reported by Kovalex <kovalex.pro@gmail.com>. https://lists.gnu.org/r/bug-bison/2020-08/msg00015.html * data/skeletons/yacc.c: Don't make direct calls to malloc/free. * tests/calc.at: Check it.	2020-08-30 10:05:18 +02:00
Akim Demaille	cb7dcb011e	maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update.	2020-08-02 09:32:34 +02:00
Akim Demaille	71579c7219	version 3.7.1 * NEWS: Record release date.	2020-08-02 09:10:02 +02:00
Akim Demaille	2f8a874215	portability: we use termios.h and sys/ioctl.h Reported by Maarten De Braekeleer. https://lists.gnu.org/r/bison-patches/2020-07/msg00079.html * bootstrap.conf (gnulib_modules): Add termios and sys_ioctl.	2020-08-02 08:36:49 +02:00
Akim Demaille	d975c2f76e	libtextstyle: be sure to have ostream_printf and hyperlink support Older versions of libtextstyle do not support them, rule them out. Reported by Lars Wendler https://lists.gnu.org/r/bug-bison/2020-07/msg00030.html and by Arnold Robbins https://lists.gnu.org/r/bug-bison/2020-07/msg00041.html and https://lists.gnu.org/mailman/private/gawk-devel/2020-July/003988.html and by Nelson H. F. Beebe https://lists.gnu.org/mailman/private/gawk-devel/2020-July/003993.html With support from Bruno Haible in gnulib https://lists.gnu.org/r/bug-gnulib/2020-08/msg00000.html thread starting at https://lists.gnu.org/r/bug-gnulib/2020-07/msg00148.html * configure.ac: Require libtextstyle 0.20.5. * gnulib: Update.	2020-08-02 08:19:35 +02:00
Akim Demaille	cb65553449	diagnostics: better location for type redeclarations From foo.y:1.7-11: error: %type redeclaration for bar 1 \| %type <foo> bar bar \| ^~~~~ foo.y:1.7-11: note: previous declaration 1 \| %type <foo> bar bar \| ^~~~~ to foo.y:1.17-19: error: %type redeclaration for bar 1 \| %type <foo> bar bar \| ^~~ foo.y:1.13-15: note: previous declaration 1 \| %type <foo> bar bar \| ^~~ * src/symlist.h, src/symlist.c (symbol_list_type_set): There's no need for the tag's location, use that of the symbol. * src/parse-gram.y: Adjust. * tests/input.at: Adjust.	2020-08-01 08:54:46 +02:00
Akim Demaille	6accee7716	doc: refer to cex from sections dealing with conflicts The documentation about -Wcex should be put forward. * doc/bison.texi: Refer to -Wcex from the sections about conflicts.	2020-07-28 07:45:07 +02:00
Akim Demaille	72b3c1a673	maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update.	2020-07-23 20:15:38 +02:00
Akim Demaille	6675d36e25	version 3.7 * NEWS: Record release date.	2020-07-23 19:58:16 +02:00
Akim Demaille	79e68b6c4d	doc: fix definition of -Wall * doc/bison.texi (Diagnostics): here.	2020-07-23 09:17:18 +02:00
Akim Demaille	431774d1f6	cex: update NEWS for 3.7 * NEWS: Update to the current style of cex display.	2020-07-22 07:36:02 +02:00
Akim Demaille	28769d608e	maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update.	2020-07-20 07:58:03 +02:00
Akim Demaille	a22588bcb9	version 3.6.93 * NEWS: Record release date.	2020-07-20 07:37:47 +02:00
Akim Demaille	744da03955	glyphs: fix types The code was written on top of buffers of `char[26]`, and then was changed to use `char `, yet was still using `sizeof buf`, which became `sizeof (char )` instead of `sizeof (char[26])`. Reported by Dagobert Michelsen. https://lists.gnu.org/r/bug-bison/2020-07/msg00023.html * src/glyphs.h, src/glyphs.c: Get rid of uses of `char *`, use only glyph_buffer_t.	2020-07-19 17:09:01 +02:00
Akim Demaille	b28d67b6b0	maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update.	2020-07-19 09:45:36 +02:00
Akim Demaille	cf890692f1	version 3.6.92 * NEWS: Record release date.	2020-07-19 09:24:57 +02:00
Akim Demaille	fff17fe8fe	cex: display derivations as trees Sometimes, understanding the derivations is difficult, because they are serialized to fit in one line. For instance, the example taken from the NEWS file: %token ID %% s: a ID a: expr expr: expr ID ',' \| "expr" gave First example expr • ID ',' ID $end Shift derivation $accept → [ s → [ a → [ expr → [ expr • ID ',' ] ] ID ] $end ] Second example expr • ID $end Reduce derivation $accept → [ s → [ a → [ expr • ] ID ] $end ] Printing as trees, it gives: First example expr • ID ',' ID $end Shift derivation $accept ↳ s $end ↳ a ID ↳ expr ↳ expr • ID ',' Second example expr • ID $end Reduce derivation $accept ↳ s $end ↳ a ID ↳ expr • * src/glyphs.h, src/glyphs.c (down_arrow, empty, derivation_separator): New. * src/derivation.c (derivation_print, derivation_print_impl): Rename as... (derivation_print_flat, derivation_print_flat_impl): These. (fputs_if, derivation_depth, derivation_width, derivation_print_tree) (derivation_print_tree_impl, derivation_print): New. * src/counterexample.c (print_counterexample): Adjust. * tests/conflicts.at, tests/counterexample.at, tests/diagnostics.at, * tests/report.at: Adjust.	2020-07-18 07:54:02 +02:00
Akim Demaille	4f9ae5de07	cex: display shifts before reductions When reporting counterexamples for s/r conflicts, put the shift first. This is more natural, and displays the default resolution first, which is also what happens for r/r conflicts where the smallest rule number is displayed first, and "wins". * src/counterexample.c (counterexample): Add a shift_reduce member. (new_counterexample): Adjust. Swap the derivations when this is a s/r conflict. (print_counterexample): For s/r conflicts, prefer "Shift derivation" and "Reduce derivation" rather than "First/Second derivation". * tests/conflicts.at, tests/counterexample.at, tests/report.at: Adjust. * NEWS, doc/bison.texi: Ditto.	2020-07-14 06:48:48 +02:00
Akim Demaille	ee86ea8839	cex: prefer → to ::= It does not make a lot of sense to use ::= in our counterexamples, that's not something that belongs to the Bison "vocabulary". Using the colon makes sense, but it's too discreet. Let's use the arrow, which we already use in some reports (HTML and Dot). * src/gram.h (print_dot_fallback): Generalize into... (print_fallback): this. (print_arrow): New. * src/derivation.c: Use it. * NEWS, tests/conflicts.at, tests/counterexample.at, * tests/diagnostics.at, tests/report.at: Adjust. * doc/bison.texi: Ditto. Unfortunately the literal `→` is output as `↦`. So we need to use @arrow.	2020-07-11 18:43:46 +02:00
Akim Demaille	dc72b3566d	bistromathic: demonstrate caret-diagnostics * examples/c/bistromathic/parse.y (user_context): We need the current line. (yyreport_syntax_error): Quote the guilty line, with squiggles. * examples/c/bistromathic/bistromathic.test: Adjust.	2020-07-11 18:06:45 +02:00
Akim Demaille	a7ed13b25f	maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update.	2020-07-09 21:38:23 +02:00
Akim Demaille	2d90916067	version 3.6.91 * NEWS: Record release date.	2020-07-09 21:10:23 +02:00
Akim Demaille	91e5a23ff2	news: update	2020-07-09 20:29:24 +02:00
Akim Demaille	7c0d36b760	maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update.	2020-07-04 12:37:45 +02:00
Akim Demaille	b4d18c7fc6	version 3.6.90 * NEWS: Record release date.	2020-07-04 12:14:28 +02:00
Akim Demaille	3e6e51cf5c	news: update	2020-07-04 12:12:33 +02:00
Akim Demaille	526ef45f30	news: update	2020-07-03 07:07:28 +02:00
Akim Demaille	413908e5a4	news: formatting changes	2020-07-01 07:05:48 +02:00
Akim Demaille	84ef175287	news, todo: update	2020-07-01 07:05:41 +02:00
Vincent Imbimbo	1247d94ba6	doc: cex documentation * NEWS, doc/bison.texi: Add documentation for conflict counterexample generation.	2020-06-30 08:01:40 +02:00
Akim Demaille	330552ea49	yacc.c: push: don't clear the parser state when accepting/rejecting Currently when a push parser finishes its parsing (i.e., it did not return YYPUSH_MORE), it also clears its state. It is therefore impossible to see if it had parse errors. In the context of autocompletion, because error recovery might have fired, the parser is actually already in a different state. For instance on `(1 + + <TAB>` in the bistromathic, because there's a `exp: "(" error ")"` recovery rule, `1 + +` tokens have already been popped, replaced by `error`, and autocompletions think we are ready for the closing ")". So here, we would like to see if there was a syntax error, yet `yynerrs` was cleared. In the case of a successful parse, we still have a problem: if error recovery succeeded, we won't know it, since, again, `yynerrs` is clearer. It seems much more natural to leave the parser state available for analysis when there is a failure. To reuse the parser, we should either: 1. provide an explicit means to reinitialize a parser state for future parses. 2. automatically reset the parser state when it is used in a new parse. Option 2 requires to check whether we need to reinitialize the parser each time we call `yypush_parse`, i.e., each time we give a new token. This seems expensive compared to Option 1, but benchmarks revealed no difference. Option 1 is incompatible with the documentation ("After `yypush_parse` returns a status other than `YYPUSH_MORE`, the parser instance `yyps` may be reused for a new parse."). So Option 2 wins, reusing the private `yynew` member to record that a parse was finished, and therefore that the state must reset in the next call to `yypull_parse`. While at it, this implementation now reuses the previously enlarged stacks from one parse to another. * data/skeletons/yacc.c (yypstate_new): Set up the stacks in their initial configurations (setting their bottom to the stack array), and use yypstate_clear to reset them (moving their top to their bottom). (yypstate_delete): Adjust. (yypush_parse): At the beginning, clear yypstate if needed, and at the end, record when yypstate needs to be clearer. * examples/c/bistromathic/parse.y (expected_tokens): Do not propose autocompletion when there are parse errors. * examples/c/bistromathic/bistromathic.test: Check that case.	2020-06-29 19:36:41 +02:00

1 2 3 4 5 ...

848 Commits