bison

mirror of https://git.savannah.gnu.org/git/bison.git synced 2026-07-23 13:50:33 +00:00

Author	SHA1	Message	Date
Akim Demaille	78f72a4516	style: s/lookahead_tokens/lookaheads/g Currently we use both names. Let's stick to the short one. * src/AnnotationList.c, src/conflicts.c, src/counterexample.c, * src/getargs.c, src/getargs.h, src/graphviz.c, src/ielr.c, * src/lalr.c, src/print-graph.c, src/print-xml.c, src/print.c, * src/state-item.c, src/state.c, src/state.h, src/tables.c: s/lookahead_token/lookahead/gi.	2020-07-14 06:48:48 +02:00
Akim Demaille	c04693d651	cex: factor memory allocation * src/counterexample.c (counterexample_report_state): Allocate once per conflicted state, instead of once per r/r conflict.	2020-07-14 06:48:48 +02:00
Akim Demaille	12191911ba	cex: use state_item_number consistently * src/counterexample.c, src/state-item.c: here. (counterexample_report_state): While at it, prefer c2 to j/k, to match c1.	2020-07-14 06:48:48 +02:00
Akim Demaille	d7f27477f4	cex: more consistent memory allocation/copy * src/counterexample.c, src/parse-simulation.c: It is more usual in Bison to use sizeof on expressions than on types, especially for allocation. Let the compiler do it's job instead of calling memcpy ourselves.	2020-07-14 06:48:48 +02:00
Akim Demaille	5bad15d7ea	cex: minor renaming * src/counterexample.c (has_common_prefix): Rename as... (have_common_prefix): this.	2020-07-14 06:48:48 +02:00
Akim Demaille	cd099edf2d	cex: use better type names There are too many gl_list_t in there, it's hard to understand what is going on. Introduce and use more precise types. I sure can be wrong in some places, it's hard to tell without proper tool support. * src/counterexample.c, src/lssi.c, src/lssi.h, src/parse-simulation.c, * src/parse-simulation.h, src/state-item.c, src/state-item.h (si_bfs_node_list, search_state_list, ssb_list, lssi_list) (state_item_list): New.	2020-07-14 06:48:48 +02:00
Akim Demaille	1e12219775	cex: minor style changes * src/counterexample.h, src/derivation.h, src/derivation.c: More comments. Use `out` for FILE*, as elsewhere.	2020-07-14 06:48:48 +02:00
Akim Demaille	ee86ea8839	cex: prefer → to ::= It does not make a lot of sense to use ::= in our counterexamples, that's not something that belongs to the Bison "vocabulary". Using the colon makes sense, but it's too discreet. Let's use the arrow, which we already use in some reports (HTML and Dot). * src/gram.h (print_dot_fallback): Generalize into... (print_fallback): this. (print_arrow): New. * src/derivation.c: Use it. * NEWS, tests/conflicts.at, tests/counterexample.at, * tests/diagnostics.at, tests/report.at: Adjust. * doc/bison.texi: Ditto. Unfortunately the literal `→` is output as `↦`. So we need to use @arrow.	2020-07-11 18:43:46 +02:00
Akim Demaille	a2ad33dca6	style: cex: prefer the array notation Prefer `&foos[i]` to `foos + i` when `foos` is an array. IMHO, it makes the semantics clearer. * src/counterexample.c, src/lssi.c, src/parse-simulation.c, * src/state-item.c: With arrays, prefer the array notation rather than the pointer one.	2020-07-11 18:07:09 +02:00
Akim Demaille	5b2b7b1ffb	style: cex: remove variables that don't make it simpler to read * src/counterexample.c: With arrays, prefer the array notation rather than the pointer one.	2020-07-11 18:07:09 +02:00
Akim Demaille	44ad466a32	reports: let xml reports catch up with --report and --graph The text and Dot reports are expected to be identical when generated directly (--report, --graph) or indirectly (via XML). The xml testsuite had not be run for ages, let it catch up a bit. * src/print-xml.c: Pass the type of the symbols. * data/xslt/xml2text.xsl Catch up with the new layout. Display the symbol types. Use '•', not '.' * tests/local.at: Smash '•' to '.' when matching against the direct text report. * tests/report.at: Adjust XML expectations.	2020-07-11 12:58:44 +02:00
Akim Demaille	2608b0cf12	style: factor complex expressions * src/print-xml.c, src/print.c: Introduce a variable pointing to the current symbol.	2020-07-11 12:58:44 +02:00
Akim Demaille	0820f16ca8	style: update comments * src/reader.c: action_obstack was removed in 2002... * src/parse-gram.y: Better names. * src/scan-code.h: More comments.	2020-07-05 09:59:45 +02:00
Akim Demaille	49f1e5f428	style: update comments in the skeletons * data/skeletons/c++.m4, data/skeletons/glr.c, data/skeletons/lalr1.d, * data/skeletons/lalr1.java, data/skeletons/yacc.c: Be more accurate about yychar and yytoken. Don't name local variables as if they were members.	2020-07-05 09:59:25 +02:00
Akim Demaille	5f95583da7	regen	2020-07-05 08:18:51 +02:00
Akim Demaille	964fb2aa6f	examples: include the generated header * examples/c/bistromathic/parse.y, examples/c/lexcalc/parse.y, * examples/c/reccalc/parse.y: here. Add some comments. * src/parse-gram.y (api_version): Pull out of handle_require. Bump to 3.7.	2020-07-05 08:18:51 +02:00
Akim Demaille	d7f7fcd9c7	dot: also use a dot in the output * src/print-graph.c (print_core): Use a dot instead of a point. * doc/figs/example-reduce.gv, doc/figs/example-reduce.txt, * doc/figs/example-shift.gv, doc/figs/example-shift.txt, * doc/figs/example.gv: Update. * tests/output.at, tests/report.at: Adjust.	2020-07-03 06:51:57 +02:00
Akim Demaille	b91566edd1	regen	2020-06-29 19:10:05 +02:00
Akim Demaille	e0b0a67b86	java: rename package as api.package * data/skeletons/lalr1.java: here. * doc/bison.texi: Update. * src/muscle-tab.c: Ensure backward compat. * tests/java.at: Check it.	2020-06-28 09:49:00 +02:00
Akim Demaille	0e5cbd38b2	style: shift/reduce, not shift-reduce * src/reader.c: here.	2020-06-28 08:33:24 +02:00
Akim Demaille	feb0bb0a59	style: rename endtoken as eoftoken * src/symtab.h, src/symtab.c (endtoken): Rename as... (eoftoken): this. Adjust dependencies.	2020-06-27 17:31:59 +02:00
Akim Demaille	0895858d8e	style: use 'nonterminal' consistently * doc/bison.texi: Formatting changes. * src/gram.h, src/gram.c (nvars): Rename as... (nnterms): this. Adjust dependencies. (section): New. Use it. Replace "non terminal" and "non-terminal" by "nonterminal".	2020-06-27 11:39:32 +02:00
Akim Demaille	eeafc706e8	c++: by default, use const std::string for file names Reported by Martin Blais and Yuriy Solodkyy. https://lists.gnu.org/r/help-bison/2020-05/msg00011.html https://lists.gnu.org/r/bug-bison/2020-06/msg00038.html While at it, modernize filename_type as api.filename.type and document it properly. * data/skeletons/c++.m4 (filename_type): Rename as... (api.filename.type): this. Default to const std::string. * data/skeletons/location.cc (position, location): Expose the filename_type type. Use api.filename.type. * doc/bison.texi (%define Summary): Document api.filename.type. (C++ Location Values): Document position::filename_type. * src/muscle-tab.c (muscle_percent_variable_update): Ensure backward compatibility. * tests/c++.at: Check that using const file names is ok. tests/input.at: Check backward compat.	2020-06-27 10:06:00 +02:00
Akim Demaille	cf6d8d0631	ielr: fix crash on memory management Reported by Dwight Guth. https://lists.gnu.org/r/bug-bison/2020-06/msg00037.html * src/AnnotationList.c (AnnotationList__computePredecessorAnnotations): Beware that SBITSET__FOR_EACH nests _two_ for-loops, so "break" does not actually break out of it. That was the only occurrence in the code. * src/Sbitset.h (SBITSET__FOR_EACH): Warn passersby.	2020-06-27 08:16:07 +02:00
Akim Demaille	8f44164443	style: factor the access to a rule from its items * src/counterexample.c (item_rule): Move to... * src/counterexample.h: here. * src/AnnotationList.c, src/counterexample.c, src/ielr.c: Use it.	2020-06-25 19:36:07 +02:00
Akim Demaille	1001f48416	style: clean up nullable * src/nullable.c: Reduce scopes. Prefer `r` to `rules_ruleno`, which is truly an ugly name.	2020-06-25 19:36:07 +02:00
Akim Demaille	3be228f64c	style: clean up ielr * src/AnnotationList.c, src/ielr.c: Fix include order. Prefer `res` to `result`. Reduce scopes. Be free of the oldish 76 cols limitation when it clutters too much the code. Denest when possible (we're starving for horizontal width).	2020-06-25 19:30:06 +02:00
Akim Demaille	670c7e7a75	don't use strlen to compute visual width * src/output.c (prepare_symbol_names): Use mbswidth.	2020-06-23 08:27:26 +02:00
Akim Demaille	c4b1a2b68f	doc: use dot/'•' rather than point/'.' AFAICT, "dotted rule" is a more frequent synonym of "item" than "pointed rule". So let's migrate to using "dot" only. * doc/bison.texi: Use dot/'•' rather than point/'.'. * src/print-xml.c (print_core): Use dot rather than point. This is not backward compatible, but AFAICT, we don't have actual user of the XML output (but ourselves). So... * data/xslt/xml2dot.xsl, data/xslt/xml2text.xsl, * data/xslt/xml2xhtml.xsl, tests/report.at: ... adjust.	2020-06-23 07:37:29 +02:00
Akim Demaille	b65bd16e45	cex: display all the S/R conflicts, not just one per (state, rule) Before this commit, on %% exp : "if" exp "then" exp \| "if" exp "then" exp "else" exp \| exp "+" exp \| "num" we used to not display the third counterexample below: Shift/reduce conflict on token "+": Example exp "+" exp . "+" exp First derivation exp ::=[ exp ::=[ exp "+" exp . ] "+" exp ] Second derivation exp ::=[ exp "+" exp ::=[ exp . "+" exp ] ] Shift/reduce conflict on token "else": Example "if" exp "then" "if" exp "then" exp . "else" exp First derivation exp ::=[ "if" exp "then" exp ::=[ "if" exp "then" exp . ] "else" exp ] Second derivation exp ::=[ "if" exp "then" exp ::=[ "if" exp "then" exp . "else" exp ] ] Shift/reduce conflict on token "+": Example "if" exp "then" exp . "+" exp First derivation exp ::=[ exp ::=[ "if" exp "then" exp . ] "+" exp ] Second derivation exp ::=[ "if" exp "then" exp ::=[ exp . "+" exp ] ] Shift/reduce conflict on token "+": Example "if" exp "then" exp "else" exp . "+" exp First derivation exp ::=[ exp ::=[ "if" exp "then" exp "else" exp . ] "+" exp ] Second derivation exp ::=[ "if" exp "then" exp "else" exp ::=[ exp . "+" exp ] ] * src/counterexample.c (counterexample_report_state): Don't stop of the first conflicts. * tests/conflicts.at, tests/counterexample.at, tests/diagnostics.at, * tests/report.at: Adjust.	2020-06-23 06:56:04 +02:00
Akim Demaille	0f120354b6	cex: don't display twice unifying examples if there is no color It makes no sense, and is actually confusing, to display twice the same example with no visible difference. * src/complain.h, src/complain.c (is_styled): New. * src/counterexample.c (print_counterexample): Display the unified example a second time only if it makes a difference. * tests/conflicts.at, tests/counterexample.at, tests/report.at: Adjust. * tests/diagnostics.at: Make sure we do display the unifying examples twice when colors are enabled. And check those colors.	2020-06-22 19:33:30 +02:00
Vincent ImbimboandAkim Demaille	69e3b405d9	cex: fix reporting of null nonterminals I implemented this to print A ::= [ ], but A ::= [ %empty ] might be clearer. * src/parse-simulation.c (nullable_closure): Don't generate null nonterminal derivations as leaves. * src/derivation.c (derivation_print_impl): Don't print seperator spaces for null nonterminal. * tests/counterexample.at: Update test results.	2020-06-22 07:11:31 +02:00
Akim Demaille	9e75066819	cex: style changes * src/counterexample.c: Simplify a bit. * src/parse-simulation.c, src/parse-simulation.h: Enforce coding style.	2020-06-19 08:02:18 +02:00
Akim Demaille	e077bf1ebc	cex: don't assume the terminal supports "•" Use of print_unicode_char suggested by Bruno Haible. https://lists.gnu.org/r/bug-gettext/2020-06/msg00012.html * src/gram.h (print_dot_fallback, print_dot): New. * src/gram.c, src/derivation.c: Use it. * tests/counterexample.at, tests/report.at: Adjust the test suite. * .travis.yml, README-hacking.md: Adjust.	2020-06-16 07:58:40 +02:00
Akim Demaille	c35e829a76	cex: also include in the report on --report=counterexamples And let --report=all include the counterexamples. * src/getargs.h, src/getargs.c (report_cex): New. * src/main.c: Compute counterexamples when -rcex is specified. * src/print.c: Include the counterexamples when -rcex is specified. * tests/conflicts.at, tests/existing.at, tests/local.at: Adjust.	2020-06-16 07:30:46 +02:00
Akim Demaille	d4f854e5b2	cex: also include the counterexamples in the report The report is the best place to show the details about counterexamples, since we have the state right under the nose. For instance: State 7 1 exp: exp . "⊕" exp 2 \| exp . "+" exp 2 \| exp "+" exp . [$end, "+", "⊕"] 3 \| exp . "+" exp 3 \| exp "+" exp . [$end, "+", "⊕"] "⊕" shift, and go to state 6 $end reduce using rule 2 (exp) $end [reduce using rule 3 (exp)] "+" reduce using rule 2 (exp) "+" [reduce using rule 3 (exp)] "⊕" [reduce using rule 2 (exp)] "⊕" [reduce using rule 3 (exp)] $default reduce using rule 2 (exp) Conflict between rule 2 and token "+" resolved as reduce (%left "+"). Shift/reduce conflict on token "⊕": 2 exp: exp "+" exp . 1 exp: exp . "⊕" exp Example exp "+" exp • "⊕" exp First derivation exp ::=[ exp ::=[ exp "+" exp • ] "⊕" exp ] Example exp "+" exp • "⊕" exp Second derivation exp ::=[ exp "+" exp ::=[ exp • "⊕" exp ] ] Reduce/reduce conflict on tokens $end, "+", "⊕": 2 exp: exp "+" exp . 3 exp: exp "+" exp . Example exp "+" exp • First derivation exp ::=[ exp "+" exp • ] Example exp "+" exp • Second derivation exp ::=[ exp "+" exp • ] Shift/reduce conflict on token "⊕": 3 exp: exp "+" exp . 1 exp: exp . "⊕" exp Example exp "+" exp • "⊕" exp First derivation exp ::=[ exp ::=[ exp "+" exp • ] "⊕" exp ] Example exp "+" exp • "⊕" exp Second derivation exp ::=[ exp "+" exp ::=[ exp • "⊕" exp ] ] * src/conflicts.h, src/conflicts.c (has_conflicts): New. * src/counterexample.h, src/counterexample.c (print_counterexample): Add a `prefix` argument. (counterexample_report_shift_reduce) (counterexample_report_reduce_reduce): Show the items when there's a prefix. * src/state-item.h, src/state-item.c (print_state_item): Add a `prefix` argument. * src/derivation.h, src/derivation.c (derivation_print) (derivation_print_leaves): Add a prefix argument. * src/print.c (print_state): When -Wcex is enabled, show the conflicts. * tests/report.at: Adjust.	2020-06-16 07:30:26 +02:00
Akim Demaille	35c0fe6789	cex: indent the diagnostics to highlight the structure Instead of Shift/reduce conflict on token D: Example A a • D First derivation s ::=[ A a a ::=[ b ::=[ c ::=[ • ] ] ] d ::=[ D ] ] Example A a • D Second derivation s ::=[ A a d ::=[ • D ] ] display Shift/reduce conflict on token D: Example A a • D First derivation s ::=[ A a a ::=[ b ::=[ c ::=[ • ] ] ] d ::=[ D ] ] Example A a • D Second derivation s ::=[ A a d ::=[ • D ] ] * src/counterexample.c (print_counterexample): Indent. * tests/counterexample.at: Adjust.	2020-06-16 07:29:46 +02:00
Akim Demaille	22f62414f9	cex: don't report the items Showing the items (with the state numbers) is really something we should restrict to the report. * src/counterexample.c (counterexample_report_shift_reduce) (counterexample_report_reduce_reduce): Don't show the pointed rules, we will do that in the report. * tests/counterexample.at: Adjust.	2020-06-16 07:29:46 +02:00
Akim Demaille	9206b15c4e	cex: make sure traces go to stderr * src/parse-simulation.h, src/parse-simulation.c (print_parse_state): here.	2020-06-16 07:29:46 +02:00
Akim Demaille	5edac5e15a	cex: add an argument to the reporting functions to specify the stream * src/conflicts.c (find_state_item_number, report_state_counterexamples): Move to... * src/counterexample.h, src/counterexample.c (find_state_item_number) (counterexample_report_state): this. Add support for `out` as an argument. (counterexample_report_reduce_reduce, counterexample_report_shift_reduce): Accept an `out` argument, and be static.	2020-06-16 07:29:46 +02:00
Akim Demaille	1c3189734c	style: more uses of const * src/print.c, src/state.h, src/state.c: here.	2020-06-16 07:29:46 +02:00
Akim Demaille	251e1b137f	reports: the column width differs from the byte count From "number" shift, and go to state 1 "Ñùṃéℝô" shift, and go to state 2 to "number" shift, and go to state 1 "Ñùṃéℝô" shift, and go to state 2 * src/print.c: Use mbswidth, not strlen, to compute visual columns. * tests/report.at: Adjust.	2020-06-13 17:21:51 +02:00
Akim Demaille	efbcadeca7	reports: don't escape the labels Currently we use "quotearg" to escape the strings output in Dot. As a result, if the user's locale is C for instance, all the non-ASCII are escaped. Unfortunately graphviz does not interpret this style of escaping. For instance: 5 -> 2 [style=solid label="\"\303\221\303\271\341\271\203\303\251\342\204\235\303\264\""] was displayed as a sequence of numbers. We now output: 5 -> 2 [style=solid label="\"Ñùṃéℝô\""] independently of the user's locale. * src/system.h (obstack_backslash): New. * src/graphviz.h, src/graphviz.c (escape): Remove, use obstack_backslash instead. * src/print-graph.c: Likewise. * tests/report.at: Adjust.	2020-06-13 16:58:13 +02:00
Akim Demaille	e4d33cf579	regen	2020-06-13 16:58:03 +02:00
Akim Demaille	5855da4722	parser: keep string aliases as the user wrote it Currently our scanner decodes all the escapes in the strings, and we later reescape the strings when we emit them. This is troublesome, as we do not respect the user input. For instance, when the user writes in UTF-8, we destroy her string when we write it back. And this shows everywhere: in the reports we show the escaped string instead of the actual alias: 0 $accept: . exp $end 1 exp: . exp "\342\212\225" exp 2 \| . exp "+" exp 3 \| . exp "+" exp 4 \| . "number" 5 \| . "\303\221\303\271\341\271\203\303\251\342\204\235\303\264" "number" shift, and go to state 1 "\303\221\303\271\341\271\203\303\251\342\204\235\303\264" shift, and go to state 2 This commit preserves the user's exact spelling of the string aliases, instead of interpreting the escapes and then reescaping. The report now shows: 0 $accept: . exp $end 1 exp: . exp "⊕" exp 2 \| . exp "+" exp 3 \| . exp "+" exp 4 \| . "number" 5 \| . "Ñùṃéℝô" "number" shift, and go to state 1 "Ñùṃéℝô" shift, and go to state 2 Likewise, the XML (and therefore HTML) outputs are fixed. * src/scan-gram.l (STRING, TSTRING): Do not interpret the escapes in the resulting string. * src/parse-gram.y (unquote, parser_init, parser_free, unquote_free) (handle_defines, handle_language, obstack_for_unquote): New. Use them to unquote where needed. * tests/regression.at, tests/report.at: Update.	2020-06-13 16:56:40 +02:00
Akim Demaille	cef13e11f5	style: factor common bits about string scanning * src/scan-gram.l: here.	2020-06-13 15:20:56 +02:00
Akim Demaille	b7fbfd050e	style: introduce & use STRING_1GROW * src/flex-scanner.h (STRING_1GROW): New. * src/scan-gram.l, src/scan-skel.l: Use it.	2020-06-13 15:20:56 +02:00
Akim Demaille	e088b4f90f	style: reduce scopes * src/scan-gram.l (STRING_GROW_ESCAPE): Move the static_assert about type sizes here.	2020-06-13 15:20:56 +02:00
Akim Demaille	c857ed4f72	style: prefer 'FOO ()' to 'FOO' for function-like macros * src/flex-scanner.h (STRING_GROW, STRING_FINISH, STRING_FREE): Make them function-like macros. Adjust dependencies.	2020-06-13 15:20:56 +02:00
Akim Demaille	1998606a90	regen	2020-06-11 19:11:33 +02:00

... 2 3 4 5 6 ...