bison

mirror of https://git.savannah.gnu.org/git/bison.git synced 2026-04-24 10:39:38 +00:00

Author	SHA1	Message	Date
Akim Demaille	82133a6103	cex: add support for $TIME_LIMIT * src/counterexample.c (TIME_LIMIT): Replace with... (time_limit): this. (counterexample_init): Check $TIME_LIMIT. * src/scan-gram.l: Reorder includes.	2021-01-14 06:35:09 +01:00
Akim Demaille	430ca0fc63	cex: send traces to stderr, not stdout When comparing traces from different machines, the mixture of stdout/stderr in the output is making things uselessly difficult. * src/lssi.c, src/state-item.c: Output debug traces on stderr.	2021-01-13 08:03:45 +01:00
Akim Demaille	e2199d0fb2	style: YYUSE is private, make it YY_USE This macro is not exposed to users, make it start with 'YY_'. * data/skeletons/bison.m4, data/skeletons/c.m4, data/skeletons/glr.c, * data/skeletons/glr.cc, data/skeletons/glr2.cc, data/skeletons/lalr1.cc, * src/parse-gram.c, tests/actions.at, tests/c++.at, tests/headers.at, * tests/local.at (YYUSE): Rename as... (YY_USE): this.	2021-01-03 19:57:10 +01:00
Akim Demaille	83f2eb3737	glr2.cc: the example requires Bison 3.8 This will save us from generating the position.hh file. * src/parse-gram.y: Claim we are 3.8. * examples/c++/glr/c++-types.yy: Require 3.8.	2020-12-31 08:21:25 +01:00
Akim Demaille	3911aba39a	%merge: associate it to its first definition, not the latest Currently each time we meet %merge we record this location as the defining location (and symbol). Instead, record the first definition. In the generated code we go from yy0->A = merge (yy0, yy1); to yy0->S = merge (yy0, yy1); where S was indeed the first symbol, and in the diagnostics we go from glr-regr18.y:30.18-24: error: result type clash on merge function 'merge': <type2> != <type1> 30 | sym2: sym3 %merge<merge> { $$ = $1; } ; | ^~~~~~~ glr-regr18.y:29.18-24: note: previous declaration 29 | sym1: sym2 %merge<merge> { $$ = $1; } ; | ^~~~~~~ glr-regr18.y:31.13-19: error: result type clash on merge function 'merge': <type3> != <type2> 31 | sym3: %merge<merge> { $$ = 0; } ; | ^~~~~~~ glr-regr18.y:30.18-24: note: previous declaration 30 | sym2: sym3 %merge<merge> { $$ = $1; } ; | ^~~~~~~ to glr-regr18.y:30.18-24: error: result type clash on merge function 'merge': <type2> != <type1> 30 | sym2: sym3 %merge<merge> { $$ = $1; } ; | ^~~~~~~ glr-regr18.y:29.18-24: note: previous declaration 29 | sym1: sym2 %merge<merge> { $$ = $1; } ; | ^~~~~~~ glr-regr18.y:31.13-19: error: result type clash on merge function 'merge': <type3> != <type1> 31 | sym3: %merge<merge> { $$ = 0; } ; | ^~~~~~~ glr-regr18.y:29.18-24: note: previous declaration 29 | sym1: sym2 %merge<merge> { $$ = $1; } ; | ^~~~~~~ where both duplicates are reported against definition 1, rather than using definition 1 as a reference when diagnosing about definition 2, and then 2 as a reference for 3. * src/reader.c (record_merge_function_type): Keep the first definition. * tests/glr-regression.at: Adjust.	2020-12-31 08:07:34 +01:00
Akim Demaille	c09f2e4c7b	%merge: delegate the generation of calls to mergers to m4 Don't generate C code from bison, leave that to the skeletons. * src/output.c (merger_output): Emit invocations to b4_call_merger. * data/skeletons/glr.c, data/skeletons/glr2.cc (b4_call_merger): New.	2020-12-31 08:07:11 +01:00
Akim Demaille	ac3d5b76f7	%merge: let mergers record a typing-symbol, rather than a type Symbols are richer than types, and in M4 it is much simpler (and more common) to deal with symbols rather than types. So let's associate mergers to a symbol rather than a type name. * src/reader.h (merger_list): Replace the 'type' member by a symbol member. * src/reader.c (record_merge_function_type): Take a symbol as argument, rather than a type name. * src/output.c (merger_output): Adjust.	2020-12-31 08:07:11 +01:00
Akim Demaille	c0f3b55b25	style: address syntax-check diagnostics * examples/c/glr/c++-types.y: Formatting changes. * po/POTFILES.in: Add missing files. * src/reader.c: Remove useless include. * tests/calc.at: Avoid magic values for exit. Obfuscate calls to error.	2020-12-21 07:51:02 +01:00
Akim Demaille	03d33fd3a4	skeletons: better comments for some tables And also, remove the incorrect indentation of these comments: - /* YYR2[YYN] -- Number of symbols on the right hand side of rule YYN. */ +/* YYR2[RULE-NUM] -- Number of symbols on the right-hand side of rule RULE-NUM. */ static const yytype_int8 yyr2[] = { 0, 2, 4, 0, 2, 1, 1, 1, 3, 2, I don't remember why this indentation was added (in `0991e29b75`), but it seems wrong, at least for yacc.c. I suspect this was done with lalr1.cc (where this is embedded in the class definition, so it should be indented), but today lalr1.cc uses other routines to output these comments. * data/skeletons/bison.m4 (b4_integral_parser_tables_map): Improve the wording of the comments of some tables. * data/skeletons/c.m4 (b4_integral_parser_table_define): Remove indentation.	2020-12-20 14:54:46 +01:00
Akim Demaille	0e78a9028e	portability: beware of GCC 4.6 src/reader.c: In function 'grammar_start_symbols_add': src/reader.c:67:24: error: declaration of 'dup' shadows a global declaration [-Werror=shadow] * src/reader.c (grammar_start_symbols_add): Rename dup as dupl.	2020-12-03 19:46:20 +01:00
Akim Demaille	24233748ec	tables: avoid warnings and save bits The yydefgoto table uses -1 as an invalid value for an impossible case (we never use yydefgoto[0], since it corresponds to the reduction to $accept, which never happens). Since yydefgoto is a table of state numbers, this -1 forces a signed type uselessly, which (1) might trigger compiler warnings when storing a value from yydefgoto into a state number (nonnegative), and (2) wastes bits which might result in using an int16 where a uint8 suffices. Reported by Jot Dot <jotdot@shaw.ca>. https://lists.gnu.org/r/bug-bison/2020-11/msg00027.html * src/tables.c (default_goto): Use 0 rather than -1 as invalid value. * tests/regression.at: Adjust.	2020-12-03 06:21:53 +01:00
Akim Demaille	5b19f91ccf	multistart: check duplicates * src/symlist.h, src/symlist.c (symbol_list_find_symbol) (symbol_list_last): New. (symbol_list_append): Use symbol_list_last. * src/reader.c (grammar_start_symbols_add): Check and discard duplicates. * tests/input.at (Duplicate %start symbol): New. * tests/reduce.at (Bad start symbols): Add the multistart keyword.	2020-11-30 16:48:03 +01:00
Akim Demaille	7fe9205b9f	style: change the format of a debugging function * src/symlist.c (symbol_list_syms_print): Use braces to make traces easier to read.	2020-11-22 16:01:05 +01:00
Akim Demaille	d798851e48	style: rename grammar_start_symbols_set as grammar_start_symbols_add * src/reader.h, src/reader.c (grammar_start_symbols_set): Rename as... (grammar_start_symbols_add): this. Adjust dependencies.	2020-11-22 11:18:20 +01:00
Akim Demaille	23472033ee	Merge branch 'maint' * maint: c++: shorten the assertions that check whether tokens are correct c++: don't glue functions together lalr1.cc: YY_ASSERT should use api.prefix c++: don't use YY_ASSERT at all if parse.assert is disabled c++: style: follow the Bison m4 quoting pattern yacc.c: provide the Bison version as an integral macro regen style: make conversion of version string to int public %require: accept version numbers with three parts ("3.7.4") yacc.c: fix #definition of YYEMPTY gnulib: update doc: fix incorrect section title doc: minor grammar fixes in counterexamples section	2020-11-13 07:01:19 +01:00
Akim Demaille	21c147b6e5	yacc.c: provide the Bison version as an integral macro Suggested by Balazs Scheidler. https://github.com/akimd/bison/issues/55 * src/muscle-tab.c (muscle_init): Move/rename `b4_version` to/as... * src/output.c (prepare): `b4_version_string`. Also define `b4_version`. * data/skeletons/bison.m4, data/skeletons/c.m4, data/skeletons/d.m4, * data/skeletons/java.m4: Adjust. * doc/bison.texi: Document it.	2020-11-11 09:08:57 +01:00
Akim Demaille	d3c575a6c6	regen	2020-11-11 08:47:23 +01:00
Akim Demaille	d8b49e2b73	style: make conversion of version string to int public * src/parse-gram.y (str_to_version): Rename as/move to... * src/strversion.h, src/strversion.c (strversion_to_int): these new files.	2020-11-11 08:47:23 +01:00
Akim Demaille	14c65a35f0	%require: accept version numbers with three parts ("3.7.4") * src/parse-gram.y (str_to_version): Support three parts. * data/skeletons/location.cc, data/skeletons/stack.hh: Adjust.	2020-11-11 08:47:23 +01:00
Akim Demaille	25ded505a3	ielr: fix incorrect function call * src/ielr.c: s/rule_is_accepting/rule_is_initial/.	2020-11-10 07:15:43 +01:00
Akim Demaille	6c0ba6089a	ielr: more comments and logs * src/ielr.c: More comments. (state_list_print): New.	2020-11-10 07:08:07 +01:00
Akim Demaille	70ee8c77a8	multistart: fix IELR computations IELR needs to rule out the successors of the kernel items of the initial state (`$accept: input • $end`). In the case of multistart, this condition must be expressed differently: the mere item index does not suffice. * src/ielr.c (ielr_item_has_lookahead, ielr_compute_lookaheads): Don't rely on the item index to check whether is_successor_of_initial_item. It is certainly more costly than just checking the item index, but (i) we need to compute the rule anyway, so it's not very much more costly, and (ii) in ielr_item_has_lookahead, this situation is actually impossible, so an optimizing compiler reading the assertions should actually avoid this computation.	2020-11-10 07:08:07 +01:00
Akim Demaille	e9a43ed4ae	ielr: make some conditions about items easier to understand Checking that an item index is > 1 means ruling out `$accept: • input $end` and `$accept: input • $end`. But actually only the latter is possible there, i.e., we're checking whether this item is about a successor of a (kernel) item of the initial state ($accept: input • $end). * src/ielr.c (is_successor_of_initial_item): Use a variable to name this condition.	2020-11-10 07:08:07 +01:00
Akim Demaille	a38d0b9145	multistart: introduce and use rule_is_initial * src/gram.h (rule_is_initial): New. * src/graphviz.c, src/print-xml.c, src/print.c, src/lalr.c: Use it. Some of these occurrences were incorrect (checking whether this is rule 0), and not behaving properly in the case of multistart.	2020-11-10 07:08:03 +01:00
Akim Demaille	4b0cd01fb7	style: comment and formatting changes, and fixes * examples/c/lexcalc/parse.y: Fix option handling. * src/gram.h: Clarify comments. * src/ielr.c: Fix indentation. * src/print.c, src/state.h: More comments.	2020-11-08 13:42:15 +01:00
Akim Demaille	0328cbad64	lalr: add assertions * src/lalr.c: Remove incorrect comment (subsumed anyway by the (correct) one in the header). (set_goto_map): More debug traces. (map_goto): Add an assertion.	2020-11-08 08:25:20 +01:00
Akim Demaille	98691fcd2d	Merge branch 'maint' * upstream/maint: doc: fix typo maint: post-release administrivia version 3.7.3 build: don't link bison against libreadline gnulib: update glr.cc: fix: use symbol_name build: fix a concurrent build issue in examples	2020-10-14 21:12:45 +02:00
Akim Demaille	bc5e4541da	build: don't link bison against libreadline Reported by Paul Smith <psmith@gnu.org>. https://lists.gnu.org/r/bug-bison/2020-10/msg00001.html * src/local.mk (src_bison_LDADD): here.	2020-10-13 06:57:33 +02:00
Akim Demaille	36143b5ecc	report: put the dot after %empty in items When printing items, it is clearer to put the dot after %empty rather than before: 0 $accept: . unit "end of file" 1 unit: . assignments exp - 2 assignments: . %empty + 2 assignments: %empty . 3 | . assignments assignment Also, use the Unicode characters if they are supported. * src/gram.c (item_print): Put the dot after %empty. * tests/conflicts.at, tests/reduce.at, tests/report.at: Adjust.	2020-10-07 06:28:52 +02:00
Akim Demaille	f4d33ff4b4	yacc.c: also count calls to YYERROR in yynerrs * data/skeletons/yacc.c: here.	2020-09-27 11:58:27 +02:00
Akim Demaille	683040b324	multistart: allow tokens as start symbols After all, why not? * src/reader.c (switching_token): Use symbol_id_get. (check_start_symbols): Require that the start symbol is a token only if it's the only one. * examples/c/lexcalc/parse.y: Let NUM be a start symbol.	2020-09-27 09:44:23 +02:00
Akim Demaille	d9cf99b6a5	multistart: use b4_accept instead of action post-processing For each start symbol, generate a parsing function with a richer return value than the usual of yyparse. Reserve a place for the returned semantic value, in order to avoid having to pass a pointer as argument to "return" that value. This also makes the call to the parsing function independent of whether a given start-symbol is typed. For instance, if the grammar file contains: %type <int> expression %start input expression (so "input" is valueless) we get typedef struct { int yystatus; } yyparse_input_t; yyparse_input_t yyparse_input (void); typedef struct { int yyvalue; int yystatus; } yyparse_expression_t; yyparse_expression_t yyparse_expression (void); This commit also changes the implementation of the parser termination: when there are multiple start symbols, it is the initial rules that explicitly YYACCEPT. They do that after having exported the start-symbol's value (if it is typed): switch (yyn) { case 1: /* $accept: YY_EXPRESSION expression $end */ { ((yyvalue).TOK_expression) = (yyvsp[-1].TOK_expression); YYACCEPT; } break; case 2: /* $accept: YY_INPUT input $end */ { YYACCEPT; } break; I have tried several ways to deal with termination, and this is the one that appears the best one to me. It is also the most natural. * src/scan-code.h, src/scan-code.l (obstack_for_actions): New. * src/reader.c (grammar_rule_check_and_complete): Generate the actions of the rules for each start symbol. * data/skeletons/bison.m4 (b4_symbol_slot): New, with safer semantics than type and type_tag. * data/skeletons/yacc.c (b4_accept): New. Generates the body of the action of the start rules. (_b4_declare_sub_yyparse): For each start symbol define a dedicated return type for its parsing function. Adjust the declaration of its parsing function. (_b4_define_sub_yyparse): Adjust the definition of the function. * examples/c/lexcalc/parse.y: Check the case of valueless symbols. * examples/c/lexcalc/lexcalc.test: Check start symbols.	2020-09-27 09:44:18 +02:00
Akim Demaille	a6805bb8d9	multistart: adjust reader checks for generated rules So far we were not checking the generated rule 0 at all. Now there can be several of them. Instead of not checking at all, let's be more selective on the check to run on them. * src/reader.c (grammar_rule_check_and_complete): Don't check for value usage for generated rules, it is ok to have a valued start symbol, in which case it is ok for the generated rule ("accept: start $end {}") to not use $1. (packgram): Call grammar_rule_check_and_complete for all the rules.	2020-09-27 09:23:51 +02:00
Akim Demaille	05d6b54703	multistart: pass the list of start symbols to the backend * src/output.c (start_symbols_output): New. (muscles_output): Use it.	2020-09-27 09:23:51 +02:00
Akim Demaille	85ccc1bab3	multistart: adjust computation of initial core and adjust reports Currently the core of the initial state is limited to the single rule on $accept. * src/lr0.c (generate_states): There may now be several rules on $accept. * src/graphviz.c (conclude_red): Recognize "final" transitions by the fact that we reduce to "$accept". * src/print.c (print_reduction): Likewise. * src/print-xml.c (print_reduction): Likewise.	2020-09-27 09:23:51 +02:00
Akim Demaille	4646be7db4	regen	2020-09-27 09:23:51 +02:00
Akim Demaille	8eaddf326b	multistart: turn start symbols into rules on $accept Now that the parser can read several start symbols, let's process them, and create the corresponding rules. * src/parse-gram.y (grammar_declaration): Accept a list of start symbols. * src/reader.h, src/reader.c (grammar_start_symbol_set): Rename as... (grammar_start_symbols_set): this. * src/reader.h, src/reader.c (start_flag): Replace with... (start_symbols): this. * src/reader.c (grammar_start_symbols_set): Build a list of start symbols. (switching_token, create_start_rules): New. (check_and_convert_grammar): Use them to turn the list of start symbols into a set of rules. * src/reduce.c (nonterminals_reduce): Don't complain about $accept, it's an internal detail. (reduce_grammar): Complain about all the start symbols that don't derive sentences. * src/symtab.c (startsymbol, startsymbol_loc): Remove, replaced by start_symbols. (symbols_pack): Move the check about the start symbols to... * src/symlist.c (check_start_symbols): here. Adjust to multiple start symbols. * tests/reduce.at (Empty Language): Generalize into... (Bad start symbols): this.	2020-09-27 09:23:51 +02:00
Akim Demaille	db68f61595	regen	2020-09-27 09:23:51 +02:00
Akim Demaille	7eca26e87b	parser: expose a list of symbols * src/parse-gram.y (%type): Also use current_class. (symbol_decl.1): Rename as... (symbols.1): this.	2020-09-27 09:23:51 +02:00
Akim Demaille	e50ec28153	reader: get ready to create several initial rules * src/reader.c (create_start_rule): New. Use it.	2020-09-27 09:23:50 +02:00
Akim Demaille	f7f2c99c28	gram: more debugging information * src/gram.c (ritem_print): Show indices in ritem.	2020-09-27 09:23:50 +02:00
Akim Demaille	72946549ed	style: formatting changes * src/scan-code.l: here.	2020-09-20 08:23:28 +02:00
Akim Demaille	bad4fc09a7	style: introduce parse_positional_ref * src/scan-code.l: here.	2020-09-20 08:23:28 +02:00
Akim Demaille	aac79ca103	style: clarify the way state kernels (aka cores) are built Use state_list_append in a more natural way. * src/lr0.c (generate_states): Here.	2020-09-20 08:23:28 +02:00
Akim Demaille	843f99886c	style: reorder and comment * src/reader.h: here.	2020-09-20 08:23:28 +02:00
Akim Demaille	0711dca9d9	add support for --html * bootstrap.conf: We need the "execute" module. * src/files.h, src/files.c (spec_html_file, html_flag): New. * src/getargs.h, src/getargs.c (--html): New. * src/print-xml.h, src/print-xml.c (print_html): New. * src/main.c: Use them. * tests/output.at, tests/report.at: Check --html.	2020-09-19 17:49:03 +02:00
Akim Demaille	f5d4b64909	regen	2020-09-19 17:49:03 +02:00
Akim Demaille	b327f38832	deprecate %defines in favor of %header This is consistent with --defines being deprecated in favor of --header. The directive %defines is also too similar to %define. And %header matches nicely with api.header.name. * src/scan-gram.l (%defines): Deprecate to %header. (%header): Scan it. * src/parse-gram.y (PERCENT_DEFINES): Replace with... (PERCENT_HEADER): this. * data/skeletons/lalr1.java * doc/bison.texi * tests/actions.at, tests/c++.at, tests/calc.at, tests/conflicts.at, * tests/input.at, tests/java.at, tests/local.at, tests/output.at, * tests/synclines.at, tests/types.at: Convert most tests to check %header instead of %defines.	2020-09-19 17:49:03 +02:00
Akim Demaille	75c3746ce2	options: rename --defines as --header The name "defines" is incorrect, the generated file contains far more than just #defines. * src/getargs.h, src/getargs.c (-H, --header): New option. With optional argument, just like --defines, --xml, etc. (defines_flag): Rename as... (header_flag): this. Adjust dependencies. * data/skeletons/bison.m4, data/skeletons/c.m4, data/skeletons/glr.c, * data/skeletons/glr.cc, data/skeletons/glr2.cc, data/skeletons/lalr1.cc, * data/skeletons/yacc.c: Adjust. * examples, doc/bison.texi: Adjust. * tests/headers.at, tests/local.at, tests/output.at: Convert most tests from using --defines to using --header.	2020-09-19 08:31:49 +02:00
Akim Demaille	325ec7d324	cex: always show ε/%empty in counterexamples On a case such as %% exp : empty "a" | "a" empty empty : %empty we used to display warning: shift/reduce conflict on token "a" [-Wcounterexamples] Example: • "a" Shift derivation exp ↳ 2: • "a" empty ↳ 2: ε Example: • "a" Reduce derivation exp ↳ 1: empty "a" ↳ 3: • where the shift derivation shows an item "2: empty → ε", with an explicit "ε", but the reduce derivation shows "3: empty → •", without "ε". For consistency, let's always show ε/%empty in rules with an empty rhs: Reduce derivation exp ↳ 1: empty "a" ↳ 3: ε • * src/derivation.c (derivation_width, derivation_print_tree_impl): Always show ε/%empty in counterexamples. * tests/diagnostics.at: Check that case. * tests/conflicts.at, tests/counterexample.at: Adjust.	2020-09-02 07:31:55 +02:00
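
Commits 14c65a35f0 and d8b49e2b73 above make %require accept three-part version numbers such as "3.7.4" and move the string-to-integer conversion into strversion_to_int. The log does not show that function, so the following is only a minimal sketch of what such a conversion could look like. The name version_to_int, the MAJOR * 10000 + MINOR * 100 + PATCH encoding, and the -1 error value are all assumptions made for illustration, not Bison's actual code.

```c
#include <ctype.h>
#include <stdlib.h>

/* Hypothetical helper, NOT Bison's strversion_to_int.
   Accept "MAJOR.MINOR" or "MAJOR.MINOR.PATCH" and return
   MAJOR * 10000 + MINOR * 100 + PATCH (assumed encoding).
   Return -1 on malformed input or when a part exceeds 99.  */
int
version_to_int (const char *version)
{
  int parts[3] = { 0, 0, 0 };
  int nparts = 0;
  const char *cp = version;

  while (nparts < 3)
    {
      /* Each part must start with a digit (no sign, no blanks).  */
      if (!isdigit ((unsigned char) *cp))
        return -1;
      char *end;
      long value = strtol (cp, &end, 10);
      if (value > 99)
        return -1;
      parts[nparts++] = (int) value;
      cp = end;
      /* A dot announces another part; anything else ends the scan.  */
      if (*cp != '.')
        break;
      ++cp;
    }

  /* Reject trailing garbage and single-part versions such as "3".  */
  if (*cp != '\0' || nparts < 2)
    return -1;
  return parts[0] * 10000 + parts[1] * 100 + parts[2];
}
```

Under these assumptions, "3.7.4" maps to 30704 and "3.8" to 30800, so a two-part and a three-part requirement can be compared as plain integers.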

2886 Commits